I believe that a hacker/script-kiddie evolves into a software engineer at the point when they realize that they need to write tests for their own benefit, not because someone else forced them to hit 100% test coverage. That realization came four years after I wrote my first professional code, but that’s a story for another day. Today, anything more than a pure exploratory script that I write will have some sort of automated testing associated with it.
Today I want to take a break from all the AI craziness and spend a little bit of time explaining a small technique that works across any testing framework and any MVCC-capable database - using database transactions to achieve perfect test isolation without teardown boilerplate.
Using Data in Tests
Any reasonably sized product and system will use a database to store/access data. At a really small scale, it can even be something as simple as SQLite/Files/Memory, but the moment you hit production, at any scale, you will want to use a real database.
Whether you’re using pytest, unittest, Jest, RSpec, or any other testing framework, the pattern for database tests is always the same:
- Add data into the DB
- Verify that the functionality works
For example, let’s say we have a User model in our application and want to test user permissions. Here’s how it might look in Python with PostgreSQL (but the same concept applies to any language and database):
```python
def test_user_permissions():
    # Create a user and role directly in the database (setup)
    conn = get_db_connection()
    cursor = conn.cursor()
    cursor.execute(
        "INSERT INTO users (username, email) VALUES (%s, %s) RETURNING id",
        ("testuser", "[email protected]")
    )
    user_id = cursor.fetchone()[0]
    cursor.execute("INSERT INTO roles (name) VALUES (%s) RETURNING id", ("admin",))
    role_id = cursor.fetchone()[0]
    cursor.execute(
        "INSERT INTO user_roles (user_id, role_id) VALUES (%s, %s)",
        (user_id, role_id)
    )
    conn.commit()

    # Test permission functionality
    has_admin = check_user_permission(user_id, "admin_access")
    assert has_admin
```
The above setup works fine, and if there is an issue with the application logic in `check_user_permission`, the test should catch it. However, the code above, as is, can cause insidious bugs regardless of which testing framework you’re using.
The Test Isolation Problem
The reason is that once there are multiple tests, they could secretly affect each other. If you create data in test A, it could still be hanging around when test B runs. This causes flaky tests that pass sometimes and fail other times depending on the order they run in. There is already lots of material available online on test isolation, so I am not going to go into that here.
The traditional solution to the problem is setup and teardown. Every testing framework has some version of this - pytest fixtures, unittest setUp/tearDown, Jest beforeEach/afterEach, etc. Before each test, you set up the data you need. After each test, you clean up everything you created.
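As a framework-agnostic sketch of what those fixtures do under the hood (using in-memory SQLite so it runs standalone, and a hypothetical `admin_user` helper), the setup/teardown pair is usually factored into a reusable helper:

```python
import sqlite3
from contextlib import contextmanager

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, username TEXT)")

@contextmanager
def admin_user():
    # Setup: create the row the test needs
    cur = conn.execute("INSERT INTO users (username) VALUES ('testuser')")
    user_id = cur.lastrowid
    try:
        yield user_id
    finally:
        # Teardown: runs even if the test body raises
        conn.execute("DELETE FROM users WHERE id = ?", (user_id,))
        conn.commit()

with admin_user() as user_id:
    row_count = conn.execute("SELECT COUNT(*) FROM users").fetchone()[0]  # 1 inside the test

rows_after = conn.execute("SELECT COUNT(*) FROM users").fetchone()[0]  # 0 again after teardown
```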
In this example, we do setup and teardown directly in the test function:
```python
def test_user_permissions():
    conn = get_db_connection()
    cursor = conn.cursor()

    # Setup - create test data
    cursor.execute(
        "INSERT INTO users (username, email) VALUES (%s, %s) RETURNING id",
        ("testuser", "[email protected]")
    )
    user_id = cursor.fetchone()[0]
    cursor.execute("INSERT INTO roles (name) VALUES (%s) RETURNING id", ("admin",))
    role_id = cursor.fetchone()[0]
    cursor.execute(
        "INSERT INTO user_roles (user_id, role_id) VALUES (%s, %s)",
        (user_id, role_id)
    )
    conn.commit()

    # Test - actual functionality we care about
    has_admin = check_user_permission(user_id, "admin_access")
    has_super = check_user_permission(user_id, "super_admin_access")
    assert has_admin
    assert not has_super

    # Teardown - clean up everything we created
    cursor.execute("DELETE FROM user_roles WHERE user_id = %s", (user_id,))
    cursor.execute("DELETE FROM users WHERE id = %s", (user_id,))
    cursor.execute("DELETE FROM roles WHERE id = %s", (role_id,))
    conn.commit()
```
This fixes the problem to an extent, but introduces new issues:
- We now have a lot of boilerplate code that has nothing to do with what we’re actually testing.
- A test that stops midway (an exception in the code, or even a power failure) skips the teardown and leaves stale data behind, which causes flakiness when the tests run again.
MVCC
Many modern databases have this brilliant feature called Multi-Version Concurrency Control (MVCC). PostgreSQL, SQL Server, Oracle, and many others support it. Without getting too deep into the database internals, it can be summarized as: MVCC databases can handle multiple versions of the same data simultaneously.
When you start a transaction, the database creates a snapshot of the data at that moment. Any changes you make within that transaction are visible only on that connection; other connections see the original data. If you roll back the transaction, it’s as if your changes never happened - the database simply discards your version and keeps the original. What is beautiful is that even if you never roll back manually and the transaction gets cancelled for any reason, everything happens within a “sandbox” and other queries are not affected by these changes.
This is what makes MVCC databases so good at handling concurrent queries without locks everywhere. But it also gives us a superpower for testing.
MVCC + Tests
The idea is simple: what if we run each test inside its own transaction? Any data we create during the test automatically becomes invisible to other tests. We don’t even need to worry about teardown!
We can update our code like this:
```python
# conftest.py
import pytest
import psycopg2
from typing import Any, Generator

@pytest.fixture(scope="function")
def db_connection() -> Generator[Any, Any, None]:
    # Create a new connection for each test
    conn = psycopg2.connect("your_test_db_connection_string")
    conn.autocommit = False  # Queries run inside a transaction
    try:
        yield conn
    finally:
        conn.rollback()
        conn.close()


# test_something.py
def test_user_permissions(db_connection):
    cursor = db_connection.cursor()

    # Create test data - same as before, but now within the transaction
    cursor.execute(
        "INSERT INTO users (username, email) VALUES (%s, %s) RETURNING id",
        ("testuser", "[email protected]")
    )
    user_id = cursor.fetchone()[0]
    cursor.execute("INSERT INTO roles (name) VALUES (%s) RETURNING id", ("admin",))
    role_id = cursor.fetchone()[0]
    cursor.execute(
        "INSERT INTO user_roles (user_id, role_id) VALUES (%s, %s)",
        (user_id, role_id)
    )

    # Test permission functionality - same as before
    has_admin = check_user_permission(user_id, "admin_access")
    has_super = check_user_permission(user_id, "super_admin_access")
    assert has_admin
    assert not has_super
    # No teardown needed! MVCC handles isolation automatically
```
This seems like a very small change, but the benefits can add up quickly since we are no longer worried about test isolation.
The connection object is best managed at the framework level (pytest’s `conftest.py`, Jest’s setup files, etc.) with appropriate scoping and shared by all tests. Note that the code under test must use the same connection as the test itself - data inside an uncommitted transaction is visible only on that connection.
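One way to wire this up - shown here as a sketch with SQLite and a hypothetical `check_user_permission` implementation - is to have the code under test accept the connection as a parameter, so it participates in the test’s transaction:

```python
import sqlite3

# Hypothetical application code: the connection is passed in, so the caller
# decides which transaction it runs inside.
def check_user_permission(conn, user_id, role_name):
    row = conn.execute(
        "SELECT 1 FROM user_roles ur JOIN roles r ON r.id = ur.role_id "
        "WHERE ur.user_id = ? AND r.name = ?",
        (user_id, role_name),
    ).fetchone()
    return row is not None

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE users (id INTEGER PRIMARY KEY, username TEXT);
    CREATE TABLE roles (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE user_roles (user_id INTEGER, role_id INTEGER);
""")

# Test body: every insert stays inside the never-committed transaction
user_id = conn.execute("INSERT INTO users (username) VALUES ('testuser')").lastrowid
role_id = conn.execute("INSERT INTO roles (name) VALUES ('admin')").lastrowid
conn.execute("INSERT INTO user_roles VALUES (?, ?)", (user_id, role_id))

has_admin = check_user_permission(conn, user_id, "admin")        # True
has_super = check_user_permission(conn, user_id, "super_admin")  # False

conn.rollback()  # all three rows vanish; nothing to tear down
users_left = conn.execute("SELECT COUNT(*) FROM users").fetchone()[0]  # 0
```

Passing the connection explicitly is only one option; the same effect can come from a connection pool or an ORM session that the test fixture controls.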
This pattern has saved me countless hours of writing teardown code and hunting down flaky tests.
One thing to note: this works great for database logic, but if your tests interact with external services, touch the filesystem, or run end-to-end, you’ll still need traditional teardown for those parts. For the vast majority of backend tests this shouldn’t be a problem, since you’re mostly testing application logic and using the DB to store/retrieve data.
Give it a try in your next project, regardless of which testing framework you’re using, and let me know how it goes!
You can use the same MVCC concepts within an API to keep it idempotent as well as handle failures cleanly! I’ll explain this in another post.
Thanks to acrarshin and RainingComputers for giving pointers to improve this post!
If you found this useful, please consider sharing it on Hacker News!