What does ACID mean, and why does each property matter?

**Atomicity** — a transaction's operations all succeed or all roll back, never partially apply. **Consistency** — a transaction moves the database from one valid state to another, never violating constraints. **Isolation** — concurrent transactions don't see each other's uncommitted intermediate state. **Durability** — once committed, data survives crashes/power loss. Together they let application code treat a multi-statement operation as a single, safe unit even under concurrency and failure.

Explain the four SQL standard isolation levels and the anomalies they prevent

From weakest to strongest: **Read Uncommitted** allows dirty reads (seeing another transaction's uncommitted changes). **Read Committed** prevents dirty reads but allows non-repeatable reads (a row you read twice can change between reads). **Repeatable Read** prevents non-repeatable reads, but the standard still permits phantom reads at this level (a re-run query can return rows that were not there before). **Serializable** prevents all of these by making concurrent transactions behave as if they ran one at a time. Higher isolation means stronger correctness guarantees, at the cost of more blocking, more aborts, and lower concurrency.

What is MVCC (multi-version concurrency control), and how does it let readers avoid blocking writers?

MVCC keeps multiple versions of a row at the same time: when a row is updated, the engine creates a new version rather than overwriting the old one in place, and each transaction sees a consistent snapshot of the data as of some point in time. Readers therefore never wait for writers (and writers never wait for readers), because a reader simply looks at the version of each row that was committed as of its snapshot and ignores any version that is uncommitted or was committed later.

Optimistic vs pessimistic concurrency control — what are the tradeoffs?

**Pessimistic** concurrency acquires a lock upfront before reading/modifying a row, blocking other transactions from touching it until release — safe by construction, but reduces concurrency and risks deadlocks/long waits. **Optimistic** concurrency assumes conflicts are rare: it reads without locking, then checks at write time (typically via a version/timestamp column) whether the row changed since it was read, retrying or failing if it did. Optimistic control scales better under low contention; pessimistic is safer and simpler under high contention.

What is a deadlock, and how do databases detect and resolve them?

A deadlock occurs when two (or more) transactions each hold a lock the other needs, so neither can proceed — a circular wait. Databases detect this by building a wait-for graph and looking for cycles (or by using a timeout), then resolve it by picking one transaction as the "victim," forcibly rolling it back and returning an error, letting the other(s) proceed. Applications must be prepared to catch a deadlock error and retry the aborted transaction.

What's the difference between row-level, table-level, and page-level locking?

**Row-level** locking locks only the specific rows a transaction touches, maximizing concurrency but with more overhead to track many small locks. **Table-level** locking locks the entire table, which is simple and low-overhead but blocks unrelated concurrent access to any row in it. **Page-level** locking is a middle ground, locking a whole disk page (which holds multiple rows). Most modern OLTP engines default to row-level locking for regular DML and take table-level locks only for schema changes or explicit bulk operations.

What's the difference between a shared lock and an exclusive lock?

A **shared (read) lock** allows multiple transactions to hold it simultaneously on the same resource — any number of readers can proceed concurrently, but no one can acquire an exclusive lock while any shared lock is held. An **exclusive (write) lock** allows only one transaction to hold it at a time, and blocks both other exclusive locks and other shared locks — while one transaction holds it, no one else can read (under lock-based, non-MVCC read semantics) or write that resource.

What is a phantom read, and which isolation level prevents it?

A phantom read happens when a transaction runs the same range-based query twice and gets a different set of rows the second time, because another transaction inserted or deleted rows matching that range and committed in between. The "phantom" rows were not there on the first read and then were. Per the SQL standard, only `Serializable` isolation is guaranteed to prevent phantom reads; some engines (notably PostgreSQL, via snapshot isolation) also prevent them at Repeatable Read, which is stricter than the standard requires.

What is two-phase commit, and when is it needed?

Two-phase commit (2PC) is a protocol for atomically committing a transaction that spans multiple independent databases/resources: a coordinator first asks every participant to "prepare" (durably promise it can commit), and only after all participants confirm does it tell everyone to actually commit. It's needed for distributed transactions across separate database instances or heterogeneous systems (e.g., a database and a message queue) where a single engine's normal atomicity guarantee doesn't span both.

How would you prevent double-booking or a lost-update race condition when two transactions modify the same row?

The core issue is a "read, check, then write" sequence where two transactions can both read the same starting state before either writes, so both proceed as if their check passed. Fix it with either pessimistic locking (`SELECT ... FOR UPDATE` before checking availability) or an atomic conditional update (`UPDATE ... WHERE available = true` and checking the affected row count) so the check and the write happen as one indivisible database operation rather than two separate round-trips.

Transactions and Concurrency Control

ACID guarantees, isolation levels, locking, MVCC, and handling concurrent access safely.

ACID describes the guarantees a database transaction makes, letting you reason about a group of statements as a single, safe operation even in the presence of concurrent access and crashes.

Atomicity — all or nothing

BEGIN;
UPDATE accounts SET balance = balance - 100 WHERE id = 1;  -- debit
UPDATE accounts SET balance = balance + 100 WHERE id = 2;  -- credit
COMMIT;

If the second UPDATE fails (constraint violation, crash, connection drop) before COMMIT, the first UPDATE must also be undone — you should never end up with money debited from account 1 but not credited to account 2. Without atomicity, every multi-step write needs manual, error-prone compensating logic in application code.
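
The same all-or-nothing behaviour can be sketched with Python's built-in `sqlite3` module, used here only so the example is self-contained; it stands in for a server database. The table layout, the starting balances, and the helper names `make_bank`, `transfer`, and `balances` are all assumptions for illustration. The credit is deliberately issued first, so that a later constraint failure on the debit has to undo a statement that already succeeded:

```python
import sqlite3

def make_bank():
    """Create a hypothetical two-account bank in an in-memory database."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE accounts ("
        " id INTEGER PRIMARY KEY,"
        " balance INTEGER NOT NULL CHECK (balance >= 0))"
    )
    conn.executemany("INSERT INTO accounts VALUES (?, ?)", [(1, 100), (2, 50)])
    conn.commit()
    return conn

def transfer(conn, src, dst, amount):
    """Move money between accounts; return True if the transfer committed."""
    try:
        with conn:  # commits if the block succeeds, rolls back if it raises
            conn.execute(
                "UPDATE accounts SET balance = balance + ? WHERE id = ?",
                (amount, dst),
            )
            conn.execute(
                "UPDATE accounts SET balance = balance - ? WHERE id = ?",
                (amount, src),
            )
        return True
    except sqlite3.IntegrityError:
        # The CHECK constraint rejected the debit; the credit was rolled back too.
        return False

def balances(conn):
    """Return {account id: balance} for every account."""
    return dict(conn.execute("SELECT id, balance FROM accounts ORDER BY id"))
```

Calling `transfer(conn, 1, 2, 500)` on the starting data fails on the debit, and `balances(conn)` afterwards still shows the original amounts, because the earlier credit was undone along with it.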

Consistency — valid state to valid state

The transaction must leave the database satisfying all defined constraints (NOT NULL, CHECK, foreign keys, unique constraints). If a CHECK (balance >= 0) constraint exists, no committed transaction can ever leave a negative balance. Engines differ on when each constraint is checked: CHECK and NOT NULL are typically enforced as each statement runs, while constraints declared deferrable (foreign keys or unique constraints in PostgreSQL, for example) may be violated temporarily mid-transaction as long as they hold at commit time. Note: this is the least precisely defined of the four letters, and is partly just "atomicity + isolation + valid constraints together imply the DB stays consistent," rather than a fully independent mechanism.

Isolation — concurrent transactions don't interfere

-- Transaction A                          -- Transaction B
BEGIN;
UPDATE accounts SET balance = 500
  WHERE id = 1;
                                           BEGIN;
                                           SELECT balance FROM accounts
                                             WHERE id = 1;  -- should NOT see 500 yet
                                           -- (depending on isolation level)
COMMIT;

Isolation determines exactly what "shouldn't see yet" means in practice. This is where the four standard isolation levels (Read Uncommitted, Read Committed, Repeatable Read, Serializable) come in, each allowing or preventing different classes of interference (see the isolation-levels question).

Durability — survives a crash

COMMIT;
-- Power fails immediately after this returns successfully to the client.
-- On restart, the committed data MUST still be there.

Achieved via a write-ahead log (WAL) — the engine writes a durable log record of the change before acknowledging the commit, so it can replay the log to recover committed-but-not-yet-flushed-to-disk data after a crash (see the WAL question in the scaling/HA topic).
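
The log-then-acknowledge idea can be illustrated with a toy key-value log in Python. This is a sketch of the principle only, not how any particular engine implements its WAL: real logs add checksums, torn-write handling, and checkpoints. The function names and the JSON-lines record format are assumptions made for the example.

```python
import json
import os
import tempfile

def log_commit(log_path, key, value):
    """Append one change to the log and force it to disk before returning."""
    with open(log_path, "a", encoding="utf-8") as log:
        log.write(json.dumps({"key": key, "value": value}) + "\n")
        log.flush()             # push Python's buffer to the operating system
        os.fsync(log.fileno())  # ask the OS to push it to stable storage

def recover(log_path):
    """Rebuild the state after a restart by replaying every logged change in order."""
    state = {}
    if os.path.exists(log_path):
        with open(log_path, encoding="utf-8") as log:
            for line in log:
                record = json.loads(line)
                state[record["key"]] = record["value"]
    return state

def demo():
    """Log three commits, then 'crash' and recover from the log alone."""
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "wal.log")
        log_commit(path, "balance:1", 400)
        log_commit(path, "balance:2", 600)
        log_commit(path, "balance:1", 350)
        # Simulated crash: no in-memory state survives, only the log file.
        return recover(path)
```

The important ordering is inside `log_commit`: the function returns (the "acknowledgement") only after `os.fsync`, so anything the caller was told is committed can be reconstructed by `recover`.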

Why this matters for interviews

ACID isn't just trivia — it's the contract that lets you write BEGIN ... COMMIT blocks around multi-step business logic (like a funds transfer) and trust the database to handle failure and concurrency correctly, instead of hand-rolling that safety in application code. Being able to explain a concrete failure each property prevents (not just recite the acronym) is what distinguishes a strong answer.

Related Resources

PostgreSQL: Transaction Isolation

The three anomalies

| Anomaly | Description |
| --- | --- |
| Dirty read | Transaction A reads data that Transaction B has written but not yet committed. If B rolls back, A read data that "never really happened." |
| Non-repeatable read | Transaction A reads a row, Transaction B commits an update to that same row, A reads it again and gets a different value within the same transaction. |
| Phantom read | Transaction A runs a query with a `WHERE` filter, Transaction B commits a new row matching that filter, A re-runs the same query and sees an extra row that wasn't there before. |

The four levels

| Level | Dirty read | Non-repeatable read | Phantom read |
| --- | --- | --- | --- |
| Read Uncommitted | Possible | Possible | Possible |
| Read Committed | Prevented | Possible | Possible |
| Repeatable Read | Prevented | Prevented | Possible (per SQL standard) |
| Serializable | Prevented | Prevented | Prevented |

BEGIN TRANSACTION ISOLATION LEVEL REPEATABLE READ;
SELECT balance FROM accounts WHERE id = 1;   -- e.g., 500
-- (another transaction commits a change to this row)
SELECT balance FROM accounts WHERE id = 1;   -- still 500, guaranteed, under REPEATABLE READ
COMMIT;

Engine-specific reality checks

Read Uncommitted is rarely used in practice. PostgreSQL accepts the setting but treats it as Read Committed, so dirty reads never occur there; engines such as SQL Server and MySQL/InnoDB do implement it.
PostgreSQL's Repeatable Read actually prevents phantom reads too (stricter than the SQL standard requires for this level), because it's implemented via snapshot isolation rather than row-level locking — worth knowing that "the same isolation level name" doesn't guarantee identical behavior across engines.
MySQL/InnoDB's default is Repeatable Read, while PostgreSQL's and SQL Server's default is Read Committed — a common source of subtly different application behavior when porting code between engines without adjusting isolation level assumptions.

The tradeoff

Higher isolation levels give stronger guarantees but at a real cost: more locking/blocking, more transaction aborts due to serialization conflicts (an application must be prepared to retry a transaction that fails at Serializable), and lower overall throughput under contention. Read Committed is the practical default for most OLTP workloads because it prevents the most dangerous anomaly (dirty reads) cheaply; Serializable is reserved for logic that's genuinely sensitive to subtle concurrency bugs (e.g., enforcing an invariant across multiple rows, like "total allocated seats can never exceed capacity").

Don't reach for Serializable by default "to be safe" — it can meaningfully hurt throughput and requires retry logic for serialization failures. Instead, identify which specific anomaly your business logic is actually vulnerable to, and choose the lowest isolation level that prevents it (often supplemented with an explicit row lock via SELECT ... FOR UPDATE rather than raising the whole transaction's isolation level).
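
As a small illustration of "choose the lowest level that prevents the anomaly", the standard's table can be encoded as data and searched from weakest to strongest. This reflects only the SQL standard's minimum guarantees, not any specific engine's behaviour, and the function and constant names are made up for the example.

```python
# Standard isolation levels, weakest first.
LEVELS = ["READ UNCOMMITTED", "READ COMMITTED", "REPEATABLE READ", "SERIALIZABLE"]

# Anomalies each level is required to prevent, per the SQL standard's minimum bar.
PREVENTS = {
    "READ UNCOMMITTED": set(),
    "READ COMMITTED": {"dirty read"},
    "REPEATABLE READ": {"dirty read", "non-repeatable read"},
    "SERIALIZABLE": {"dirty read", "non-repeatable read", "phantom read"},
}

def lowest_level_preventing(anomalies):
    """Return the weakest standard level that prevents every listed anomaly."""
    needed = set(anomalies)
    for level in LEVELS:
        if needed <= PREVENTS[level]:
            return level
    raise ValueError(f"no standard level is defined to prevent: {sorted(needed)}")
```

For example, `lowest_level_preventing({"non-repeatable read"})` gives `"REPEATABLE READ"`, while asking about phantom reads gives `"SERIALIZABLE"` because that is the only level the standard guarantees for them.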

Related Resources

PostgreSQL: Transaction Isolation

The problem MVCC solves

A simpler concurrency model — pure locking, no versioning — would require every reader to acquire a shared lock and every writer to acquire an exclusive lock, meaning a long-running read blocks writers, and a write blocks all readers until it commits. That's simple but kills concurrency for read-heavy workloads.

How MVCC works

Instead of updating a row in place, an UPDATE (conceptually) creates a new version of the row, tagged with the transaction ID that created it, while the old version stays around (marked as superseded, but not yet physically removed) as long as any active transaction might still need to see it.

Row (id=1), before update:  [ version 1: balance=500, created_by=txn_10, valid_until=(open) ]
After UPDATE by txn_15:     [ version 1: balance=500, created_by=txn_10, valid_until=txn_15 ]
                            [ version 2: balance=400, created_by=txn_15, valid_until=(open) ]

Each transaction operates against a snapshot — effectively "the set of row versions committed as of the moment my transaction/statement started." A reader's SELECT simply picks the version of each row that was valid as of its snapshot, entirely ignoring whatever a concurrent writer is doing to create newer versions.
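
The snapshot rule can be made concrete with a toy version store in Python. This is a deliberately simplified sketch: it models only committed versions ordered by a logical commit counter, and it leaves out in-progress transactions, rollback, and garbage collection. The class and method names are hypothetical.

```python
class VersionedStore:
    """Toy multi-version store: every write appends a version instead of overwriting."""

    def __init__(self):
        self._versions = {}  # key -> list of (commit_ts, value), oldest first
        self._clock = 0      # logical commit counter

    def commit_write(self, key, value):
        """Commit a new version of `key` and return its commit timestamp."""
        self._clock += 1
        self._versions.setdefault(key, []).append((self._clock, value))
        return self._clock

    def take_snapshot(self):
        """A snapshot is 'everything committed up to now'."""
        return self._clock

    def read(self, key, snapshot_ts):
        """Return the newest version committed at or before the snapshot, else None."""
        visible = None
        for commit_ts, value in self._versions.get(key, []):
            if commit_ts <= snapshot_ts:
                visible = value
        return visible
```

A reader that takes a snapshot after `commit_write("balance:1", 500)` keeps reading 500 from that snapshot even after a later `commit_write("balance:1", 400)`, while a fresh snapshot sees 400. Neither side ever waits for the other.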

The key benefit: readers don't block writers, and vice versa

-- Transaction A (long-running report query)
BEGIN;
SELECT SUM(balance) FROM accounts;   -- takes 30 seconds, sees a consistent snapshot

-- Transaction B, running concurrently, is NOT blocked by A's SELECT
BEGIN;
UPDATE accounts SET balance = balance - 100 WHERE id = 1;
COMMIT;   -- succeeds immediately, doesn't wait for A's SELECT to finish

Transaction A's report simply doesn't see B's update (it wasn't committed as of A's snapshot) — it gets a consistent, if slightly stale, view rather than being blocked or seeing a torn/partial state.

The cost: old versions must be cleaned up

Since old row versions stick around until no transaction could possibly need them, MVCC engines need a garbage-collection mechanism:

PostgreSQL: VACUUM (usually via autovacuum) reclaims space from dead row versions ("dead tuples"). Under-vacuumed tables can bloat significantly, and in extreme, badly-managed cases risk transaction ID wraparound issues.
MySQL/InnoDB: maintains an undo log of old versions, purged by a background thread once no longer needed; a very long-running transaction can bloat the undo log by preventing purges.
Oracle: similarly uses undo segments/tablespaces for the same purpose.

Why this matters for interviews

MVCC explains a lot of otherwise-confusing behavior: why a SELECT in PostgreSQL basically never blocks on a concurrent UPDATE, why a long transaction can cause table bloat, and why "Repeatable Read" is comparatively cheap to implement in an MVCC engine (it's mostly "pick your snapshot once, at transaction start, instead of per-statement") compared to a purely lock-based concurrency model.

Related Resources

PostgreSQL: MVCC

Pessimistic concurrency control

Lock the row before doing anything, so no one else can modify it until you're done:

BEGIN;
SELECT * FROM seats WHERE id = 42 FOR UPDATE;   -- acquires an exclusive row lock
-- ... application logic decides whether the seat can be booked ...
UPDATE seats SET status = 'booked' WHERE id = 42;
COMMIT;   -- lock released here

Any other transaction trying to SELECT ... FOR UPDATE (or update) row 42 simply blocks until this transaction commits or rolls back. Pros: conceptually simple, guaranteed to prevent conflicting writes, and no conflict-retry logic needed. Cons: reduces concurrency (other transactions wait), and holding locks across slow operations (a network call, user think-time) can cause serious contention or even deadlocks under load, and a deadlock victim does have to be retried.

Optimistic concurrency control

Read without locking, then verify at write time that nothing changed:

-- Read
SELECT id, status, version FROM seats WHERE id = 42;
-- app gets: status='available', version=7

-- ... application logic, possibly slow (user confirms booking) ...

-- Write: only succeeds if version is still 7
UPDATE seats SET status = 'booked', version = version + 1
WHERE id = 42 AND version = 7;

-- Check rows affected: if 0, someone else updated it first -- retry or fail

Pros: no locks held during the "thinking" period, so no blocking of other transactions, and no deadlock risk from this pattern. Cons: requires a version/timestamp column and retry logic in the application, and under high contention, many transactions may repeatedly fail and retry (worse throughput than a lock would have given, ironically) — optimism is a bad bet exactly when conflicts are actually common.
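
The version check above can be exercised end to end with Python's built-in `sqlite3` module, used here as a self-contained stand-in for a server database. The helper names `make_seats`, `read_seat`, and `try_book` are assumptions for the example; the table mirrors the `seats` columns used in the SQL above.

```python
import sqlite3

def make_seats():
    """Create an in-memory seats table holding one available seat at version 7."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE seats ("
        " id INTEGER PRIMARY KEY,"
        " status TEXT NOT NULL,"
        " version INTEGER NOT NULL)"
    )
    conn.execute("INSERT INTO seats VALUES (42, 'available', 7)")
    conn.commit()
    return conn

def read_seat(conn, seat_id):
    """Read (status, version) without taking any lock."""
    return conn.execute(
        "SELECT status, version FROM seats WHERE id = ?", (seat_id,)
    ).fetchone()

def try_book(conn, seat_id, expected_version):
    """Book the seat only if nobody changed it since it was read."""
    with conn:
        cur = conn.execute(
            "UPDATE seats SET status = 'booked', version = version + 1 "
            "WHERE id = ? AND version = ?",
            (seat_id, expected_version),
        )
    # rowcount is 1 if the version still matched, 0 if someone else got there first.
    return cur.rowcount == 1
```

Two callers that both read version 7 can both call `try_book(conn, 42, 7)`, but only the first call returns `True`; the second finds the version already bumped to 8 and must re-read and decide whether to retry.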

When to use which

Pessimistic: high contention on the same rows, or when the cost of retrying a failed operation is high (e.g., a multi-step external side effect that's hard to safely redo).
Optimistic: low contention, read-heavy workloads, or web applications where holding a database lock across a slow client round-trip (think-time) would be unacceptable — e.g., editing a document where two users rarely edit the exact same record simultaneously.

Where this shows up in ORMs

Most ORMs (Entity Framework, Hibernate, etc.) implement optimistic concurrency natively via a RowVersion/@Version column, throwing a concurrency exception on a version mismatch that the application must catch and handle (typically by reloading and prompting the user, or retrying). This is usually preferred over pessimistic locking in typical web applications specifically because holding a database transaction open across an HTTP request/user interaction is almost always the wrong design.

Related Resources

PostgreSQL: Explicit Locking

A minimal deadlock example

-- Transaction A                               -- Transaction B
BEGIN;                                         BEGIN;
UPDATE accounts SET balance = balance - 10
  WHERE id = 1;
  -- A now holds a lock on row 1
                                               UPDATE accounts SET balance = balance - 10
                                                 WHERE id = 2;
                                               -- B now holds a lock on row 2
UPDATE accounts SET balance = balance + 10
  WHERE id = 2;
  -- A waits for B's lock on row 2
                                               UPDATE accounts SET balance = balance + 10
                                                 WHERE id = 1;
                                               -- B waits for A's lock on row 1
                                               -- DEADLOCK: neither can proceed

A is waiting for a lock B holds; B is waiting for a lock A holds. Without intervention, neither will ever release its lock.

Detection

Most engines maintain a wait-for graph, in which an edge from transaction X to transaction Y means "X is waiting on a lock held by Y." The engine checks this graph for a cycle, either on each new lock wait or once a wait has lasted longer than a short threshold (PostgreSQL's deadlock_timeout, for example); a cycle means a deadlock exists. This resolves deadlocks much sooner than the alternative used by some simpler systems, a pure lock-wait timeout, which cannot tell a deadlock from a merely slow transaction and has to wait for an arbitrary timeout to expire.
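
Cycle detection on a wait-for graph is an ordinary depth-first search. The sketch below assumes the graph is given as a plain dictionary from each waiting transaction to the transactions it is waiting on; that representation and the function name are choices made for the example, not how any engine stores its lock table.

```python
def find_deadlock_cycle(wait_for):
    """Return one cycle as a list of transaction ids, or None if there is none.

    `wait_for` maps each waiting transaction to the transactions it waits on.
    """
    UNSEEN, IN_PROGRESS, DONE = 0, 1, 2
    state = {}
    path = []  # transactions on the current DFS path, in order

    def visit(txn):
        state[txn] = IN_PROGRESS
        path.append(txn)
        for blocker in wait_for.get(txn, ()):
            blocker_state = state.get(blocker, UNSEEN)
            if blocker_state == IN_PROGRESS:
                # We reached a transaction already on the path: that closes a cycle.
                return path[path.index(blocker):]
            if blocker_state == UNSEEN:
                cycle = visit(blocker)
                if cycle:
                    return cycle
        path.pop()
        state[txn] = DONE
        return None

    for txn in list(wait_for):
        if state.get(txn, UNSEEN) == UNSEEN:
            cycle = visit(txn)
            if cycle:
                return cycle
    return None
```

For the two-transaction example above, `find_deadlock_cycle({"A": ["B"], "B": ["A"]})` returns `["A", "B"]`; a chain such as `{"A": ["B"], "B": ["C"]}` has no cycle and returns `None`.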

Resolution

Once a cycle is found, the database picks a victim transaction — typically the one that would be cheapest to roll back (least work done, fewest locks held, or simply the one that most recently joined the cycle, depending on engine) — and forcibly aborts it with a deadlock error, releasing its locks so the other transaction(s) can proceed.

ERROR: deadlock detected
DETAIL: Process 1234 waits for ShareLock on transaction 5678; blocked by process 5678.
        Process 5678 waits for ShareLock on transaction 1234; blocked by process 1234.
HINT: See server log for query details.

The application's responsibility

A deadlock is not a bug in the database — it's an expected, occasional outcome of concurrent access patterns, and the losing transaction's work is fully rolled back (atomicity holds). Applications must catch this specific error and retry the whole transaction from the beginning (not just the last statement), typically with a short randomized backoff to avoid immediately re-colliding with the same transaction.
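
A retry wrapper along these lines might look like the following Python sketch. `DeadlockDetected` is a hypothetical stand-in: real drivers raise their own exception types for a deadlock victim, and the caller would catch that type instead. The attempt limit and delays are arbitrary illustrative values.

```python
import random
import time

class DeadlockDetected(Exception):
    """Stand-in for the driver-specific error raised for a deadlock victim."""

def run_with_retry(transaction, max_attempts=5, base_delay=0.05, sleep=time.sleep):
    """Run `transaction()` from the top, retrying if it is chosen as a deadlock victim.

    `transaction` must redo the whole unit of work (BEGIN through COMMIT) each time.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return transaction()
        except DeadlockDetected:
            if attempt == max_attempts:
                raise  # give up and surface the error to the caller
            # Randomized exponential backoff, so both sides do not retry in lockstep.
            sleep(random.uniform(0, base_delay * 2 ** (attempt - 1)))
```

The `sleep` parameter exists only so the delay can be replaced in tests; in normal use the default `time.sleep` applies.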

How to reduce deadlock frequency

Access rows/tables in a consistent order across all transactions — in the example above, if both transactions always updated account 1 before account 2, no cycle could ever form.
Keep transactions short — the longer a transaction holds locks, the more opportunity for another transaction to also be waiting on something it holds.
Use the lowest isolation level that satisfies your correctness needs (higher isolation levels generally take more/broader locks, or in MVCC engines, cause more serialization failures which are a related-but-distinct phenomenon from classic lock deadlocks).
Consider explicit, application-level lock ordering (e.g., always lock the row with the lower ID first) for code paths known to touch multiple rows that could also be touched in the opposite order elsewhere.
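
The consistent-ordering rule can be demonstrated outside a database with ordinary thread locks standing in for row locks. In this Python sketch (all names hypothetical), two threads transfer in opposite directions, which is exactly the pattern that deadlocked above, but each transfer always locks the lower account id first:

```python
import threading

class Account:
    def __init__(self, account_id, balance):
        self.id = account_id
        self.balance = balance
        self.lock = threading.Lock()  # stands in for a database row lock

def transfer(src, dst, amount):
    """Move `amount` between two distinct accounts, always locking the lower id first."""
    first, second = sorted((src, dst), key=lambda account: account.id)
    with first.lock:
        with second.lock:
            src.balance -= amount
            dst.balance += amount

def run_opposing_transfers(rounds=1000):
    """Run A->B and B->A transfers concurrently; return balances and whether a thread hung."""
    a, b = Account(1, 1000), Account(2, 1000)

    def repeat(src, dst):
        for _ in range(rounds):
            transfer(src, dst, 1)

    threads = [
        threading.Thread(target=repeat, args=(a, b), daemon=True),
        threading.Thread(target=repeat, args=(b, a), daemon=True),
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join(timeout=10)
    hung = any(t.is_alive() for t in threads)
    return a.balance, b.balance, hung
```

Because both directions acquire the locks in the same order, no circular wait can form. If `transfer` instead locked `src` before `dst`, the two threads could each hold one lock and wait forever on the other.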

Related Resources

PostgreSQL: Deadlocks

Row-level locking

BEGIN;
UPDATE accounts SET balance = balance - 10 WHERE id = 1;
-- Only row id=1 is locked; other transactions can freely update rows 2, 3, 4...
COMMIT;

This is the default granularity for ordinary DML in InnoDB (MySQL), PostgreSQL, and SQL Server. It maximizes concurrency because unrelated rows in the same table remain fully accessible to other transactions. The cost is more bookkeeping: the engine must track potentially many individual row locks per transaction.

Table-level locking

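-- PostgreSQL syntax; must run inside a transaction, and the lock is held until that transaction ends
LOCK TABLE accounts IN ACCESS EXCLUSIVE MODE;   -- blocks ALL other access to the whole table, including plain SELECTs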

Locks the entire table regardless of which specific rows are touched — simple to implement and low per-operation overhead, but drastically reduces concurrency, since even transactions touching completely unrelated rows must wait. Common uses: DDL operations (ALTER TABLE) that must see a globally consistent view of the schema, or explicit bulk maintenance operations where you deliberately want exclusive access.

Page-level locking

Locks a whole disk page (which typically holds multiple rows) rather than a single row or the whole table. Some engines (older SQL Server versions, MySQL's legacy BDB storage engine) used this middle ground for lower locking overhead than pure row-level locking, at the cost of unrelated rows on the same page blocking each other, a form of false contention loosely analogous to false sharing in CPU caches.

Lock escalation

Some engines (notably SQL Server) automatically escalate from many row-level locks to a single table-level lock once a transaction holds "too many" row locks (a threshold, often ~5,000), to reduce lock-management memory overhead — this can unexpectedly turn a seemingly-fine-grained update into a full-table lock if it touches a large number of rows in one transaction, a subtle gotcha worth knowing when debugging unexpected blocking on large batch updates.

Default to trusting your engine's row-level locking for normal OLTP transactions — it's what's designed for high-concurrency workloads. Reach for explicit table-level locks only for genuinely table-wide maintenance operations, and be aware that a very large single-transaction batch update might unintentionally trigger lock escalation (SQL Server) or simply hold a very large number of row locks for a long time (PostgreSQL/MySQL), either of which can significantly increase blocking for other transactions — consider batching large updates into smaller committed chunks instead.
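
Batching a large update into smaller committed chunks might look like the following Python sketch. It uses the built-in `sqlite3` module only to stay self-contained; the `orders` table, its `archived` flag, and the function names are hypothetical, and the right batch size for a real workload has to be measured.

```python
import sqlite3

def archive_in_batches(conn, batch_size=1000):
    """Flag unarchived rows in chunks, committing after each chunk.

    Returns the number of non-empty batches that were committed.
    """
    batches = 0
    while True:
        with conn:  # one short transaction per chunk, so locks are released often
            cur = conn.execute(
                "UPDATE orders SET archived = 1 "
                "WHERE id IN (SELECT id FROM orders WHERE archived = 0 "
                "             ORDER BY id LIMIT ?)",
                (batch_size,),
            )
        if cur.rowcount == 0:
            return batches
        batches += 1

def demo():
    """Archive 2,500 rows in batches of 1,000; return (batches, rows left unarchived)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE orders (id INTEGER PRIMARY KEY, archived INTEGER NOT NULL DEFAULT 0)"
    )
    conn.executemany("INSERT INTO orders (id) VALUES (?)", [(i,) for i in range(1, 2501)])
    conn.commit()
    batches = archive_in_batches(conn, batch_size=1000)
    remaining = conn.execute("SELECT COUNT(*) FROM orders WHERE archived = 0").fetchone()[0]
    return batches, remaining
```

The trade-off is that the overall operation is no longer atomic: a failure partway through leaves some batches committed, so the update has to be safe to resume (here it is, because already-archived rows are simply skipped).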

Related Resources

MySQL: InnoDB Locking

Compatibility matrix

| Requested mode | Shared held by another txn | Exclusive held by another txn |
| --- | --- | --- |
| Request Shared | Allowed (both can hold it) | Blocked |
| Request Exclusive | Blocked | Blocked |

Shared lock — for reading

-- Explicit shared lock (SQL Server style)
SELECT * FROM accounts WITH (HOLDLOCK) WHERE id = 1;

Multiple transactions can hold a shared lock on the same row/table simultaneously — any number of concurrent readers is fine, since none of them are modifying the data. A shared lock only conflicts with an exclusive lock request: if any transaction holds a shared lock, no one else can acquire an exclusive lock on that resource until all the shared locks are released.

Exclusive lock — for writing

UPDATE accounts SET balance = balance - 10 WHERE id = 1;
-- Acquires an exclusive lock on row id=1 for the duration of the transaction

Only one transaction can hold an exclusive lock on a given resource at a time, and while held, no other transaction can acquire any lock (shared or exclusive) on that same resource — it must wait.
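
The shared/exclusive rules fit in a few lines of Python. This toy `RowLock` is an illustrative sketch only: it answers "would this request be granted right now?" and has no wait queue, fairness, or deadlock handling, all of which a real lock manager needs.

```python
# (requested mode, mode held by another transaction) -> can both coexist?
COMPATIBLE = {
    ("shared", "shared"): True,
    ("shared", "exclusive"): False,
    ("exclusive", "shared"): False,
    ("exclusive", "exclusive"): False,
}

class RowLock:
    """Tracks which transactions currently hold a lock on one row, and in which mode."""

    def __init__(self):
        self.holders = {}  # transaction id -> "shared" or "exclusive"

    def try_acquire(self, txn, mode):
        """Grant the lock if it is compatible with every other holder; never blocks."""
        for other, held in self.holders.items():
            if other != txn and not COMPATIBLE[(mode, held)]:
                return False
        if self.holders.get(txn) != "exclusive":  # never downgrade an exclusive lock
            self.holders[txn] = mode
        return True

    def release(self, txn):
        """Drop whatever lock `txn` holds on this row."""
        self.holders.pop(txn, None)
```

With two readers holding shared locks, a third transaction's exclusive request is refused until both readers release; once it holds the exclusive lock, new shared requests are refused in turn.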

Why MVCC changes the practical picture

In a pure lock-based engine, a long-running reader holding a shared lock can block a writer, and vice versa. Under MVCC (PostgreSQL, InnoDB, SQL Server's snapshot isolation), plain SELECTs generally don't take shared row locks at all — they read a consistent snapshot instead (see the MVCC question), so ordinary reads and writes don't block each other. Shared/exclusive locks in an MVCC engine mostly come into play for explicit locking reads (SELECT ... FOR SHARE / SELECT ... FOR UPDATE) and for the writes themselves, not for plain reads.

-- Explicit shared lock in PostgreSQL/MySQL: "I'm reading this, don't let anyone modify it until I'm done"
SELECT * FROM accounts WHERE id = 1 FOR SHARE;

-- Explicit exclusive lock: "I intend to modify this, block others from reading-for-update or writing it"
SELECT * FROM accounts WHERE id = 1 FOR UPDATE;

Use FOR UPDATE (exclusive) when you're about to modify a row based on its current value and must prevent another transaction from changing it in between your read and your write (classic "read, check, then write" race condition). Use FOR SHARE when you need to ensure a row doesn't change while you rely on its value, but you're not modifying it yourself and are fine with other readers also holding a shared lock concurrently.

Related Resources

SQL Server: Lock Modes

A phantom read example

-- Transaction A                                -- Transaction B
BEGIN;
SELECT COUNT(*) FROM orders
  WHERE status = 'pending';  -- returns 5
                                                 BEGIN;
                                                 INSERT INTO orders (status, ...)
                                                   VALUES ('pending', ...);
                                                 COMMIT;
SELECT COUNT(*) FROM orders
  WHERE status = 'pending';  -- returns 6 !!
COMMIT;

Transaction A ran the identical query twice within the same transaction and got two different row counts, because Transaction B committed a new row matching the filter in between. The new row is the "phantom" — it wasn't part of the original result set, then suddenly was.

Why this differs from a non-repeatable read

A non-repeatable read is about a specific, already-fetched row changing value; a phantom read is about the set of rows matching a condition changing, specifically due to inserts or deletes (not updates to existing rows already in the result). This distinction is why some isolation levels can prevent one but not the other — locking a specific set of already-read rows (preventing non-repeatable reads) doesn't automatically prevent a new row from being inserted into that range.

Which levels prevent it

Per the ANSI SQL standard, only Serializable is required to fully prevent phantom reads. The standard permits Repeatable Read to still allow them, because traditional row-locking implementations lock the rows you've already read, not a "gap" covering rows that don't exist yet.

However, engine implementations vary. PostgreSQL implements Repeatable Read via snapshot isolation (MVCC): the snapshot is taken when the transaction runs its first query, and since the "new" row in the example above isn't part of that snapshot, PostgreSQL's Repeatable Read does prevent this specific phantom scenario, which is stricter than the standard's minimum requirement for that level. In MySQL/InnoDB's Repeatable Read, plain SELECTs likewise read from a snapshot, and locking reads additionally take gap locks and next-key locks to block phantom inserts within the locked range.

Don't assume "Repeatable Read" means the same guarantee across every database — verify your specific engine's actual behavior rather than relying on the SQL standard's minimum bar, since several major engines (PostgreSQL, MySQL/InnoDB) exceed it. If phantom reads matter to your business logic and you're not certain of your engine's exact behavior at Repeatable Read, Serializable is the only level the standard guarantees will prevent them everywhere.

Related Resources

PostgreSQL: Transaction Isolation

The problem: atomicity across multiple systems

A normal BEGIN...COMMIT transaction is atomic within one database. But suppose you need to atomically update two separate databases (e.g., debit an account in DB-A and credit an account in DB-B, where A and B are physically separate database servers) — a plain commit on each individually can't guarantee both succeed or both fail together; one could commit while the other crashes or fails.

The two phases

Phase 1 — Prepare: The coordinator asks every participant to do everything needed to commit (validate constraints, write to its own durable log) and reply "yes, I can commit" or "no, I can't" — but without actually making the change visible/permanent yet.

Coordinator -> Participant A: PREPARE
Coordinator -> Participant B: PREPARE
Participant A -> Coordinator: READY (durably logged, could commit or abort from here)
Participant B -> Coordinator: READY

Phase 2 — Commit (or Abort): Only if every participant replied "yes" does the coordinator tell everyone to actually commit. If even one participant said "no" (or timed out), the coordinator tells everyone to abort instead.

Coordinator -> Participant A: COMMIT
Coordinator -> Participant B: COMMIT
Participant A -> Coordinator: COMMITTED
Participant B -> Coordinator: COMMITTED

Because every participant durably logged "I'm ready to commit" before replying yes in phase 1, even a crash between phase 1 and phase 2 is recoverable — on restart, a participant can check with the coordinator (or replay its own log) to find out whether the overall transaction ultimately committed or aborted, and finish accordingly.
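
The control flow of the two phases can be sketched in Python with in-memory participants. This is a teaching model under strong assumptions: there is no network, no durable logging, no timeouts, and no crash recovery, which are precisely the hard parts of a real implementation. The class and function names are hypothetical.

```python
class Participant:
    """One resource taking part in the distributed transaction."""

    def __init__(self, name, can_commit=True):
        self.name = name
        self.can_commit = can_commit
        self.state = "idle"

    def prepare(self):
        # A real participant would durably log its vote before replying.
        self.state = "prepared" if self.can_commit else "aborted"
        return self.can_commit

    def commit(self):
        self.state = "committed"

    def abort(self):
        self.state = "aborted"

def two_phase_commit(participants):
    """Run both phases and return the overall outcome: 'committed' or 'aborted'."""
    # Phase 1: ask every participant to prepare and collect the votes.
    votes = [p.prepare() for p in participants]

    # Phase 2: commit only on a unanimous yes, otherwise abort everywhere.
    if all(votes):
        for p in participants:
            p.commit()
        return "committed"
    for p in participants:
        p.abort()
    return "aborted"
```

With two willing participants the outcome is `"committed"`; if either one is constructed with `can_commit=False`, both end in the `"aborted"` state, including the one that had voted yes.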

When it's actually needed

Genuine distributed transactions across separate database engines/instances — e.g., updating two different PostgreSQL clusters, or a database plus a JMS message queue, as a single atomic unit (XA transactions implement this pattern in many enterprise stacks).
Rare in modern web-scale architectures, because 2PC has real costs: it's blocking (participants hold locks/resources while waiting through both phases), and the coordinator is a single point of failure/bottleneck during the protocol.

Why most modern systems avoid it

Distributed systems at scale generally prefer eventual consistency over 2PC, using one of two approaches. The Saga pattern runs a sequence of local transactions, each with a corresponding compensating ("undo") action that is executed if a later step fails. The outbox pattern writes the "intent to do X" in the same local transaction as the primary change, and a background process then reliably delivers it using idempotent, retryable operations. Both avoid 2PC's blocking behavior and single-coordinator bottleneck, at the cost of only eventual (not immediate) cross-system consistency.
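
The outbox idea in particular is easy to show in code. The Python sketch below uses the built-in `sqlite3` module as a stand-in database; the `orders` and `outbox` tables, the event payload, and the `publish` callback (which would wrap a real message broker client) are all assumptions for the example.

```python
import json
import sqlite3

def make_db():
    """Create hypothetical orders and outbox tables in an in-memory database."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, total INTEGER NOT NULL)")
    conn.execute(
        "CREATE TABLE outbox ("
        " id INTEGER PRIMARY KEY AUTOINCREMENT,"
        " payload TEXT NOT NULL,"
        " sent INTEGER NOT NULL DEFAULT 0)"
    )
    conn.commit()
    return conn

def place_order(conn, order_id, total):
    """Write the order and its outgoing event in ONE local transaction."""
    event = json.dumps({"event": "order_placed", "order_id": order_id})
    with conn:  # both inserts commit together or not at all
        conn.execute("INSERT INTO orders (id, total) VALUES (?, ?)", (order_id, total))
        conn.execute("INSERT INTO outbox (payload) VALUES (?)", (event,))

def relay_outbox(conn, publish):
    """Deliver unsent events in order; return how many were delivered."""
    rows = conn.execute(
        "SELECT id, payload FROM outbox WHERE sent = 0 ORDER BY id"
    ).fetchall()
    for row_id, payload in rows:
        # A crash after publish but before the UPDATE re-sends this event later,
        # so consumers must treat deliveries as at-least-once (be idempotent).
        publish(json.loads(payload))
        with conn:
            conn.execute("UPDATE outbox SET sent = 1 WHERE id = ?", (row_id,))
    return len(rows)
```

Because the event row is written in the same transaction as the order, there is never an order without its event or an event without its order; delivery to the other system simply happens a little later.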

2PC is worth knowing conceptually — it's the classical answer to "how do you get atomicity across two databases" — but a strong answer also mentions why modern distributed architectures usually avoid it in favor of sagas/outbox patterns, since that shows awareness of its real operational costs, not just the protocol's mechanics.

Related Resources

Wikipedia: Two-Phase Commit Protocol

This is a very common system-design-flavored SQL interview question — it tests whether you understand that "check, then act" in application code is inherently racy against another concurrent request doing the same thing.

The bug, first

-- Application code, naive version:
-- Step 1: check availability
SELECT available FROM seats WHERE id = 42;   -- returns true

-- (time passes — two concurrent requests can both get 'true' here)

-- Step 2: book it
UPDATE seats SET available = false WHERE id = 42;

If two users' requests both execute Step 1 before either executes Step 2, both will see available = true and both will proceed to "book" the seat — a classic lost-update/double-booking race condition. The check and the write are two separate round-trips with a gap between them where another transaction can interleave.

Fix 1: pessimistic locking — make the check-then-act atomic via a lock

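BEGIN;
SELECT available FROM seats WHERE id = 42 FOR UPDATE;  -- locks the row
-- Any other transaction's FOR UPDATE on row 42 now blocks here until we commit/rollback.
-- The application branches on the value it just read (this part is application code, not SQL).
-- If available = true:
UPDATE seats SET available = false WHERE id = 42;
COMMIT;       -- lock released; the second transaction now proceeds and sees available = false
-- Otherwise, instead of the UPDATE and COMMIT above:
-- ROLLBACK;  -- lock released; reject the booking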

The second transaction's SELECT ... FOR UPDATE blocks until the first commits, then correctly sees available = false and can reject the booking.

Fix 2: atomic conditional update — no explicit lock needed

Often simpler and doesn't require holding a transaction open across application logic:

UPDATE seats
SET available = false
WHERE id = 42 AND available = true;

-- Check rows affected (returned by most drivers/ORMs):
-- If 1 row affected: you got the seat.
-- If 0 rows affected: someone else already booked it (or it never existed) -- reject/retry.

This works because the check (available = true) and the write happen as a single atomic statement at the database level — the database itself guarantees no other transaction's update can interleave in the middle of one UPDATE statement's row evaluation, regardless of isolation level.
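
A quick way to convince yourself is to race several "requests" against one seat. The Python sketch below does that with threads and the built-in `sqlite3` module, each thread using its own connection to a temporary database file. SQLite is only a stand-in here: it serializes writers at the database level rather than with row locks, but the observable outcome (exactly one winner) is the property being illustrated. The helper names and the thread count are arbitrary.

```python
import os
import sqlite3
import tempfile
import threading

def book_seat(db_path, seat_id):
    """Try to claim the seat with one conditional UPDATE; return True on success."""
    # isolation_level=None puts the connection in autocommit mode,
    # so the single UPDATE statement is its own transaction.
    conn = sqlite3.connect(db_path, timeout=30, isolation_level=None)
    try:
        cur = conn.execute(
            "UPDATE seats SET available = 0 WHERE id = ? AND available = 1",
            (seat_id,),
        )
        return cur.rowcount == 1  # 1 row changed: we got it; 0 rows: already taken
    finally:
        conn.close()

def race(n_requests=8):
    """Fire n concurrent booking attempts at one seat; return each attempt's result."""
    with tempfile.TemporaryDirectory() as tmp:
        db_path = os.path.join(tmp, "seats.db")
        setup = sqlite3.connect(db_path)
        setup.execute("CREATE TABLE seats (id INTEGER PRIMARY KEY, available INTEGER NOT NULL)")
        setup.execute("INSERT INTO seats VALUES (42, 1)")
        setup.commit()
        setup.close()

        results = []
        results_lock = threading.Lock()

        def worker():
            won = book_seat(db_path, 42)
            with results_lock:
                results.append(won)

        threads = [threading.Thread(target=worker) for _ in range(n_requests)]
        for t in threads:
            t.start()
        for t in threads:
            t.join()
        return results
```

However the threads interleave, `race()` should contain exactly one `True`: the first UPDATE to run flips the flag, and every later one matches zero rows.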

Which to prefer

The atomic conditional UPDATE (Fix 2) is usually the better default: it avoids holding open transactions/locks across application logic (which is risky if that logic is slow or fails unexpectedly), and it composes well with optimistic-concurrency patterns more broadly. Reach for explicit FOR UPDATE locking when the "check" involves logic too complex to express in the WHERE clause of a single atomic UPDATE.

The general principle

Never trust a "read, then act based on what I read" sequence to be safe under concurrency unless the read and the act are combined into a single atomic database operation, or the row is explicitly locked for the entire duration between them.

Related Resources

PostgreSQL: Explicit Locking