Q: What is the difference between optimistic and pessimistic locking?

Pessimistic locking acquires a lock before reading or writing, assuming conflicts are likely, so other transactions wait. Optimistic locking takes no lock; it records a version, does the work, and checks at commit time whether the version changed, retrying if it did. Pessimistic suits high-contention writes; optimistic suits read-heavy, low-contention workloads.

Q: What is MVCC?

Multi-Version Concurrency Control (MVCC) keeps multiple versions of a row so readers see a consistent snapshot without blocking writers, and writers don't block readers. Each transaction reads the version valid as of its start, which is how databases like PostgreSQL provide snapshot isolation efficiently.


Fundamental 02 · ~13 min read · Intermediate

Deep Dive

Two writers, one truth. Locking keeps it honest.

Q: What are the four SQL isolation levels?

Read Uncommitted (allows dirty reads), Read Committed (prevents dirty reads but allows non-repeatable reads), Repeatable Read (prevents non-repeatable reads but can allow phantoms), and Serializable (prevents all anomalies and behaves as if transactions ran one at a time). Higher levels are safer but slower.

Q: Why are distributed locks dangerous?

A process can acquire a lock, pause (GC or network delay) past the lock's expiry, then resume believing it still holds the lock while another process has taken it. The fix is a fencing token: a monotonically increasing number issued with the lock that downstream storage uses to reject stale writers.

The instant two operations touch the same data at the same time, correctness is on the line — lost updates, dirty reads, double-spends. Concurrency control is the set of tools that decide which interleavings are allowed. Here's the full toolkit, from row locks to fencing tokens.

The races you're preventing

A race condition is when the result depends on the timing of operations that should have been independent. The classic is the lost update: two transactions read a balance of 100, each add 10, each write 110 — one update vanishes. Concurrency control exists to make these impossible.
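To make the lost update concrete, here is a minimal sketch that replays the interleaving described above by hand, with no threads and no real database. The `Account` class and both function names are hypothetical, invented only for this illustration.

```python
# Deterministic replay of the lost-update interleaving (no threads needed).
class Account:
    def __init__(self, balance):
        self.balance = balance


def interleaved_deposits(account, amount_a, amount_b):
    """Both transactions read before either writes, so one update is lost."""
    read_a = account.balance              # T1 reads 100
    read_b = account.balance              # T2 reads 100 (stale once T1 writes)
    account.balance = read_a + amount_a   # T1 writes 110
    account.balance = read_b + amount_b   # T2 overwrites with 110
    return account.balance


def serial_deposits(account, amount_a, amount_b):
    """Each read-modify-write completes before the next one starts."""
    account.balance = account.balance + amount_a
    account.balance = account.balance + amount_b
    return account.balance


print(interleaved_deposits(Account(100), 10, 10))  # 110: one deposit vanished
print(serial_deposits(Account(100), 10, 10))       # 120: both deposits applied
```

The only difference between the two functions is when the reads happen relative to the writes, which is exactly what concurrency control constrains.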

Anomaly	What happens
Dirty read	You read data another transaction wrote but hasn't committed — and it may roll back.
Non-repeatable read	You read the same row twice in one transaction and get different values.
Phantom read	You re-run a range query and new rows have appeared.
Lost update	Two read-modify-write cycles overlap; one overwrites the other.
Write skew	Two transactions read overlapping data, then write disjoint rows that together break an invariant.

Pessimistic vs optimistic locking

The strategy you choose comes down to one question: how often do operations actually collide?

Two philosophies of conflict

Pessimistic — "assume conflict"

Lock before reading/writing
SELECT … FOR UPDATE
Others block and wait their turn
No wasted work, but risks deadlocks
Best for hot rows, high contention

Optimistic — "assume no conflict"

Read a version number, do the work
Compare-and-swap at commit time
No blocking; retry if version changed
Wasted work only when conflict is real
Best for read-heavy, low contention

Pessimistic locks come in shared (read) and exclusive (write) flavours; the danger is the deadlock — two transactions each holding a lock the other needs. Databases detect cycles and abort a victim, but you avoid them by acquiring locks in a consistent order. Optimistic concurrency avoids deadlocks entirely by never holding locks — at the price of a retry path you must actually write.
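As a rough illustration of the optimistic side, the sketch below uses Python's built-in `sqlite3` module and a version column to perform a compare-and-swap style update. The `seats` table, its columns, and the helper names (`make_db`, `read_seat`, `try_book`) are assumptions made up for this example, not a prescribed schema.

```python
import sqlite3


def make_db():
    """Create an in-memory table with a version column (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE seats (id INTEGER PRIMARY KEY, holder TEXT, version INTEGER NOT NULL)"
    )
    conn.execute("INSERT INTO seats (id, holder, version) VALUES (1, NULL, 0)")
    conn.commit()
    return conn


def read_seat(conn, seat_id):
    """Return (holder, version) as seen at read time."""
    return conn.execute(
        "SELECT holder, version FROM seats WHERE id = ?", (seat_id,)
    ).fetchone()


def try_book(conn, seat_id, user, seen_version):
    """Compare-and-swap: the UPDATE matches only if the version is unchanged."""
    cur = conn.execute(
        "UPDATE seats SET holder = ?, version = version + 1 "
        "WHERE id = ? AND version = ? AND holder IS NULL",
        (user, seat_id, seen_version),
    )
    conn.commit()
    return cur.rowcount == 1              # 0 rows updated means someone got there first


conn = make_db()
_, v_alice = read_seat(conn, 1)           # both users read version 0
_, v_bob = read_seat(conn, 1)
print(try_book(conn, 1, "alice", v_alice))  # True: version 0 is still current
print(try_book(conn, 1, "bob", v_bob))      # False: version is now 1
print(read_seat(conn, 1))                   # ('alice', 1)
```

When `try_book` returns `False`, the caller is on the retry path: it re-reads the row and decides whether to try again or give up. Here the seat is already taken, so giving up is the right outcome.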

A deadlock is precisely a cycle in the wait-for graph, and consistent lock ordering makes the cycle impossible:

T1 holds A, wants B · T2 holds B, wants A → T1 → T2 → T1 = cycle
Everyone acquires A then B → T2 waits for T1, no cycle

Whoever holds the lower-ordered lock always makes progress, so no one waits in a circle. The runnable version below detects the deadlock and shows the fix.
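With real threads, consistent ordering might look like the sketch below. It assumes a hypothetical global ranking (`LOCK_RANK`) and a hypothetical helper (`acquire_in_order`) that sorts the requested locks before taking them; only `threading.Lock` and `contextlib.ExitStack` come from the standard library.

```python
import threading
from contextlib import ExitStack

lock_a = threading.Lock()
lock_b = threading.Lock()
LOCK_RANK = {id(lock_a): 0, id(lock_b): 1}   # hypothetical global order: A before B


def acquire_in_order(stack, *locks):
    """Enter the locks sorted by rank, whatever order the caller listed them in."""
    for lock in sorted(locks, key=lambda lk: LOCK_RANK[id(lk)]):
        stack.enter_context(lock)


def transfer(name, first, second, log):
    with ExitStack() as stack:               # releases every lock on exit
        acquire_in_order(stack, first, second)
        log.append(name)                     # critical section


def run_transfers():
    log = []
    t1 = threading.Thread(target=transfer, args=("T1", lock_a, lock_b, log))
    t2 = threading.Thread(target=transfer, args=("T2", lock_b, lock_a, log))  # opposite order
    for t in (t1, t2):
        t.start()
    for t in (t1, t2):
        t.join(timeout=5)
    return sorted(log)


print(run_transfers())   # ['T1', 'T2']: both finish, no circular wait
```

T2 asks for B then A, but the helper still takes A first, so the two threads can only ever queue behind one another.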

→ Interview tip

"How do you stop two people booking the same seat?" Name the trade-off out loud: a pessimistic row lock is simplest at low scale; an optimistic version check or a short-lived distributed lock scales better but needs a retry. The wrong answer is to never mention the race.

Isolation levels — the anomaly dial

SQL isolation levels are a contract: each one promises to prevent certain anomalies, in exchange for less concurrency. Higher is safer and slower.

Isolation level	Dirty read	Non-repeatable	Phantom
Read Uncommitted	Possible	Possible	Possible
Read Committed	Prevented	Possible	Possible
Repeatable Read	Prevented	Prevented	Possible*
Serializable	Prevented	Prevented	Prevented

* In the ANSI standard. PostgreSQL's Repeatable Read (snapshot isolation) actually prevents phantoms but allows write skew; true Serializable closes that too.
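The write skew mentioned in the footnote can be sketched with a toy on-call roster: each transaction checks an invariant against its snapshot, then writes a different row. The function names and the roster are hypothetical; this simulates the anomaly in plain Python rather than reproducing any database's behaviour.

```python
def go_off_call(snapshot, doctor):
    """Return the write this transaction would make, based on what it read."""
    on_call = [d for d, active in snapshot.items() if active]
    if len(on_call) >= 2:                 # invariant check: someone else stays on call
        return {doctor: False}
    return {}                             # refuse: would leave nobody on call


def run_snapshot(db):
    """Both transactions read the same snapshot, then write disjoint rows."""
    snap = dict(db)                       # snapshot taken before either write
    write_a = go_off_call(snap, "alice")
    write_b = go_off_call(snap, "bob")
    db.update(write_a)                    # no row is written twice, so a
    db.update(write_b)                    # per-row conflict check sees nothing
    return db


def run_serial(db):
    """The second transaction reads after the first has committed."""
    db.update(go_off_call(dict(db), "alice"))
    db.update(go_off_call(dict(db), "bob"))
    return db


print(run_snapshot({"alice": True, "bob": True}))  # {'alice': False, 'bob': False}
print(run_serial({"alice": True, "bob": True}))    # {'alice': False, 'bob': True}
```

Under the shared snapshot both doctors go off call and the invariant breaks; run one after the other, the second request is refused.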

MVCC — readers and writers stop fighting

Multi-Version Concurrency Control is how modern databases (PostgreSQL, MySQL InnoDB, Oracle) avoid the read-write blocking that naïve locking causes. Instead of overwriting a row, a write creates a new version tagged with the transaction that made it. Under snapshot isolation, each transaction reads the versions that were committed as of its start, which gives it a consistent snapshot. (At Read Committed, PostgreSQL takes a fresh snapshot for each statement instead.)

Snapshot isolation via versions

Txn A starts (sees v1) → Txn B writes v2 → Txn A reads (still v1) → A commits (no block)

Readers never block writers and vice-versa; old versions are garbage-collected once no transaction needs them.
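The version-per-write idea can be sketched with a toy in-memory store. `MVCCStore` and its methods are hypothetical names for illustration, and the sketch simplifies heavily: every write commits immediately and old versions are never garbage-collected.

```python
class MVCCStore:
    """Toy multi-version store: each key keeps a list of (commit_ts, value)."""

    def __init__(self):
        self.versions = {}                # key -> [(commit_ts, value), ...] oldest first
        self.clock = 0                    # logical commit timestamp

    def begin(self):
        """A transaction's snapshot is the clock value at its start."""
        return self.clock

    def write(self, key, value):
        """Commit a new version instead of overwriting the old one."""
        self.clock += 1
        self.versions.setdefault(key, []).append((self.clock, value))

    def read(self, key, snapshot_ts):
        """Return the newest version committed at or before the snapshot."""
        visible = [v for ts, v in self.versions.get(key, []) if ts <= snapshot_ts]
        return visible[-1] if visible else None


store = MVCCStore()
store.write("row", "v1")
txn_a = store.begin()                     # Txn A starts and sees v1
store.write("row", "v2")                  # Txn B commits v2 without waiting for A
print(store.read("row", txn_a))           # v1: A's snapshot is unchanged
print(store.read("row", store.begin()))   # v2: a new transaction sees the latest
```

Neither the read nor the write ever waits on the other; the reader simply filters versions by timestamp.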

The price of non-blocking reads is version bloat (the VACUUM problem in Postgres) and the subtle write-skew anomaly that only true serializable isolation removes.

Distributed locks & fencing tokens

When the contended resource lives across services, you reach for a distributed lock (Redis, ZooKeeper, etcd). These are far trickier than in-process locks, because a holder can pause — a long GC, a network stall — past the lock's expiry, then wake up still believing it holds the lock while someone else has taken it.

The fix is a fencing token: a monotonically increasing number issued with each lock grant. The protected resource records the highest token it has seen and rejects any write carrying a lower one — so a stale holder's writes bounce, even if its lock looks valid.

Fencing token rejects the stale holder

Client 1 (token 33) → pauses (GC) → Client 2 (token 34) → Storage: 33 < 34, reject Client 1
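The token check can be sketched in a few lines. `LockService` and `FencedStorage` are hypothetical classes written for this example; they stand in for a real lock service and a real storage layer, and the starting value of 32 is chosen only so the tokens match the numbers in the figure.

```python
class LockService:
    """Hands out a strictly increasing token with every lock grant."""

    def __init__(self, start=32):
        self.token = start

    def grant(self):
        self.token += 1
        return self.token


class FencedStorage:
    """Remembers the highest token seen and rejects anything older."""

    def __init__(self):
        self.highest_token = 0
        self.value = None

    def write(self, token, value):
        if token < self.highest_token:
            return False                  # stale holder: its lease was superseded
        self.highest_token = token
        self.value = value
        return True


locks = LockService()
storage = FencedStorage()
token_1 = locks.grant()                   # Client 1 gets token 33, then pauses (GC)
token_2 = locks.grant()                   # lease expires; Client 2 gets token 34
print(storage.write(token_2, "from client 2"))  # True
print(storage.write(token_1, "from client 1"))  # False: 33 < 34
print(storage.value)                            # from client 2
```

The important design point is that the check lives in the storage layer, not in the client: the paused client cannot be trusted to notice that its lease has expired.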

→ Key insight

A lock with a timeout is a lease, not a guarantee. Without a fencing token, a lock that can expire can leave two clients both believing they hold it at once. Idempotency (see consensus & coordination) is the other half of staying correct under retries.

RUN IT YOURSELF

Detect a deadlock, then order it away

A deadlock is a cycle in the wait-for graph — nothing more. This simulates two transactions grabbing locks A and B in opposite orders: each ends up holding one and waiting for the other, and a cycle-detector finds the circular wait. Then it reruns them with a single rule — everyone acquires locks in the same global order — and the cycle vanishes: whoever gets the lower lock finishes, the other simply queues behind it. Same locks, same transactions, no deadlock. Change the lock sets or the order.


# A deadlock is a cycle in the wait-for graph (who is waiting on whom).
def has_cycle(wait_for):
    """Return True if the wait-for graph contains a cycle (three-colour DFS)."""
    WHITE, GRAY, BLACK = 0, 1, 2              # unvisited, on the current path, finished
    color = {n: WHITE for n in wait_for}

    def dfs(n):
        color[n] = GRAY
        for m in wait_for.get(n, ()):
            if color.get(m, WHITE) == GRAY:   # back-edge to the current path = cycle
                return True
            if color.get(m, WHITE) == WHITE and dfs(m):
                return True
        color[n] = BLACK
        return False

    return any(color[n] == WHITE and dfs(n) for n in wait_for)


def simulate(txn_locks, order=None):
    """Acquire locks round-robin, one per transaction per pass; return the wait-for graph.

    Simplification: locks are never released, so a blocked transaction stays blocked.
    """
    seq = {t: (sorted(ls, key=order) if order else list(ls)) for t, ls in txn_locks.items()}
    holder = {}                               # lock -> transaction holding it
    wait_for = {t: set() for t in txn_locks}  # transaction -> transactions it waits on
    idx = {t: 0 for t in txn_locks}           # index of the next lock each transaction needs
    progressed = True
    while progressed:
        progressed = False
        for t in txn_locks:
            if idx[t] < len(seq[t]):
                lock = seq[t][idx[t]]
                if holder.get(lock) in (None, t):
                    holder[lock] = t
                    idx[t] += 1
                    progressed = True
                else:
                    wait_for[t].add(holder[lock])
    return wait_for


txns = {"T1": ["A", "B"], "T2": ["B", "A"]}   # opposite acquisition orders
unordered = simulate(txns)
ordered = simulate(txns, order=str)           # everyone sorts its locks: A before B
print("no ordering  :", unordered, "-> deadlock?", has_cycle(unordered))
print("A->B ordering:", ordered, "-> deadlock?", has_cycle(ordered))

See concurrency control in real designs

Ticketmaster: seat locking
Stock Exchange: order matching
Payment System: no double-spend
Google Docs: concurrent edits
Job Scheduler: lock a job once

Frequently asked

Quick answers

Optimistic vs pessimistic locking?

Pessimistic locks before touching data (best for high contention). Optimistic takes no lock, checks a version at commit, and retries on conflict (best for read-heavy, low-contention workloads).

What are the four SQL isolation levels?

Read Uncommitted, Read Committed, Repeatable Read, and Serializable — each preventing more anomalies (dirty reads → non-repeatable reads → phantoms) at the cost of concurrency.

What is MVCC?

Multi-Version Concurrency Control keeps multiple row versions so readers see a consistent snapshot without blocking writers. Each transaction reads the version valid as of its start.

Why are distributed locks dangerous?

A holder can pause past the lock's expiry and resume while another process holds it. A fencing token — a monotonic number checked by downstream storage — rejects the stale writer.
