Postgres lock chains: finding the session that's blocking yours
A practical guide to pg_locks, pg_blocking_pids, and the recursive CTE that gives you the full chain, including the AccessExclusiveLocks that quietly bring your database to a halt.
Most Postgres outages don’t come from slow queries. They come from queries waiting on locks. Reproducing this on a developer’s laptop is hard — the chain forms and dissolves in seconds. Catching it in production needs a specific set of queries you should have memorized. This post is that set.
What a lock chain is
Two sessions: A is holding lock X, B is waiting for X. That’s a chain of length 1. Add C waiting on B (because B holds lock Y that C wants), and you have length 2. In bad incidents the chain stretches to 50+ — and then connection limits hit, and your app falls over with confusing errors that look like a DB outage but are really one stuck transaction.
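A chain like this is easy to build by hand on a scratch database. A minimal sketch (table name t and the row id are placeholders) — run each block in its own psql session:

```sql
-- session A: take a row lock and keep the transaction open
BEGIN;
UPDATE t SET v = v + 1 WHERE id = 1;
-- ...and do nothing else. A now holds the row lock.

-- session B: blocks on A's row lock
BEGIN;
UPDATE t SET v = v + 1 WHERE id = 1;

-- session C: queues behind B on the same tuple.
-- pg_blocking_pids(C) reports B, and pg_blocking_pids(B) reports A:
-- a chain of length 2.
BEGIN;
UPDATE t SET v = v + 1 WHERE id = 1;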
The two views you need
- pg_locks — every lock currently held or wanted, by every session
- pg_stat_activity — every session: what it's running, when it started, what it's waiting on
Lock chains live in pg_locks. Context (which query, which user, since when) lives in pg_stat_activity. You always join the two.
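A minimal version of that join, showing every ungranted lock together with the query that wants it (pg_locks carries the lock, pg_stat_activity carries the context):

```sql
-- who is waiting, and on what
SELECT a.pid,
       a.state,
       l.locktype,
       l.mode,
       l.relation::regclass AS relation,  -- NULL for transactionid locks
       a.query
FROM pg_locks l
JOIN pg_stat_activity a USING (pid)
WHERE NOT l.granted;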
pg_blocking_pids — the easy mode
Postgres 9.6+ has a function that does the work for you. It returns the array of PIDs blocking a given PID:
SELECT pid,
       pg_blocking_pids(pid) AS blocked_by,
       state, wait_event_type, wait_event, query
FROM pg_stat_activity
WHERE cardinality(pg_blocking_pids(pid)) > 0
ORDER BY query_start;
This gives you length-1 visibility — every session, who is blocking it. Good enough for most incidents. For the full chain, recurse.
The recursive CTE for the full chain
This walks the graph from any blocked session up to the root holder.
WITH RECURSIVE chain AS (
-- seed: every blocked session
SELECT pid,
pg_blocking_pids(pid) AS blocked_by,
1 AS depth,
ARRAY[pid] AS path
FROM pg_stat_activity
WHERE cardinality(pg_blocking_pids(pid)) > 0
UNION ALL
-- recurse: follow the blockers
SELECT a.pid,
pg_blocking_pids(a.pid) AS blocked_by,
c.depth + 1,
c.path || a.pid
FROM chain c
JOIN unnest(c.blocked_by) AS bp(pid) ON TRUE
JOIN pg_stat_activity a ON a.pid = bp.pid
WHERE NOT a.pid = ANY(c.path) -- prevent infinite loops on cycles
)
SELECT depth, path, a.state, a.wait_event_type,
a.wait_event, age(now(), a.query_start) AS age,
substring(a.query for 100) AS q
FROM chain c
JOIN pg_stat_activity a USING (pid)
ORDER BY depth DESC, age DESC;

The session at the bottom of the chain (highest depth) is your culprit. If its state is 'idle in transaction' with a long age, you’ve found a stuck transaction.
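Once you have the root PID, you can intervene from SQL. A sketch (12345 is a placeholder for the blocker's pid):

```sql
-- gentle option: cancel the blocker's current statement
SELECT pg_cancel_backend(12345);

-- if the blocker is 'idle in transaction' there is no running statement
-- to cancel; terminating the backend rolls back its open transaction
SELECT pg_terminate_backend(12345);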
Reading wait_event_type and wait_event
| wait_event_type | Common wait_event values | What it usually means |
|---|---|---|
| Lock | transactionid, relation, tuple | Waiting on another session to commit/release |
| LWLock | BufferContent, ProcArrayLock, WALInsert | Internal lightweight lock — usually a sign of true contention or a hot page |
| IO | DataFileRead, WALSync, BufFileRead | Disk-level wait, not lock-level |
| Client | ClientRead, ClientWrite | Waiting on the client app — connection idle or slow consumer |
| IPC | MessageQueueSend, ParallelFinish | Internal coordination — usually parallel workers |
For lock-chain triage, focus on wait_event_type = 'Lock'. Other types matter, but they’re different problem classes.
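A quick census query helps you decide which problem class you are in before digging further — one possible sketch:

```sql
-- how many active sessions are in each wait class right now
SELECT wait_event_type, wait_event, count(*)
FROM pg_stat_activity
WHERE state <> 'idle'
GROUP BY wait_event_type, wait_event
ORDER BY count(*) DESC;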
AccessExclusiveLock — the silent killer
AccessExclusiveLock blocks everything, including read queries. Operations that quietly take this lock:
- ALTER TABLE in most variants (rewrite, type change, adding NOT NULL)
- VACUUM FULL and CLUSTER
- DROP TABLE / DROP INDEX
- REINDEX without CONCURRENTLY
- TRUNCATE (always)

(Plain CREATE INDEX without CONCURRENTLY takes a SHARE lock, not AccessExclusive — it blocks writes, but not reads.)
Find them with:
SELECT pid, mode, granted, relation::regclass, query
FROM pg_locks l
JOIN pg_stat_activity a USING (pid)
WHERE mode = 'AccessExclusiveLock'
ORDER BY granted, query_start;
Anything ungranted is waiting; anything granted is the active blocker.
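The standard mitigation is to run DDL with a lock_timeout, so it gives up quickly instead of queueing all traffic behind it. A sketch (the table and column names are placeholders):

```sql
-- abort the DDL if the lock isn't granted within 2 seconds,
-- rather than letting every reader pile up behind it
SET lock_timeout = '2s';
ALTER TABLE orders ADD COLUMN note text;
RESET lock_timeout;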
Automating capture for postmortems
Lock chains dissolve before you finish reading the runbook. The only reliable way to debug them is to capture pg_stat_activity + pg_locks at 1 Hz, persistently, so you can scrub backward to the moment.
That’s exactly what Obsfly Query Activity and Deadlock Detection do — sample at 1 Hz, store in ClickHouse for 30 days, render the lock graph at any point in time. If you’d rather build it yourself, the recipe is: cron a snapshot every second to a side table, plus a ring-buffer to bound storage.
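A minimal version of the do-it-yourself recipe, assuming a side table in the same database (names and the 30-day retention are illustrative):

```sql
-- side table with the same columns as pg_stat_activity, plus a timestamp
CREATE TABLE activity_snapshots AS
SELECT now() AS captured_at, *
FROM pg_stat_activity
WITH NO DATA;

-- run every second from your collector (plain cron only goes to 1/min,
-- so use a loop, a systemd timer, or pg_cron-driven batching)
INSERT INTO activity_snapshots
SELECT now(), * FROM pg_stat_activity;

-- bound storage: prune old samples periodically
DELETE FROM activity_snapshots
WHERE captured_at < now() - interval '30 days';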
FAQ
- Why does pg_blocking_pids return a list and not a single PID?
- What's the difference between a lock chain and a deadlock?
- Can I terminate a session safely?
- How do I prevent lock chains in the first place?