pgsql: Fix remaining race condition with CLOG truncation and LISTEN/NOT

From: Heikki Linnakangas <heikki(dot)linnakangas(at)iki(dot)fi>
To: pgsql-committers(at)lists(dot)postgresql(dot)org
Subject: pgsql: Fix remaining race condition with CLOG truncation and LISTEN/NOT
Date: 2025-11-12 19:16:38
Message-ID: E1vJGKc-006HeL-28@gemulon.postgresql.org
Views: Whole Thread | Raw Message | Download mbox | Resend email
Thread:
Lists: pgsql-committers

Fix remaining race condition with CLOG truncation and LISTEN/NOTIFY

Previous commit fixed a bug where VACUUM would truncate the CLOG
that's still needed to check the commit status of XIDs in the async
notify queue, but as mentioned in the commit message, it wasn't a full
fix. If a backend is executing asyncQueueReadAllNotifications() and
has just made a local copy of an async SLRU page which contains old
XIDs, vacuum can concurrently truncate the CLOG covering those XIDs,
and the backend still gets an error when it calls
TransactionIdDidCommit() on those XIDs in the local copy. This commit
fixes that race condition.

To fix, hold the SLRU bank lock across the TransactionIdDidCommit()
calls in NOTIFY processing.

Per Tom Lane's idea. Backpatch to all supported versions.

Reviewed-by: Joel Jacobson <joel(at)compiler(dot)org>
Reviewed-by: Arseniy Mukhin <arseniy(dot)mukhin(dot)dev(at)gmail(dot)com>
Discussion: https://www.postgresql.org/message-id/2759499.1761756503@sss.pgh.pa.us
Backpatch-through: 14

Branch
------
REL_18_STABLE

Details
-------
https://git.postgresql.org/pg/commitdiff/7b069a1876e46e319690459c578750e8b532520f

Modified Files
--------------
src/backend/commands/async.c | 123 ++++++++++++++++++++++---------------------
1 file changed, 62 insertions(+), 61 deletions(-)

Browse pgsql-committers by date

  From Date Subject
Next Message Heikki Linnakangas 2025-11-12 19:34:09 pgsql: Clear 'xid' in dummy async notify entries written to fill up pag
Previous Message Daniel Gustafsson 2025-11-12 16:06:25 pgsql: doc: Document effects of ownership change on privileges