Re: Why is WAL-file based replica stuck for long time (hour?) in IPC:RecoveryConflictSnapshot ?

From: Thom Brown <thom(at)linux(dot)com>
To: depesz(at)depesz(dot)com
Cc: pgsql-general mailing list <pgsql-general(at)postgresql(dot)org>
Subject: Re: Why is WAL-file based replica stuck for long time (hour?) in IPC:RecoveryConflictSnapshot ?
Date: 2026-06-18 13:07:36
Message-ID: CAA-aLv4UZ9bOtEDbdeCa1KMmdgCxzaT-YQnEN=ZZ7LUQTp+GJw@mail.gmail.com
Views: Whole Thread | Raw Message | Download mbox | Resend email
Thread:
Lists: pgsql-general

On Thu, 18 Jun 2026 at 12:57, hubert depesz lubaczewski
<depesz(at)depesz(dot)com> wrote:
>
> Hi,
> we have set of clusters that run on pg14 (yes, I know, we are in process
> of upgrading but it's complicated).
>
> Recently we noticed that some of the DR nodes are lagging because they
> get stuck on some wal files, and starup process, in pg_stat_activity
> shows "IPC:RecoveryConflictSnapshot" as wait event.
>
> The thing is that there are no othger connections doing anything to the
> db. There are some idle monitor ones, running things like count(*) from
> pg_stat_activity, every now and then, but we're talking about pg being
> stuck on single wal up to an hour or so.
>
> Stuck as in:
> ps shows:
> postgres: 14/main: startup recovering 00000003000065EA000000C6 waiting
>
> What could it be, how to fix it?

Have you got anything in the logs that say "conflict with recovery"?

Also, I know you are just running count(*), but is there a transaction
being left open?

SELECT pid, state, xact_start, query
FROM pg_stat_activity
ORDER BY xact_start NULLS LAST;

Any 'idle in transaction' for too long?

What is max_standby_streaming_delay and hot_standby_feedback set to?

Thom

In response to

Responses

Browse pgsql-general by date

  From Date Subject
Next Message hubert depesz lubaczewski 2026-06-18 13:33:56 Re: Why is WAL-file based replica stuck for long time (hour?) in IPC:RecoveryConflictSnapshot ?
Previous Message hubert depesz lubaczewski 2026-06-18 11:57:04 Why is WAL-file based replica stuck for long time (hour?) in IPC:RecoveryConflictSnapshot ?