Quick Links

Re: Report reorder buffer size

From:	Bertrand Drouvot <bertranddrouvot(dot)pg(at)gmail(dot)com>
To:	Ashutosh Bapat <ashutosh(dot)bapat(dot)oss(at)gmail(dot)com>
Cc:	pgsql-hackers <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Report reorder buffer size
Date:	2025-08-22 06:13:27
Message-ID:	aKgKh/vqWgBFTh1w@ip-10-97-1-34.eu-west-3.compute.internal
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

Hi Ashutosh,

On Thu, Aug 21, 2025 at 07:26:41PM +0530, Ashutosh Bapat wrote:
> Hi Bertrand,
> Thanks for your response. I am glad that you have found the proposal
> to be generally useful.
>
> On Thu, Aug 14, 2025 at 3:50 PM Bertrand Drouvot
> <bertranddrouvot(dot)pg(at)gmail(dot)com> wrote:
> >
> > > Using these trendlines, a user may decide to keep
> > > logical_decoding_work_mem to minimum and thus let all the large
> > > transactions spill to disk or be streamed.
> >
> > Right. Also, the issue with spill files is that when they are read back from disk
> > then the reorder buffer can consume a lot of memory. That's why I think tracking
> > the xid, sub xid and LSNs could be useful too. That could help diagnose why
> > it was using that memory.
>
> When the replication is stuck or is slow or WAL builds up a detailed
> look at the reorder buffer's contents would be useful. It will need to
> be done outside of pg_stat_replication or pg_stat_replication_slots. I
> think it will be useful to provide a function which will trigger a
> given WAL sender to dump the information about the contents of the
> reorder buffer to the server error log or send them to the requesting
> client.

Maybe we could add such functionalities in contrib/pg_logicalinspect?

> > Maybe we could log 2 metrics: the reorder buffer size before creating the spill
> > files and the reorder buffer size while reading back the spill files? That would
> > help make the distinction when logical_decoding_work_mem is involved (i.e before
> > creating the spill files) and when it is not (reading them back).
> >
> > pg_stat_replication_slots could be used to add some counters too: like the
> > number of times logical_decoding_work_mem has been reached while decoding from
> > that slot.
>
> I think pg_stat_replication_slots::spill_count and
> pg_stat_replication_slots::stream_count give that value.

Yes as long as subtrans are not involved. But if say I've one transaction made of
1500 subtrans, I'd get something like:

slot_name | spill_count | stream_count | total_txns
--------------+-------------+--------------+------------
logical_slot | 1501 | 0 | 1

So we don't know how many times logical_decoding_work_mem has been reached (
except by looking at total_txns).

But as soon as another transaction (that does not involve spill) is decoded:

slot_name | spill_count | stream_count | total_txns
--------------+-------------+--------------+------------
logical_slot | 1501 | 0 | 2

Then we don't know if logical_decoding_work_mem has been reached one or two
times.

> > Also maybe record the max value the reorder buffer reached and
> > the associated xid, number of sub xids, and LSN? That could be useful to diagnose
> > high memory consumption and/or OOM issues due to logical decoding.
>
> A function which dumps the information about reorder buffers can be
> used along with the trendline for this purpose. For example, if a user
> notices WAL accumulation and an upward trend in the metric exposed by
> my patch, they can trigger a dump of reorder buffer contents at that
> time and diagnose the problem and even fix it proactively.

Right. That would work for cases where the issue is occuring right now and
the system is still available (OOM not triggered for example).

> > Also, should we log some of information to the server log?
> >
>
> Information in server error logs is hard to access and analyse,
> especially in cloud environment. Having a SQL construct is better.

That's right as long as the system is still available. I think a mix of log
and SQL API help more use cases: real time, historical and when the system is
back to a usable state (if it was not anymore).

Regards,

--
Bertrand Drouvot
PostgreSQL Contributors Team
RDS Open Source Databases
Amazon Web Services: https://aws.amazon.com

In response to

Re: Report reorder buffer size at 2025-08-21 13:56:41 from Ashutosh Bapat

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Peter Eisentraut	2025-08-22 07:02:22	Re: make VALIDATE domain constraint lock on related relations as ShareUpdateExclusiveLock
Previous Message	Hayato Kuroda (Fujitsu)	2025-08-22 05:50:06	RE: [Patch] add new parameter to pg_replication_origin_session_setup