Quick Links

Re: Flush some statistics within running transactions

From:	Bertrand Drouvot <bertranddrouvot(dot)pg(at)gmail(dot)com>
To:	Sami Imseih <samimseih(at)gmail(dot)com>
Cc:	pgsql-hackers(at)lists(dot)postgresql(dot)org
Subject:	Re: Flush some statistics within running transactions
Date:	2026-01-15 12:18:13
Message-ID:	aWjbBd5z+LJNWX1l@ip-10-97-1-34.eu-west-3.compute.internal
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

Hi,

On Wed, Jan 14, 2026 at 09:54:17PM -0600, Sami Imseih wrote:
> I took a quick look at the patches and I have some general comments.

Thanks!

>
> > Long running transactions can accumulate significant statistics (WAL, IO, ...)
> > that remain unflushed until the transaction ends. This delays visibility of
> > resource usage in monitoring views like pg_stat_io and pg_stat_wal.
>
> +1. I do think this is a good idea. Long-running transactions cause accumulated
> stats to appear as spikes in monitoring tools rather than as gradual activity.
> This would help level out, though not eliminate, those artificial spikes.

Yeah.

> > The 1 second flush interval is currently hardcoded but we could imagine increase
> > it or make it configurable.
>
> Someone may want to turn this off as well. I think a GUC will be needed.

I gave this more thoughts and I wonder if this should be configurable at all.
I mean, we don't do it for PGSTAT_MIN_INTERVAL, PGSTAT_MAX_INTERVAL and
PGSTAT_IDLE_INTERVAL. We could imagine make it configurable if it produces
noticeable performance impact but that's not what I observed.

> > RELATION stats are making use of FLUSH_MIXED:
>
> > stats: numscans, tuples_returned, tuples_fetched, blocks_fetched,
> > blocks_hit
>
> I’m concerned that fields being temporarily out of sync might impact monitoring
> calculations, if the formula is dealing with fields that have
> different flush strategies.

That's a good point. Maybe we should document the fields flush strategy?

> That said, minor discrepancies are usually tolerable for monitoring
> data analysis.
>
> For the numscans, should we not also update the scan timestamp?

The problem is that we could not call GetCurrentTransactionStopTimestamp(), so
we would need to call GetCurrentTimestamp() instead. I'm not sure that calling
GetCurrentTimestamp() every second would be a real issue though, and if it is
maybe we could increase this 1s value.

That said I agree that having seq_scan being updated and not last_seq_scan is not
that great.

Maybe we should keep this in mind and see what to do depending where this thread
is going (I mean if the current proposed design has to be changed).

Regards,

--
Bertrand Drouvot
PostgreSQL Contributors Team
RDS Open Source Databases
Amazon Web Services: https://aws.amazon.com

In response to

Re: Flush some statistics within running transactions at 2026-01-15 03:54:17 from Sami Imseih

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Jakub Wartak	2026-01-15 12:28:30	Re: pg_plan_advice
Previous Message	Tatsuo Ishii	2026-01-15 12:04:57	Re: Row pattern recognition