Quick Links

Re: Reduce timing overhead of EXPLAIN ANALYZE using rdtsc?

From:	Andres Freund <andres(at)anarazel(dot)de>
To:	Lukas Fittl <lukas(at)fittl(dot)com>
Cc:	David Geier <geidav(dot)pg(at)gmail(dot)com>, Pavel Stehule <pavel(dot)stehule(at)gmail(dot)com>, Tomas Vondra <tomas(dot)vondra(at)enterprisedb(dot)com>, vignesh C <vignesh21(at)gmail(dot)com>, Michael Paquier <michael(at)paquier(dot)xyz>, Ibrar Ahmed <ibrar(dot)ahmad(at)gmail(dot)com>, Maciek Sakrejda <m(dot)sakrejda(at)gmail(dot)com>, pgsql-hackers <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Reduce timing overhead of EXPLAIN ANALYZE using rdtsc?
Date:	2025-07-17 00:48:00
Message-ID:	bub2rnqvjrg6k2c6jl7c7spw6pb6gv3kk7mjhsqhtbhtpakkf7@kog6pllgdehz
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

Hi,

On 2025-02-28 23:45:58 -0800, Lukas Fittl wrote:
> From what I can gather, it appears this was an oversight when David first
> reapplied the work on the instr_time changes that were committed.

Heh, glad that that's now fixed. Unfortunately the patch needs an update,
primarily because of the recent pg_test_timing changes.

Applying just patch 2 results in a performance *regression* in pg_test_timing
on my machine, which is due to always hitting the unlikely() path in
INSTR_TIME_GET_NANOSEC() when INSTR_TIME_GET_NANOSEC() is used for an
"absolute" timestamp, rather than a differential timestamp. Which in turn
means hitting a division instruction every time, which on my slightly older
hardware is slower. That may be because my workstation has been up for 40
days, but clearly that can't lead us down to the slow-path

> On a c5.xlarge (Skylake-SP or Cascade Lake) on AWS, with the same test as
> done initially in this thread:
>
> SELECT COUNT(*) FROM lotsarows;
> Time: 974.423 ms
>
> EXPLAIN (ANALYZE, TIMING OFF) SELECT COUNT(*) FROM lotsarows;
> Time: 1336.196 ms (00:01.336)
>
> Without patch:
> EXPLAIN (ANALYZE) SELECT COUNT(*) FROM lotsarows;
> Time: 2165.069 ms (00:02.165)
>
> Per loop time including overhead: 22.15 ns
>
> With patch:
> EXPLAIN (ANALYZE, TIMING ON) SELECT COUNT(*) FROM lotsarows;
> Time: 1654.289 ms (00:01.654)
>
> Per loop time including overhead: 9.81 ns

I still think this would be a rather awesome improvement.

> Open questions I have:
> - Could we rely on checking whether the TSC timesource is invariant (via
> CPUID), instead of relying on Linux choosing it as a clocksource?

I don't see why not?

Greetings,

Andres Freund

In response to

Re: Reduce timing overhead of EXPLAIN ANALYZE using rdtsc? at 2025-03-01 07:45:58 from Lukas Fittl

Responses

Re: Reduce timing overhead of EXPLAIN ANALYZE using rdtsc? at 2025-07-27 19:50:54 from Lukas Fittl

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Michael Paquier	2025-07-17 00:50:12	Re: Fix lwlock.c and wait_event_names.txt discrepancy
Previous Message	Greg Hennessy	2025-07-17 00:44:25	simple patch for discussion