Re: Reduce timing overhead of EXPLAIN ANALYZE using rdtsc?

From: John Naylor <johncnaylorls(at)gmail(dot)com>
To: Lukas Fittl <lukas(at)fittl(dot)com>
Cc: Andres Freund <andres(at)anarazel(dot)de>, David Geier <geidav(dot)pg(at)gmail(dot)com>, Jakub Wartak <jakub(dot)wartak(at)enterprisedb(dot)com>, Hannu Krosing <hannuk(at)google(dot)com>, Robert Haas <robertmhaas(at)gmail(dot)com>, Pavel Stehule <pavel(dot)stehule(at)gmail(dot)com>, vignesh C <vignesh21(at)gmail(dot)com>, Michael Paquier <michael(at)paquier(dot)xyz>, Ibrar Ahmed <ibrar(dot)ahmad(at)gmail(dot)com>, Maciek Sakrejda <m(dot)sakrejda(at)gmail(dot)com>, pgsql-hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: Reduce timing overhead of EXPLAIN ANALYZE using rdtsc?
Date: 2026-02-25 11:15:49
Message-ID: CANWCAZZfcYMhhdXLUOhT7j=HAkOd688aOFw2Jr6N-OKda+8JUQ@mail.gmail.com
Views: Whole Thread | Raw Message | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On Tue, Feb 24, 2026 at 2:22 PM Lukas Fittl <lukas(at)fittl(dot)com> wrote:
>
> On Mon, Feb 23, 2026 at 6:02 PM John Naylor <johncnaylorls(at)gmail(dot)com> wrote:
> >
> > On Tue, Feb 24, 2026 at 5:28 AM Andres Freund <andres(at)anarazel(dot)de> wrote:
> > > On 2026-02-23 16:24:57 +0100, David Geier wrote:
> > > > The code wasn't compiling properly on Windows because __x86_64__ is not
> > > > defined in Visual C++. I've changed the code to use
> > > >
> > > > #if defined(__x86_64__) || defined(_M_X64)
> > >
> > > Independently of this patchset I wonder if it'd be worth introducing a
> > > PG_ARCH_X64 or such, to avoid this kind of thing.
> >
> > +1
> >
> > I've already borrowed USE_SSE2 for this meaning in commit b9278871f,
> > but that's conflating two different things and I'd actually prefer the
> > above, plus one that includes 32-bit as well.
>
> +1, would be good to have a consistent definition for this, I hadn't
> realized this differs between platforms. John, do you want to take
> care of adding that since you recently added USE_SSE2?

In the attached I tried like this:

/*
* compiler-independent macros for CPU architecture
*/
#if defined(__x86_64__) || defined(_M_X64)
#define PG_ARCH_X64
#elif defined(__i386__) || defined(_M_IX86)
#define PG_ARCH_X32
#elif defined(__aarch64__) || defined(_M_ARM64)
#define PG_ARCH_ARM64
#elif defined(__arm__) || defined(__arm) || defined(_M_ARM)
#define PG_ARCH_ARM32
#endif

and adjusted a couple places to suit. The Arm ones aren't used yet so
I could leave them out for now.

> > is_rdtscp_available() is an easy thing to delegate to my patch, but I
> > agree it would be easier if that was abstracted a bit more so that a
> > different leaf can be passed each time. The latter could also be used
> > to simplify the frequency and hypervisor stuff as well.
>
> Yeah, that makes sense, agreed it'd be nice to centralize the CPU
> architecture specific code that utilizes cpuid/etc.
>
> Looking at your v5/0002 over there, that should work well. As you
> note, is_rdtscp_available is an easy delegation to your logic - I
> think we can probably always fetch the 0x80000001 leaf to check for
> RDTSCP presence in the proposed set_x86_features?

I think that would work. Your pg_cpuid() looks like the abstraction we
need. I think that would also allow my v5/0002 to avoid worrying about
ordering dependencies. (It resets to check leaf 7, but some AVX
features that we don't use need leaf 1)

--
John Naylor
Amazon Web Services

Attachment Content-Type Size
nocfbot-architecture-macros-v1.patch text/x-patch 1.4 KB

In response to

Browse pgsql-hackers by date

  From Date Subject
Next Message Peter Eisentraut 2026-02-25 11:33:58 Re: SQL Property Graph Queries (SQL/PGQ)
Previous Message Zsolt Parragi 2026-02-25 10:54:34 Re: More speedups for tuple deformation