Quick Links

Re: Optimizing nbtree ScalarArrayOp execution, allowing multi-column ordered scans, skip scan

From:	Tomas Vondra <tomas(dot)vondra(at)enterprisedb(dot)com>
To:	Peter Geoghegan <pg(at)bowt(dot)ie>, Matthias van de Meent <boekewurm+postgres(at)gmail(dot)com>
Cc:	PostgreSQL Hackers <pgsql-hackers(at)lists(dot)postgresql(dot)org>, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, Jeff Davis <pgsql(at)j-davis(dot)com>, benoit <benoit(at)hopsandfork(dot)com>, Alexander Korotkov <aekorotkov(at)gmail(dot)com>
Subject:	Re: Optimizing nbtree ScalarArrayOp execution, allowing multi-column ordered scans, skip scan
Date:	2023-11-28 12:29:27
Message-ID:	9c35e090-2ee3-286d-fed0-7dd380bc0939@enterprisedb.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On 11/21/23 03:52, Peter Geoghegan wrote:
> On Sat, Nov 11, 2023 at 1:08 PM Matthias van de Meent
> <boekewurm+postgres(at)gmail(dot)com> wrote:
>> Thanks. Here's my review of the btree-related code:
>
> Attached is v7.
>

I haven't looked at the code, but I decided to do a bit of blackbox perf
and stress testing, to get some feeling of what to expect in terms of
performance improvements, and see if there happen to be some unexpected
regressions. Attached is a couple simple bash scripts doing a
brute-force test with tables of different size / data distribution,
number of values in the SAOP expression, etc.

And a PDF visualizing the comparing the results between master and build
with the patch applied. First group of columns is master, then patched,
and then (patched/master) comparison, with green=faster, red=slower. The
columns are for different number of values in the SAOP condition.

Overall, the results look pretty good, with consistent speedups of up to
~30% for large number of values (SAOP with 1000 elements). There's a
couple blips where the performance regresses, also by up to ~30%. It's
too regular to be a random variation (it affects different combinations
of parameters, tablesizes), but it seems to only affect one of the two
machines I used for testing. Interestingly enough, it's the newer one.

I'm not convinced this is a problem we have to solve. It's possible it
only affects cases that are implausible in practice (the script forces a
particular scan type, and maybe it would not be picked in practice). But
maybe it's fixable ...

regards

--
Tomas Vondra
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

Attachment	Content-Type	Size
saop-benchmark.pdf	application/pdf	245.4 KB
run-all.sh	application/x-shellscript	470 bytes
run.sh	application/x-shellscript	3.5 KB

In response to

Re: Optimizing nbtree ScalarArrayOp execution, allowing multi-column ordered scans, skip scan at 2023-11-21 02:52:32 from Peter Geoghegan

Responses

Re: Optimizing nbtree ScalarArrayOp execution, allowing multi-column ordered scans, skip scan at 2023-11-28 17:19:21 from Peter Geoghegan

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Masahiko Sawada	2023-11-28 12:32:19	Re: [PoC] pg_upgrade: allow to upgrade publisher node
Previous Message	Heikki Linnakangas	2023-11-28 12:19:34	Re: Fix a wrong comment in setrefs.c