Re: [HACKERS] path toward faster partition pruning

From: Amit Langote <Langote_Amit_f8(at)lab(dot)ntt(dot)co(dot)jp>
To: Robert Haas <robertmhaas(at)gmail(dot)com>
Cc: Kyotaro HORIGUCHI <horiguchi(dot)kyotaro(at)lab(dot)ntt(dot)co(dot)jp>, Jesper Pedersen <jesper(dot)pedersen(at)redhat(dot)com>, David Rowley <david(dot)rowley(at)2ndquadrant(dot)com>, Amit Langote <amitlangote09(at)gmail(dot)com>, Rajkumar Raghuwanshi <rajkumar(dot)raghuwanshi(at)enterprisedb(dot)com>, Dilip Kumar <dilipbalaut(at)gmail(dot)com>, Beena Emerson <memissemerson(at)gmail(dot)com>, "pgsql-hackers(at)postgresql(dot)org" <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: [HACKERS] path toward faster partition pruning
Date: 2018-02-09 12:36:32
Message-ID: b4d88995-094b-320c-b614-2282fae0bf6c@lab.ntt.co.jp
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On 2018/02/06 18:55, Amit Langote wrote:
>> How fast is this patch these days, compared with the current approach?
>> It would be good to test both when nearly all of the partitions are
>> pruned and when almost none of the partitions are pruned.
>
> I will include some performance numbers in my next email, which hopefully
> should not be later than Friday this week.

Here is the latest set of patches. I can see about 2x speedup in planning
time for various partition counts, although it grows linearly as the
partition count grows (same as with HEAD). Detailed performance figures
follow.

* Partitioned table schema:

H:
create table ht (a int, b int) partition by hash (b);
create table ht_* partition of ht for values with (modulus N, ...)

L:
create table lt (a int, b int) partition by list (b);
create table lt_1 partition of lt for values in (1)
..
create table lt_N partition of lt for values in (N)

R:
create table rt (a int, b int) partition by range (b);
create table rt_1 partition of rt for values from (1) to (<step>)
..
create table rt_N partition of rt for values in (N-1 * <step>) to (N * <step>)

* Queries

Prunes every partition but 1: select * from table_name where b = 1
Prunes none: select * from table_name where b >= 1

* Planning time in milliseconds (average of 5 runs).

On HEAD:

parts H-prune H-noprune L-prune L-noprune R-prune R-noprune
8 1.50 1.42 1.60 1.55 1.77 1.75
16 2.49 2.37 2.32 2.65 3.29 3.07
32 3.96 4.49 3.83 4.14 5.06 5.70
64 8.02 7.51 7.14 7.34 9.37 10.02
128 14.47 14.19 13.31 13.99 18.09 18.86
256 24.76 27.63 25.59 27.87 34.15 37.19
512 50.36 55.92 52.56 54.76 69.34 72.55
1024 102.94 110.59 104.97 110.41 136.89 146.54

Patched:

parts H-prune H-noprune L-prune L-noprune R-prune R-noprune
8 1.49 0.90 0.87 0.74 0.84 1.09
16 2.01 1.50 1.42 1.68 1.42 1.41
32 2.63 2.47 2.08 2.69 2.73 2.81
64 5.62 4.66 4.45 4.96 4.92 5.08
128 11.28 9.65 9.00 9.60 8.68 9.91
256 18.36 18.49 17.11 18.39 17.47 18.43
512 33.88 36.89 34.06 36.52 34.01 37.26
1024 66.40 72.75 66.37 73.40 67.06 67.06

Attached v25 patches.

0001-Modify-bound-comparision-functions-to-accept-mem.patch

This is Ashutosh's patch that he posted on the "advanced partition
matching algorithm for partition-wise join" thread.

0002-Refactor-partition-bound-search-functions.patch

This is similar to 0001. Whereas 0001 modifies just the comparison
functions, this one modifies the partition bound search functions, because
the main pruning patch uses the search functions.

0003-Add-parttypid-partcollation-partsupfunc-to-Parti.patch

This adds some of the fields to PartitionScheme that were needed by the
main pruning patch.

The above 3 patches do what they do, because we'd like the main pruning to
patch to add its functionality by relying on whatever information is made
available in the partitioned table's RelOptInfo.

0004-Faster-partition-pruning.patch

The main patch that adds src/backend/optimizer/util/partprune.c, a module
to provide the functionality that will replace the current approach of
calling relation_excluded_by_constraints() for each partition.

Sorry, but there is still this big TODO here, which I'll try to fix early
next week.

+ * partprune.c
+ * Provides functions to prune partitions of a partitioned table by
+ * comparing provided set of clauses with the table's partitions'
+ * boundaries
+ *
+ * TODO: write a longer description of things in this file

0005-Add-only-unpruned-partitioned-child-rels-to-part.patch

This one teaches the planner to put *only* the un-pruned partitioned child
tables into partitioned_rels list of certain plan nodes.

Thanks,
Amit

[1]
https://www.postgresql.org/message-id/CAFjFpRctst136uN2BvbWLAkS7w278XmKY4_PUB%2Bxk-%2BNezNq8g%40mail.gmail.com

Attachment Content-Type Size
v25-0001-Modify-bound-comparision-functions-to-accept-mem.patch text/plain 6.5 KB
v25-0002-Refactor-partition-bound-search-functions.patch text/plain 8.3 KB
v25-0003-Add-parttypid-partcollation-partsupfunc-to-Parti.patch text/plain 5.3 KB
v25-0004-Faster-partition-pruning.patch text/plain 103.9 KB
v25-0005-Add-only-unpruned-partitioned-child-rels-to-part.patch text/plain 8.1 KB

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Alvaro Herrera 2018-02-09 12:42:18 Re: Proposal: partition pruning by secondary attributes
Previous Message Ashutosh Bapat 2018-02-09 12:34:39 Wait event names mismatch: oldserxid