| From: | Amit Langote <Langote_Amit_f8(at)lab(dot)ntt(dot)co(dot)jp> | 
|---|---|
| To: | Robert Haas <robertmhaas(at)gmail(dot)com> | 
| Cc: | Kyotaro HORIGUCHI <horiguchi(dot)kyotaro(at)lab(dot)ntt(dot)co(dot)jp>, Jesper Pedersen <jesper(dot)pedersen(at)redhat(dot)com>, David Rowley <david(dot)rowley(at)2ndquadrant(dot)com>, Amit Langote <amitlangote09(at)gmail(dot)com>, Rajkumar Raghuwanshi <rajkumar(dot)raghuwanshi(at)enterprisedb(dot)com>, Dilip Kumar <dilipbalaut(at)gmail(dot)com>, Beena Emerson <memissemerson(at)gmail(dot)com>, "pgsql-hackers(at)postgresql(dot)org" <pgsql-hackers(at)postgresql(dot)org> | 
| Subject: | Re: [HACKERS] path toward faster partition pruning | 
| Date: | 2018-02-09 12:36:32 | 
| Message-ID: | b4d88995-094b-320c-b614-2282fae0bf6c@lab.ntt.co.jp | 
| Views: | Whole Thread | Raw Message | Download mbox | Resend email | 
| Thread: | |
| Lists: | pgsql-hackers | 
On 2018/02/06 18:55, Amit Langote wrote:
>> How fast is this patch these days, compared with the current approach?
>>  It would be good to test both when nearly all of the partitions are
>> pruned and when almost none of the partitions are pruned.
> 
> I will include some performance numbers in my next email, which hopefully
> should not be later than Friday this week.
Here is the latest set of patches.  I can see about 2x speedup in planning
time for various partition counts, although it grows linearly as the
partition count grows (same as with HEAD).  Detailed performance figures
follow.
* Partitioned table schema:
H:
create table ht (a int, b int) partition by hash (b);
create table ht_* partition of ht for values with (modulus N, ...)
L:
create table lt (a int, b int) partition by list (b);
create table lt_1 partition of lt for values in (1)
..
create table lt_N partition of lt for values in (N)
R:
create table rt (a int, b int) partition by range (b);
create table rt_1 partition of rt for values from (1) to (<step>)
..
create table rt_N partition of rt for values in (N-1 * <step>) to (N * <step>)
* Queries
Prunes every partition but 1: select * from table_name where b = 1
Prunes none: select * from table_name where b >= 1
* Planning time in milliseconds (average of 5 runs).
On HEAD:
parts H-prune   H-noprune    L-prune   L-noprune   R-prune    R-noprune
   8     1.50        1.42       1.60        1.55      1.77         1.75
  16     2.49        2.37       2.32        2.65      3.29         3.07
  32     3.96        4.49       3.83        4.14      5.06         5.70
  64     8.02        7.51       7.14        7.34      9.37        10.02
 128    14.47       14.19      13.31       13.99     18.09        18.86
 256    24.76       27.63      25.59       27.87     34.15        37.19
 512    50.36       55.92      52.56       54.76     69.34        72.55
1024   102.94      110.59     104.97      110.41    136.89       146.54
Patched:
parts H-prune    H-noprune   L-prune   L-noprune   R-prune   R-noprune
   8     1.49         0.90      0.87        0.74      0.84        1.09
  16     2.01         1.50      1.42        1.68      1.42        1.41
  32     2.63         2.47      2.08        2.69      2.73        2.81
  64     5.62         4.66      4.45        4.96      4.92        5.08
 128    11.28         9.65      9.00        9.60      8.68        9.91
 256    18.36        18.49     17.11       18.39     17.47       18.43
 512    33.88        36.89     34.06       36.52     34.01       37.26
1024    66.40        72.75     66.37       73.40     67.06       67.06
Attached v25 patches.
0001-Modify-bound-comparision-functions-to-accept-mem.patch
This is Ashutosh's patch that he posted on the "advanced partition
matching algorithm for partition-wise join" thread.
0002-Refactor-partition-bound-search-functions.patch
This is similar to 0001.  Whereas 0001 modifies just the comparison
functions, this one modifies the partition bound search functions, because
the main pruning patch uses the search functions.
0003-Add-parttypid-partcollation-partsupfunc-to-Parti.patch
This adds some of the fields to PartitionScheme that were needed by the
main pruning patch.
The above 3 patches do what they do, because we'd like the main pruning to
patch to add its functionality by relying on whatever information is made
available in the partitioned table's RelOptInfo.
0004-Faster-partition-pruning.patch
The main patch that adds src/backend/optimizer/util/partprune.c, a module
to provide the functionality that will replace the current approach of
calling relation_excluded_by_constraints() for each partition.
Sorry, but there is still this big TODO here, which I'll try to fix early
next week.
+ * partprune.c
+ *      Provides functions to prune partitions of a partitioned table by
+ *      comparing provided set of clauses with the table's partitions'
+ *      boundaries
+ *
+ * TODO: write a longer description of things in this file
0005-Add-only-unpruned-partitioned-child-rels-to-part.patch
This one teaches the planner to put *only* the un-pruned partitioned child
tables into partitioned_rels list of certain plan nodes.
Thanks,
Amit
| Attachment | Content-Type | Size | 
|---|---|---|
| v25-0001-Modify-bound-comparision-functions-to-accept-mem.patch | text/plain | 6.5 KB | 
| v25-0002-Refactor-partition-bound-search-functions.patch | text/plain | 8.3 KB | 
| v25-0003-Add-parttypid-partcollation-partsupfunc-to-Parti.patch | text/plain | 5.3 KB | 
| v25-0004-Faster-partition-pruning.patch | text/plain | 103.9 KB | 
| v25-0005-Add-only-unpruned-partitioned-child-rels-to-part.patch | text/plain | 8.1 KB | 
| From | Date | Subject | |
|---|---|---|---|
| Next Message | Alvaro Herrera | 2018-02-09 12:42:18 | Re: Proposal: partition pruning by secondary attributes | 
| Previous Message | Ashutosh Bapat | 2018-02-09 12:34:39 | Wait event names mismatch: oldserxid |