pgsql: Get rid of artificial restriction on hash table sizes on Windows

From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: pgsql-committers(at)lists(dot)postgresql(dot)org
Subject: pgsql: Get rid of artificial restriction on hash table sizes on Windows
Date: 2021-07-25 18:03:02
Message-ID: E1m7iSs-00040X-B4@gemulon.postgresql.org
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-committers

Get rid of artificial restriction on hash table sizes on Windows.

The point of introducing the hash_mem_multiplier GUC was to let users
reproduce the old behavior of hash aggregation, i.e. that it could use
more than work_mem at need. However, the implementation failed to get
the job done on Win64, where work_mem is clamped to 2GB to protect
various places that calculate memory sizes using "long int". As
written, the same clamp was applied to hash_mem. This resulted in
severe performance regressions for queries requiring a bit more than
2GB for hash aggregation, as they now spill to disk and there's no
way to stop that.

Getting rid of the work_mem restriction seems like a good idea, but
it's a big job and could not conceivably be back-patched. However,
there's only a fairly small number of places that are concerned with
the hash_mem value, and it turns out to be possible to remove the
restriction there without too much code churn or any ABI breaks.
So, let's do that for now to fix the regression, and leave the
larger task for another day.

This patch does introduce a bit more infrastructure that should help
with the larger task, namely pg_bitutils.h support for working with
size_t values.

Per gripe from Laurent Hasson. Back-patch to v13 where the
behavior change came in.

Discussion: https://postgr.es/m/997817.1627074924@sss.pgh.pa.us
Discussion: https://postgr.es/m/MN2PR15MB25601E80A9B6D1BA6F592B1985E39@MN2PR15MB2560.namprd15.prod.outlook.com

Branch
------
REL_13_STABLE

Details
-------
https://git.postgresql.org/pg/commitdiff/2b8f3f5a7c0ede24903782fcffe2553fec01bfbe

Modified Files
--------------
src/backend/executor/execGrouping.c | 6 +-
src/backend/executor/nodeAgg.c | 42 +++++-----
src/backend/executor/nodeHash.c | 144 +++++++++++++++++++--------------
src/backend/optimizer/path/costsize.c | 5 +-
src/backend/optimizer/plan/planner.c | 6 +-
src/backend/optimizer/plan/subselect.c | 3 +-
src/backend/optimizer/prep/prepunion.c | 8 +-
src/backend/optimizer/util/pathnode.c | 3 +-
src/include/miscadmin.h | 1 +
src/include/port/pg_bitutils.h | 50 +++++++++++-
10 files changed, 169 insertions(+), 99 deletions(-)

Browse pgsql-committers by date

  From Date Subject
Next Message Michael Paquier 2021-07-26 02:15:11 pgsql: Fix a couple of memory leaks in src/bin/pg_basebackup/
Previous Message Andres Freund 2021-07-25 03:34:22 pgsql: Deduplicate choice of horizon for a relation procarray.c.