Re: tweaking NTUP_PER_BUCKET

From: Robert Haas <robertmhaas(at)gmail(dot)com>
To: Tomas Vondra <tv(at)fuzzy(dot)cz>
Cc: "pgsql-hackers(at)postgresql(dot)org" <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: tweaking NTUP_PER_BUCKET
Date: 2014-07-09 14:07:45
Message-ID: CA+TgmoZRWuAxAKzodDEjRUJyrcFVGhuE9_LAvexzx8ffWrUJwA@mail.gmail.com
Lists: pgsql-hackers

On Tue, Jul 8, 2014 at 5:16 PM, Tomas Vondra <tv(at)fuzzy(dot)cz> wrote:
> Thinking about this a bit more, do we really need to build the hash
> table on the first pass? Why not do this:
>
> (1) batching
> - read the tuples, stuff them into a simple list
> - don't build the hash table yet
>
> (2) building the hash table
> - we have all the tuples in a simple list, batching is done
> - we know exact row count, can size the table properly
> - build the table

We could do this, and in fact we could save quite a bit of memory if
we allocated, say, 1MB chunks and packed the tuples in tightly instead
of palloc-ing each one separately. But I worry that rescanning the
data to build the hash table would slow things down too much.
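
To make the chunk idea concrete, here's a rough sketch (not actual
executor code; choose_nbuckets and insert_into_hash are made-up
stand-ins for whatever the real hooks would be):

#include <stddef.h>
#include <stdlib.h>
#include <string.h>

#define CHUNK_SIZE (1024 * 1024)      /* 1MB chunks, as suggested above */

/* hypothetical hooks into the rest of the hash join code: */
extern size_t choose_nbuckets(size_t ntuples);
extern void insert_into_hash(size_t nbuckets, const void *tuple, size_t len);

typedef struct Chunk
{
    struct Chunk *next;               /* chunks form a singly linked list */
    size_t        used;               /* bytes consumed in data[] */
    char          data[CHUNK_SIZE];   /* tuples packed back to back */
} Chunk;

typedef struct TupleStore
{
    Chunk  *head;                     /* most recently allocated chunk */
    size_t  ntuples;                  /* exact row count, known after pass 1 */
} TupleStore;

/*
 * Pass 1: append one tuple, prefixed with its length so that pass 2 can
 * walk the chunk.  Starts a new chunk when the current one is full.
 * Assumes each tuple fits within a single chunk.
 */
static void
store_tuple(TupleStore *ts, const void *tuple, size_t len)
{
    size_t  need = sizeof(size_t) + len;
    Chunk  *c = ts->head;

    if (c == NULL || c->used + need > CHUNK_SIZE)
    {
        c = malloc(sizeof(Chunk));
        c->next = ts->head;
        c->used = 0;
        ts->head = c;
    }

    memcpy(c->data + c->used, &len, sizeof(len));
    memcpy(c->data + c->used + sizeof(len), tuple, len);
    c->used += need;
    ts->ntuples++;
}

/*
 * Pass 2: the row count is now exact, so the bucket array can be sized
 * once, correctly, before any tuple is inserted.
 */
static void
build_hash_table(TupleStore *ts)
{
    size_t  nbuckets = choose_nbuckets(ts->ntuples);

    for (Chunk *c = ts->head; c != NULL; c = c->next)
    {
        size_t  off = 0;

        while (off < c->used)
        {
            size_t  len;

            memcpy(&len, c->data + off, sizeof(len));
            insert_into_hash(nbuckets, c->data + off + sizeof(len), len);
            off += sizeof(len) + len;
        }
    }
}

The second pass is exactly the rescan I'm worried about, though: every
tuple gets touched twice, once to copy it in and once to hash it.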

> Also, maybe we could use a regular linear hash table [1], instead of
> using the current implementation with NTUP_PER_BUCKET=1. (Although,
> that'd be absolutely awful with duplicates.)

Linear probing is pretty awful unless your load factor is << 1. You'd
probably want NTUP_PER_BUCKET=0.25, or something like that, which
would eat up a lot of memory.
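
To illustrate what I mean, here's a toy insert routine for a
linear-probing table (invented names, nothing to do with the current
nodeHash.c code):

#include <stdint.h>

typedef struct
{
    uint32_t  hash;       /* stashed hash of the join key */
    void     *tuple;      /* NULL means the slot is empty */
} Slot;

typedef struct
{
    Slot     *slots;      /* array of nslots entries, zero-initialized */
    uint32_t  mask;       /* nslots - 1, with nslots a power of two */
} LinearTable;

/*
 * Insert scans forward from the hash position to the first empty slot,
 * wrapping around via the mask.  Every occupied slot passed here is an
 * extra probe for some future lookup, which is why clusters get so
 * expensive as the table fills up -- and why duplicate keys, which all
 * land in the same run, make it worse still.
 */
static void
lt_insert(LinearTable *t, uint32_t hash, void *tuple)
{
    uint32_t  i = hash & t->mask;

    while (t->slots[i].tuple != NULL)
        i = (i + 1) & t->mask;

    t->slots[i].hash = hash;
    t->slots[i].tuple = tuple;
}

If I'm remembering Knuth's analysis right, a successful lookup costs
about (1 + 1/(1-a))/2 probes at load factor a -- roughly 1.2 probes at
a = 0.25 but 5.5 at a = 0.9 -- and a miss at a = 0.9 averages around 50
probes. Hence the memory cost: four slots per tuple with the 16-byte
slots above is 64 bytes of array per row before any tuple data at all.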

--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company
