Re: A better way than tweaking NTUP_PER_BUCKET

From: Atri Sharma <atri(dot)jiit(at)gmail(dot)com>
To: Simon Riggs <simon(at)2ndQuadrant(dot)com>
Cc: Stephen Frost <sfrost(at)snowman(dot)net>, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, Bruce Momjian <bruce(at)momjian(dot)us>, Jeff Janes <jeff(dot)janes(at)gmail(dot)com>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: A better way than tweaking NTUP_PER_BUCKET
Date: 2014-01-26 17:17:05
Message-ID: 870DB430-C717-4E3C-9E01-D3E2876B470D@gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

Sent from my iPad

> On 26-Jan-2014, at 4:38, Simon Riggs <simon(at)2ndQuadrant(dot)com> wrote:
>
>> On 25 January 2014 22:33, Stephen Frost <sfrost(at)snowman(dot)net> wrote:
>>
>> * Tom Lane (tgl(at)sss(dot)pgh(dot)pa(dot)us) wrote:
>
>>> AFAICT, there was no consensus in this thread on what to do, which
>>> probably has something to do with the lack of concrete performance
>>> tests presented to back up any particular proposal.
>>
>> This I entirely agree with- more testing and more information on how
>> such a change impacts other workloads would be great. Unfortunately,
>> while I've provided a couple of test cases and seen similar situations
>> on IRC, this is very data-dependent which makes it difficult to have
>> concrete answers for every workload.
>>
>> Still, I'll try and spend some time w/ pg_bench's schema definition and
>> writing up some larger queries to run through it (aiui, the default set
>> of queries won't typically result in a hashjoin) and see what happens
>> there.
>
> The case that action of some kind was needed was clear, for me.
> Hopefully some small improvement can be found from that investigation,
> even if the greatest gain is in some way

I really don't see a way to improve performance in this field except for introducing completely new logic (if we don't hack NTUP_PER_BUCKET).

My previous patch added bloom filters as an additional layer which was checked before the actual hash join lookup, hence reducing the number of hash lookups for negatives.

The size of bloom filters required for any significant performance gains and over the additional overhead added because of the additional bloom filter lookup was quite big, hence it did not make any sense to add those.

Upon my discussion with Steven recently, I thought of redoing the patch based on some new tests but I haven't been really able to work on that.

Regards,

Atri

In response to

Browse pgsql-hackers by date

  From Date Subject
Next Message Stephen Frost 2014-01-26 17:20:34 Re: Tablespace options in \db+
Previous Message Magnus Hagander 2014-01-26 17:13:21 Tablespace options in \db+