Quick Links

Re: WIP: bloom filter in Hash Joins with batches

From:	"Shulgin, Oleksandr" <oleksandr(dot)shulgin(at)zalando(dot)de>
To:	Tomas Vondra <tomas(dot)vondra(at)2ndquadrant(dot)com>
Cc:	PostgreSQL Hackers <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: WIP: bloom filter in Hash Joins with batches
Date:	2015-12-17 09:50:46
Message-ID:	CACACo5Q0z9vQE8pHc+b6rzHr_TOTNg4=nc4eiQodtD1poOBCCQ@mail.gmail.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On Tue, Dec 15, 2015 at 11:30 PM, Tomas Vondra <tomas(dot)vondra(at)2ndquadrant(dot)com
> wrote:

>
> Attached is a spreadsheet with results for various work_mem values, and
> also with a smaller data set (just 30M rows in the fact table), which
> easily fits into memory. Yet it shows similar gains, shaving off ~40% in
> the best case, suggesting that this is not just thanks to reduction of I/O
> when forcing the temp files to disk.

A neat idea! Have you possibly tried to also collect statistics about
actual false-positive rates and filter allocation sizes in every of the
collected data points?

--
Alex

In response to

WIP: bloom filter in Hash Joins with batches at 2015-12-15 22:30:06 from Tomas Vondra

Responses

Re: WIP: bloom filter in Hash Joins with batches at 2015-12-17 15:34:47 from Tomas Vondra

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Simon Riggs	2015-12-17 10:44:56	Re: WIP: bloom filter in Hash Joins with batches
Previous Message	Shulgin, Oleksandr	2015-12-17 09:45:32	Re: On-demand running query plans using auto_explain and signals