Quick Links

Re: Disk-based hash aggregate's cost model

From:	Jeff Davis <pgsql(at)j-davis(dot)com>
To:	Tomas Vondra <tomas(dot)vondra(at)2ndquadrant(dot)com>
Cc:	Peter Geoghegan <pg(at)bowt(dot)ie>, PostgreSQL Hackers <pgsql-hackers(at)lists(dot)postgresql(dot)org>
Subject:	Re: Disk-based hash aggregate's cost model
Date:	2020-09-07 20:55:28
Message-ID:	b805637251965df210ab4f15f1ac17f8484dd9c8.camel@j-davis.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On Sun, 2020-09-06 at 23:21 +0200, Tomas Vondra wrote:
> I've tested the costing changes on the simplified TPC-H query, on two
> different machines, and it seems like a clear improvement.

Thank you. Committed.

> So yeah, the patched costing is much closer to sort (from the point
> of
> this cost/duration metric), although for higher work_mem values
> there's
> still a clear gap where the hashing seems to be under-costed by a
> factor
> of ~2 or more.

There seems to be a cliff right after 4MB. Perhaps lookup costs on a
larger hash table?

Regards,
Jeff Davis

In response to

Re: Disk-based hash aggregate's cost model at 2020-09-06 21:21:12 from Tomas Vondra

Responses

Re: Disk-based hash aggregate's cost model at 2020-09-08 14:06:58 from Tomas Vondra

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Peter Geoghegan	2020-09-07 21:31:07	Re: v13: show extended stats target in \d
Previous Message	Tom Lane	2020-09-07 20:49:20	Re: Auto-vectorization speeds up multiplication of large-precision numerics