Re: Better default_statistics_target

From:	Simon Riggs <simon(at)2ndquadrant(dot)com>
To:	Chris Browne <cbbrowne(at)acm(dot)org>
Cc:	pgsql-patches(at)postgresql(dot)org
Subject:	Re: Better default_statistics_target
Date:	2007-12-06 19:34:42
Message-ID:	1196969682.4255.455.camel@ebony.site
Lists:	pgsql-hackers pgsql-patches

On Wed, 2007-12-05 at 15:13 -0500, Chris Browne wrote:

> I have the theory (thus far not borne out by any numbers) that it
> might be a useful approach to try to go through the DB schema and use
> what information is there to try to come up with better numbers on a
> per-column basis.

Yes, agreed.

The difficulty is making this work for generic datatypes.

> - Datestamps tend to imply temporal dispersion, ergo "somewhat fewer
> bins." Similar for floats.

Hmmm, not sure about that one. Some date/time columns change very
quickly over time (new rows keep arriving with later values), so the
stats are frequently out of date.
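To illustrate the staleness problem, here is a minimal sketch in Python. It is a simplified model, not PostgreSQL's selectivity code: the helper names (equi_depth_bounds, estimated_fraction_above) and the one-row-per-hour workload are assumptions made up for the example. It builds histogram bounds from the rows present at analyze time and then estimates a "newer than X" predicate after more rows have arrived.

```python
from datetime import datetime, timedelta


def equi_depth_bounds(values, num_bins):
    """Return num_bins + 1 boundaries so each bin holds about the same number of rows."""
    ordered = sorted(values)
    if not ordered or num_bins < 1:
        raise ValueError("need at least one value and one bin")
    last = len(ordered) - 1
    return [ordered[(i * last) // num_bins] for i in range(num_bins + 1)]


def estimated_fraction_above(bounds, cutoff):
    """Estimate the fraction of rows with value > cutoff from histogram bounds.

    Uses linear interpolation inside the bin that contains the cutoff.
    Anything past the last boundary is assumed to match no rows, which is
    exactly where stale statistics go wrong.
    """
    if cutoff >= bounds[-1]:
        return 0.0
    if cutoff < bounds[0]:
        return 1.0
    num_bins = len(bounds) - 1
    for i in range(num_bins):
        lo, hi = bounds[i], bounds[i + 1]
        if lo <= cutoff < hi:
            within = (cutoff - lo) / (hi - lo)  # timedelta / timedelta -> float
            return 1.0 - (i + within) / num_bins
    return 0.0  # not reached for non-decreasing bounds


if __name__ == "__main__":
    start = datetime(2007, 12, 1)
    # Hypothetical workload: one row per hour; stats gathered after 240 rows.
    analyzed = [start + timedelta(hours=h) for h in range(240)]
    bounds = equi_depth_bounds(analyzed, 10)

    # 240 more rows arrive after the stats were gathered.
    all_rows = analyzed + [start + timedelta(hours=h) for h in range(240, 480)]
    cutoff = analyzed[-1]

    stale = estimated_fraction_above(bounds, cutoff)
    actual = sum(1 for t in all_rows if t > cutoff) / len(all_rows)
    print("estimate from stale stats:", stale)
    print("actual fraction:", actual)
```

In this model the number of bins makes no difference to the error: the new rows all fall beyond the last boundary, so only re-gathering the stats helps.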

> Then could come a "second order" perspective, where data would
> actually get sampled from pg_statistics.
>
> - If we look at the number of distinct histogram bins used, for a
> particular column, and find that there are some not used, we might
> drop bins.

The histograms are height-balanced, so all of the bins are always used.
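A small Python sketch of why that is so, under stated assumptions: the function names and the two-cluster sample data are hypothetical, and this is not the code ANALYZE runs. A height-balanced (equi-depth) histogram picks its boundaries from the sorted data so that each bin holds roughly the same number of rows, whereas an equal-width histogram can leave bins empty when the data is clustered.

```python
import bisect


def equi_depth_bounds(values, num_bins):
    """Return num_bins + 1 boundaries so each bin holds about the same number of rows."""
    ordered = sorted(values)
    if not ordered or num_bins < 1:
        raise ValueError("need at least one value and one bin")
    last = len(ordered) - 1
    return [ordered[(i * last) // num_bins] for i in range(num_bins + 1)]


def counts_per_bin(values, bounds):
    """Count rows per bin; bin i covers (bounds[i], bounds[i + 1]], bin 0 also includes bounds[0]."""
    num_bins = len(bounds) - 1
    counts = [0] * num_bins
    for v in values:
        idx = bisect.bisect_left(bounds, v) - 1
        idx = max(0, min(idx, num_bins - 1))  # clamp the two end points
        counts[idx] += 1
    return counts


def equal_width_counts(values, num_bins):
    """Count rows per bin for fixed-width bins spanning min(values)..max(values)."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / num_bins
    counts = [0] * num_bins
    for v in values:
        idx = min(int((v - lo) / width), num_bins - 1)
        counts[idx] += 1
    return counts


if __name__ == "__main__":
    # Hypothetical column with two tight clusters far apart.
    data = list(range(100)) + list(range(10000, 10100))
    bounds = equi_depth_bounds(data, 10)
    print("height-balanced:", counts_per_bin(data, bounds))
    print("equal-width:    ", equal_width_counts(data, 10))
```

With distinct values like these every height-balanced bin is populated, so counting unused bins gives no signal for lowering a column's target. Heavily repeated values can produce duplicate boundaries in this sketch, which is one of the ways it is simpler than the real thing.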

--
Simon Riggs
2ndQuadrant http://www.2ndQuadrant.com

In response to

Re: Better default_statistics_target at 2007-12-05 20:13:48 from Chris Browne
