Quick Links

Re: Does "correlation" mislead the optimizer on large tables?

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Ron Mayer <ron(at)intervideo(dot)com>
Cc:	pgsql-performance(at)postgresql(dot)org
Subject:	Re: Does "correlation" mislead the optimizer on large tables?
Date:	2003-01-24 06:48:19
Message-ID:	11025.1043390899@sss.pgh.pa.us
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-performance

Ron Mayer <ron(at)intervideo(dot)com> writes:
> On a large tables, I think the "correlation" pg_stats field as calculated
> by "vacuum analyze" or "analyze" can mislead the optimizer.

If you look in the pghackers archives, you will find some discussion
about changing the equation that cost_index() uses to estimate the
impact of correlation on indexscan cost. The existing equation is
ad-hoc and surely wrong, but so far no one's proposed a replacement
that can be justified any better. If you've got such a replacement
then we're all ears...

> In particular, if I have a large table t with columns 'a','b','c', etc,
> and I cluster the table as follows:
> create table t_ordered as select * from t order by a,b;
> vacuum analyze t_ordered;
> Column "b" will (correctly) get a very low "correlation" in
> the pg_stats table -- but I think the optimizer would do better
> assuming a high correlation because similar 'b' values are still
> grouped closely on the same disk pages.

How would that be? They'll be separated by the stride of 'a'.

It seems likely to me that a one-dimensional correlation statistic may
be inadequate, but I haven't seen any proposals for better stats.

regards, tom lane

In response to

Does "correlation" mislead the optimizer on large tables? at 2003-01-24 04:16:09 from Ron Mayer

Responses

Re: Does "correlation" mislead the optimizer on large at 2003-01-24 16:27:03 from Stephan Szabo
Re: Does "correlation" mislead the optimizer on large at 2003-01-24 19:36:50 from Ron Mayer

Browse pgsql-performance by date

	From	Date	Subject
Next Message	Andrew Sullivan	2003-01-24 13:22:19	Re: Crash Recovery
Previous Message	Tom Lane	2003-01-24 06:29:35	Re: Crash Recovery