From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: "Ed L(dot)" <pgsql(at)bluepolka(dot)net>
Cc: pgsql-general(at)postgresql(dot)org
Subject: Re: Why won't it index scan?
Date: 2006-05-18 02:29:14
Message-ID: 7088.1147919354@sss.pgh.pa.us
Lists: pgsql-general
"Ed L." <pgsql(at)bluepolka(dot)net> writes:
> So, does this sound like we just happened to get repeatedly
> horribly unrepresentative random samples with stats target at
> 10? Are we at the mercy of randomness here? Or is there a
> better preventive procedure we can follow to systematically
> identify this kind of situation?
I think the real issue is that stats target 10 is too small for large
tables: the samples are just not large enough to support a decent
numdistinct estimate, which is the critical stat for cases such as this
(ie, estimating the number of hits on a value that's not in the
most-common-values list).
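(For reference, the sampled statistics the planner relies on here can be inspected directly in the `pg_stats` system view; the table and column names below are hypothetical placeholders.)

```sql
-- Inspect the planner's per-column statistics for a hypothetical column.
-- n_distinct is the estimate discussed above; most_common_vals holds the MCV list.
SELECT n_distinct, most_common_vals, most_common_freqs
FROM pg_stats
WHERE schemaname = 'public'
  AND tablename  = 'mytable'   -- hypothetical table name
  AND attname    = 'mycol';    -- hypothetical column name
```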
The reason the default is currently 10 is just conservatism: it was
already an order of magnitude better than what it replaced (a *single*
representative value) and I didn't feel I had the evidence to justify
higher values. It's become clear that the default ought to be higher,
but I've still got no good fix on a more reasonable default. 100 might
be too much, or then again maybe not.
I encourage you to play around with default_statistics_target and see
what you can learn about quality of estimates vs. planning time.
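(One way to run such an experiment, with hypothetical table/column names, is to raise the target for a single column or for the session and compare planner estimates against actual row counts:)

```sql
-- Raise the statistics target for one column only, then re-analyze;
-- SET STATISTICS -1 reverts the column to default_statistics_target.
ALTER TABLE mytable ALTER COLUMN mycol SET STATISTICS 100;  -- hypothetical names
ANALYZE mytable;

-- Or raise the session-wide default before analyzing:
SET default_statistics_target = 100;
ANALYZE mytable;

-- Compare the planner's estimated rows against the actual rows returned:
EXPLAIN ANALYZE SELECT * FROM mytable WHERE mycol = 42;
```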
regards, tom lane