Quick Links

Re: index skipped in favor of seq scan.

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	ryan(dot)a(dot)roemmich(at)mail(dot)sprint(dot)com
Cc:	pgsql-general(at)postgresql(dot)org
Subject:	Re: index skipped in favor of seq scan.
Date:	2001-07-10 15:55:54
Message-ID:	29213.994780554@sss.pgh.pa.us
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-general

ryan(dot)a(dot)roemmich(at)mail(dot)sprint(dot)com writes:
> On the other end of the spectrum there are many addresses with only one
> entry. When I use one of these addresses in the WHERE clause it takes
> just as long as the address with 150k rows. If the sequential scan is
> better for 150k rows out of 800k rows, what about 1 out of 800k? It
> seems that when my table grew to this size the index was no longer used.

The problem is that the 150k-duplicates value is dominating the
planner's rather inadequate statistics, and causing it to believe that
the table contains only a few values that all occur many times. If that
were the true scenario then the use of seq scan would be the correct
choice.

This is fixed (I hope) for 7.2, but there's not much to be done about
it in current releases, unless you can avoid storing the 150k-duplicates
value. Is that a real value, or just a dummy? If you could replace it
with NULL then the right things would happen, because the statistics do
already distinguish NULL from regular data values.

regards, tom lane

In response to

RE: index skipped in favor of seq scan. at 2001-07-10 14:55:45 from ryan.a.roemmich

Browse pgsql-general by date

	From	Date	Subject
Next Message	Martijn van Oosterhout	2001-07-10 16:00:03	Re: [PATCH] Partial indicies again
Previous Message	Kapil Tilwani	2001-07-10 15:55:04	Re: A small question about Red Hat