Quick Links

Re: IN list processing performance (yet again)

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Dave Tenny <tenny(at)attbi(dot)com>
Cc:	pgsql-performance(at)postgresql(dot)org
Subject:	Re: IN list processing performance (yet again)
Date:	2003-05-29 01:46:13
Message-ID:	13453.1054172773@sss.pgh.pa.us
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-performance

Dave Tenny <tenny(at)attbi(dot)com> writes:
> </blockquote>
> There is one very interesting thing in my test case though.  It
> certainly seemed as if the 
> parameterized statements were successfully using the index of the
> freshly-created-but-unanalyzed table, 
> or else the times on those queries would have been terrible too.  It
> was only the IN list form 
> of query that wasn't making correct use of the index.  How can the
> planner recognize uniqueness for 
> one case but not the other?

The question is whether a seqscan will be faster than an indexscan; at
some point there's less I/O involved to just scan the table once. If
the planner doesn't know the index is unique then it's going to estimate
a higher cost for the indexscan (due to more rows fetched) and there is
some number of rows at which it will flip over to a seqscan. The same
will happen even if it *does* know the index is unique, it's just that
it will take more IN elements to make it happen. This is reasonable
behavior IMHO, although whether the flip-over point is anywhere near
the actual breakeven point on your hardware is anyone's guess. The cost
estimates are often far enough off that it's not very close.

regards, tom lane

In response to

Re: IN list processing performance (yet again) at 2003-05-29 01:19:56 from Dave Tenny

Browse pgsql-performance by date

	From	Date	Subject
Next Message	Christopher Kings-Lynne	2003-05-29 01:47:53	Re: IN list processing performance (yet again)
Previous Message	Dave Tenny	2003-05-29 01:19:56	Re: IN list processing performance (yet again)