Re: tricky query

From:	Sam Mason <sam(at)samason(dot)me(dot)uk>
To:	pgsql-performance(at)postgresql(dot)org
Subject:	Re: tricky query
Date:	2005-06-28 17:42:05
Message-ID:	20050628174205.GT62747@colo.samason.me.uk
Lists:	pgsql-performance

John A Meinel wrote:
>SELECT t1.id+1 as id_new FROM id_test t1
> WHERE NOT EXISTS
> (SELECT t2.id FROM id_test t2 WHERE t2.id = t1.id+1)
> ORDER BY t1.id LIMIT 1;

This works well on sparse data, as it only requires as many index
accesses as it takes to find the first gap. The simpler "NOT IN"
version that everybody seems to have posted the first time round
has a roughly fixed startup cost (it depends on the number of rows,
not on where the gap is), but once that is paid the time actually
spent searching for the gap is much lower.

I guess the version you use depends on how sparse you expect the
data to be. If you expect your query to have to search through
more than half the table before finding the gap then you're better
off using the "NOT IN" version, otherwise the "NOT EXISTS" version
is faster -- on my system anyway.
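
For anyone wanting to try both forms side by side, here is a minimal sketch. It makes several assumptions: it runs against an in-memory SQLite database through Python's sqlite3 module rather than PostgreSQL, the "NOT IN" query is my guess at the shape of the versions posted earlier (they are not quoted in this message), and the helper name first_gap and the sample ids are made up for illustration. It only checks that the two queries agree on the answer; it says nothing about relative speed, which would need EXPLAIN ANALYZE on a real PostgreSQL table.

```python
import sqlite3

# The NOT EXISTS form quoted from John's message.
NOT_EXISTS_SQL = """
SELECT t1.id + 1 AS id_new FROM id_test t1
 WHERE NOT EXISTS
   (SELECT t2.id FROM id_test t2 WHERE t2.id = t1.id + 1)
 ORDER BY t1.id LIMIT 1
"""

# Assumed shape of the "NOT IN" form; not copied from the thread.
NOT_IN_SQL = """
SELECT t1.id + 1 AS id_new FROM id_test t1
 WHERE t1.id + 1 NOT IN (SELECT t2.id FROM id_test t2)
 ORDER BY t1.id LIMIT 1
"""


def first_gap(ids, sql):
    """Load ids into a throwaway id_test table and return the first unused id.

    Hypothetical helper for this sketch. Returns None if the table is empty.
    """
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute("CREATE TABLE id_test (id INTEGER PRIMARY KEY)")
        conn.executemany(
            "INSERT INTO id_test (id) VALUES (?)", [(i,) for i in ids]
        )
        row = conn.execute(sql).fetchone()
        return row[0] if row else None
    finally:
        conn.close()


if __name__ == "__main__":
    sample_ids = [1, 2, 3, 5, 6, 9]  # made-up data with gaps at 4, 7, 8
    print(first_gap(sample_ids, NOT_EXISTS_SQL))
    print(first_gap(sample_ids, NOT_IN_SQL))
```

Both queries report the id just after the lowest row that has no successor, so on a table with no gaps at all they return max(id) + 1.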

Hope that's interesting!

Sam

In response to

Re: tricky query at 2005-06-28 15:42:28 from John A Meinel
