Quick Links

Re: Top-k optimizations?

From:	Ron Mayer <rm_pg(at)cheapcomplexdevices(dot)com>
To:	David Fetter <david(at)fetter(dot)org>
Subject:	Re: Top-k optimizations?
Date:	2005-01-14 04:24:31
Message-ID:	41E7497F.3080408@cheapcomplexdevices.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

David Fetter wrote:
> Folks,
>
> As this came up in a work situation, I was wondering a little bit
> about the top-k issue. Right now, top-k is implemented (most easily,
> I think) via a SELECT with a LIMIT and no OFFSET. 3 questions arise
> from this.

I think the simplest LIMIT query doesn't make it easy to show ties;
but if you don't want the equivalent of MSSQL's "with ties" clause
I think LIMIT works well.

> 1. Are there currently any optimizations specific to top-k in
> PostgreSQL? If so, what are they?

Well, when I do queries like
"select * from customers order by dollarsspent desc limit 3"
it happily uses an index on dollarsspent.

> 3. What kinds of top-k optimizations might (eventually) be included
> in PostgreSQL?

I think a slightly related topic is whether syntactically it'd be
nice if postgresql had the SQL 2003 optional olap features to
specify ways of doing top-k queries as described here:
http://troels.arvin.dk/db/rdbms/#select-top-n

SELECT * FROM (
SELECT
RANK() OVER (ORDER BY age ASC) AS ranking,
person_id, person_name, age
FROM person
) AS foo
WHERE ranking <= 3

Seems IBM and Oracle support that syntax or something very similar.

Yeah, I know it's probably an orthogonal question to the
optimizations one, but might make porting nicer.

In response to

Top-k optimizations? at 2005-01-14 01:43:31 from David Fetter

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Greg Stark	2005-01-14 04:50:01	Re: Bug? 8.0 does not use partial index
Previous Message	Tom Lane	2005-01-14 04:17:32	Re: FATAL: catalog is missing 1 attribute(s) for relid 16396