Querying distinct values from a large table

From:	Igor Lobanov <ilobanov(at)swsoft(dot)com>
To:	pgsql-performance(at)postgresql(dot)org
Subject:	Querying distinct values from a large table
Date:	2007-01-30 08:33:34
Message-ID:	45BF02DE.7080605@swsoft.com
Lists:	pgsql-performance

Greetings!

I have a rather large table with about 5 million rows and a dozen
columns. Let's suppose that the columns are named 'a', 'b', 'c' etc. I need
to query the distinct (a, b) pairs from this table.

I use the following query:

SELECT DISTINCT a, b FROM tbl;

but unfortunately it takes forever to complete. EXPLAIN tells me that the
bottleneck is a sequential scan on 'tbl', which takes most of the time.

Creating a compound index on this table with the following statement:

CREATE INDEX tbl_a_b_idx ON tbl( a, b );

has no effect; postgres simply ignores it, at least according to the
EXPLAIN output.

Is there any way to somehow improve the performance of this operation?
The table cannot be changed.
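
A minimal sketch of how the plans could be compared, using the table and
column names from the statements above. The GROUP BY variant is not part
of the original message; it is shown only as an assumption worth testing,
since it returns the same (a, b) pairs and the planner may be able to pick
a different strategy for it (for example a hash aggregate instead of a sort).
Note that EXPLAIN ANALYZE actually runs the query, so each statement takes
as long as the query itself.

```sql
-- Show the plan actually chosen, with real timings, for the query as written.
EXPLAIN ANALYZE
SELECT DISTINCT a, b FROM tbl;

-- Same result set expressed with GROUP BY; compare the resulting plan
-- and timing against the DISTINCT form above.
EXPLAIN ANALYZE
SELECT a, b FROM tbl GROUP BY a, b;
```

Either way, every row of the table still has to be read to find all
distinct pairs, so a sequential scan in the plan is not wrong by itself.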

--
Igor Lobanov
Internal Development Engineer
SWsoft, Inc.

Responses

Re: Querying distinct values from a large table at 2007-01-30 09:12:51 from Richard Huxton
Re: Querying distinct values from a large table at 2007-01-30 16:44:35 from Bruno Wolff III
