Quick Links

Re: Why isn't an index being used when selecting a distinct value?

From:	Gregory Stark <stark(at)enterprisedb(dot)com>
To:	"Keaton Adams" <kadams(at)mxlogic(dot)com>
Cc:	<pgsql-general(at)postgresql(dot)org>
Subject:	Re: Why isn't an index being used when selecting a distinct value?
Date:	2008-02-15 22:12:58
Message-ID:	873artvpg5.fsf@oxford.xeocode.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-general

"Keaton Adams" <kadams(at)mxlogic(dot)com> writes:

> Version: Postgres 8.1.4
> Platform: RHEL
>
> Given this scenario with the indexes in place, when I ask for the distinct
> field1_id values, why does the optimizer choose a sequential scan instead of
> just reading from the kda_log_fid_cre_20080123_idx index? The time it takes
> to perform the sequential scan against 20+ million records is way too slow.

Try (temporarily) doing:

SET enable_seqscan = off;

> keaton=# explain select distinct field1_id into temp kda_temp from
> kda_log_20080213;

If the database is right that will be even slower. Using a full index scan
requires a lot of random access seeks, generally the larger the table the
*more* likely a sequential scan and sort is a better approach than using an
index.

If it's wrong and it's faster then you have to consider whether it's only
faster because you've read the table into cache already. Will it be in cache
in production? If so then you migth try raising effective_cache_size or
lowering random_page_cost.

Another thing to try is using GROUP BY instead of DISTINCT. This is one case
where the postgres optimizer doesn't handle the two equivalent cases in
exactly the same way and there are some plans available in one method that
aren't in the other. That's only likely to help if you have relative few
values of field1_id but it's worth trying.

--
Gregory Stark
EnterpriseDB http://www.enterprisedb.com
Ask me about EnterpriseDB's On-Demand Production Tuning

In response to

Why isn't an index being used when selecting a distinct value? at 2008-02-15 21:31:41 from Keaton Adams

Responses

Re: Why isn't an index being used when selecting a distinct value? at 2008-02-18 05:22:33 from Keaton Adams

Browse pgsql-general by date

	From	Date	Subject
Next Message	Ken Johanson	2008-02-15 23:08:15	Re: Strict-typing benefits/costs
Previous Message	Jozsef Szalay	2008-02-15 21:54:28	pg_restore, search_path and operator class