Quick Links

Postgres picks suboptimal index after building of an extended statistics

From:	"Andrey V(dot) Lepikhov" <a(dot)lepikhov(at)postgrespro(dot)ru>
To:	PostgreSQL Hackers <pgsql-hackers(at)lists(dot)postgresql(dot)org>
Subject:	Postgres picks suboptimal index after building of an extended statistics
Date:	2021-06-23 14:19:26
Message-ID:	0ca4553c-1f34-12ba-9122-44199d1ced41@postgrespro.ru
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

Hi,

Ivan Frolkov reported a problem with choosing a non-optimal index during
a query optimization. This problem appeared after building of an
extended statistics.

I prepared the test case (see t.sql in attachment).
For reproduction of this case we need to have a composite primary key
index and one another index.
Before creation of extended statistics, SELECT from the table choose PK
index and returns only one row. But after, this SELECT picks alternative
index, fetches and filters many tuples.

The problem is related to a corner case in btree cost estimation procedure:
if postgres detects unique one-row index scan, it sets
numIndexTuples to 1.0.

But the selectivity is calculated as usual, by the
clauselist_selectivity() routine and can have a value, much more than
corresponding to single tuple. This selectivity value is used later in
the code to calculate a number of fetched tuples and can lead to
choosing of an suboptimal index.

The attached patch is my suggestion to fix this problem.

--
regards,
Andrey Lepikhov
Postgres Professional

Attachment	Content-Type	Size
t.sql	application/sql	1.1 KB
0001-In-the-case-of-an-unique-one-row-btree-index-scan-on.patch	text/plain	5.8 KB

Responses

Re: Postgres picks suboptimal index after building of an extended statistics at 2021-08-11 00:48:19 from Peter Geoghegan

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Tomas Vondra	2021-06-23 14:25:10	Re: logical decoding and replication of sequences
Previous Message	Tomas Vondra	2021-06-23 14:14:03	Re: logical decoding and replication of sequences