Quick Links

Re: PoC/WIP: Extended statistics on expressions

From:	Dean Rasheed <dean(dot)a(dot)rasheed(at)gmail(dot)com>
To:	Tomas Vondra <tomas(dot)vondra(at)enterprisedb(dot)com>
Cc:	Justin Pryzby <pryzby(at)telsasoft(dot)com>, PostgreSQL Hackers <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: PoC/WIP: Extended statistics on expressions
Date:	2021-03-17 18:54:41
Message-ID:	CAEZATCWF66BWJq-OwLuP5LGK6W9LDYxCQwLxqB36qmq3b1Ch8Q@mail.gmail.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On Wed, 17 Mar 2021 at 17:26, Tomas Vondra
<tomas(dot)vondra(at)enterprisedb(dot)com> wrote:
>
> My concern is that the current behavior (where we prefer expression
> stats over multi-column stats to some extent) works fine as long as the
> parts are independent, but once there's dependency it's probably more
> likely to produce underestimates. I think underestimates for grouping
> estimates were a risk in the past, so let's not make that worse.
>

I'm not sure the current behaviour really is preferring expression
stats over multi-column stats. In this example, where we're grouping
by (a+b), (c+d) and have stats on [(a+b),c] and (c+d), neither of
those multi-column stats actually match more than one
column/expression. If anything, I'd go the other way and say that it
was wrong to use the [(a+b),c] stats in the first case, where they
were the only stats available, since those stats aren't really
applicable to (c+d), which probably ought to be treated as
independent. IOW, it might have been better to estimate the first case
as

ndistinct((a+b)) * ndistinct(c) * ndistinct(d)

and the second case as

ndistinct((a+b)) * ndistinct((c+d))

Regards,
Dean

In response to

Re: PoC/WIP: Extended statistics on expressions at 2021-03-17 17:25:58 from Tomas Vondra

Responses

Re: PoC/WIP: Extended statistics on expressions at 2021-03-17 19:07:22 from Tomas Vondra

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	John Naylor	2021-03-17 18:59:38	Re: WIP: BRIN multi-range indexes
Previous Message	Justin Pryzby	2021-03-17 18:45:34	Re: ALTER TABLE .. DETACH PARTITION CONCURRENTLY