Re: Wrong results due to missing quals

From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: Richard Guo <guofenglinux(at)gmail(dot)com>
Cc: PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: Wrong results due to missing quals
Date: 2023-05-24 15:10:21
Message-ID: 1132178.1684941021@sss.pgh.pa.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

Richard Guo <guofenglinux(at)gmail(dot)com> writes:
> So the qual 't2.a = t5.a' is missing.

Ugh.

> I looked into it and found that both clones of this joinqual are
> rejected by clause_is_computable_at, because their required_relids do
> not include the outer join of t2/(t3/t4), and meanwhile include nullable
> rels of this outer join.
> I think the root cause is that, as Tom pointed out in [1], we're not
> maintaining required_relids very accurately. In b9c755a2, we make
> clause_is_computable_at test required_relids for clone clauses. I think
> this is how this issue sneaks in.

Yeah. I'm starting to think that b9c755a2 took the wrong approach.
Really, required_relids is about making sure that a qual isn't
evaluated "too low", before all necessary joins have been formed. But
clause_is_computable_at is charged with making sure we don't evaluate
it "too high", after some incompatible join has been formed. There's
no really good reason to suppose that required_relids can serve both
purposes, even if it were computed perfectly accurately (and what is
perfect, anyway?).

So right now I'm playing with the idea of reverting the change in
clause_is_computable_at and seeing how else we can fix the previous
bug. Don't have anything to show yet, but one thought is that maybe
deconstruct_distribute_oj_quals needs to set up clause_relids for
clone clauses differently. Another idea is that maybe we need another
RestrictInfo field that's directly a set of OJ relids that this clause
can't be applied above. That'd reduce clause_is_computable_at to
basically a bms_intersect test which would be nice speed-wise. The
space consumption could be annoying, but I'm thinking that we might
only have to populate the field in clone clauses, which would
alleviate that issue.

regards, tom lane

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Tom Lane 2023-05-24 15:37:10 Re: memory leak in trigger handling (since PG12)
Previous Message Erik Rijkers 2023-05-24 14:57:59 Re: PG 16 draft release notes ready