Re: hashjoin chosen over 1000x faster plan

From: "Kevin Grittner" <Kevin(dot)Grittner(at)wicourts(dot)gov>
To: "Simon Riggs" <simon(at)2ndquadrant(dot)com>, "Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc: <pgsql-performance(at)postgresql(dot)org>
Subject: Re: hashjoin chosen over 1000x faster plan
Date: 2007-10-10 18:30:13
Message-ID: 470CD3E3.EE98.0025.0@wicourts.gov
Lists: pgsql-performance

>>> On Wed, Oct 10, 2007 at 1:07 PM, in message <20980(dot)1192039650(at)sss(dot)pgh(dot)pa(dot)us>,
Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us> wrote:
> Simon Riggs <simon(at)2ndquadrant(dot)com> writes:
>> Basically the planner doesn't ever optimise for the possibility of the
>> never-executed case because even a single row returned would destroy
>> that assumption.
>
> It's worse than that: the outer subplan *does* return some rows.
> I suppose that all of them had NULLs in the join keys, which means
> that (since 8.1 or so) nodeMergejoin discards them as unmatchable.
> Had even one been non-NULL the expensive subplan would have been run.
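
Right, and that makes sense: a NULL join key can never satisfy the
equality condition, so those rows can be thrown away without the
expensive inner side ever being run. A trivial illustration, using
throwaway tables rather than our real schema:

    CREATE TEMP TABLE outer_t (k integer);
    CREATE TEMP TABLE inner_t (k integer);
    INSERT INTO outer_t VALUES (NULL), (NULL);
    INSERT INTO inner_t VALUES (1), (2);
    -- Returns zero rows: NULL never compares equal to anything,
    -- so no outer row can possibly find a match.
    SELECT * FROM outer_t o JOIN inner_t i ON o.k = i.k;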

Well, this query is run tens of thousands of times per day by our web
application; less than one percent of those runs would require the
subplan. (In my initial post I showed counts to demonstrate that 1% of
the rows had a non-NULL value in the join key and, while I wouldn't
expect the planner to know this, those rows tend to be clustered in an
even smaller percentage of cases.) If the philosophy of the planner is
to go for the lowest average cost (versus the lowest worst-case cost),
shouldn't it use the statistics to look at the percentage of NULLs?
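
For what it's worth, ANALYZE already collects the fraction of NULLs
per column, which shows up as null_frac in pg_stats. Something along
these lines (table and column names here are placeholders, not our
real schema) shows the figure the planner would have available:

    SELECT tablename, attname, null_frac
      FROM pg_stats
     WHERE tablename = 'mytable'   -- placeholder table name
       AND attname = 'join_col';   -- placeholder join-key column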

-Kevin
