Quick Links

Re: wCTE: why not finish sub-updates at the end, not the beginning?

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Marko Tiikkaja <marko(dot)tiikkaja(at)cs(dot)helsinki(dot)fi>
Cc:	pgsql-hackers(at)postgreSQL(dot)org
Subject:	Re: wCTE: why not finish sub-updates at the end, not the beginning?
Date:	2011-02-26 05:55:03
Message-ID:	8153.1298699703@sss.pgh.pa.us
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

Marko Tiikkaja <marko(dot)tiikkaja(at)cs(dot)helsinki(dot)fi> writes:
> On 2011-02-25 4:58 PM, Tom Lane wrote:
>> Specifically, I'm imagining getting rid of the patch's additions to
>> InitPlan and ExecutePlan that find all the modifying sub-queries and
>> force them to be cycled to completion before the main plan runs.
>> Just run the main plan and let it pull tuples from the CTEs as needed.
>> Then, in ExecutorEnd, cycle any unfinished ModifyTable nodes to
>> completion before shutting down the plan.

> This idea has actually been discussed before when we talked about
> optimizing wCTEs, but IIRC you said that doing this in ExecutorEnd is a
> bit ugly.

Further experimentation has reminded me of why I didn't want to put such
processing in ExecutorEnd :-(. There are some nasty interactions with
EXPLAIN:

1. EXPLAIN ANALYZE fails to include the execution cycles associated with
running the ModifyTable nodes to completion. In the worst case, such as
"WITH t AS (INSERT ...) SELECT 1", it will claim the INSERT subplan is
never executed, even though rows certainly got inserted. This is
because EXPLAIN extracts all the counts from the execution state tree
before shutting it down with ExecutorEnd.

2. But it gets worse. Try the same query *without* ANALYZE. You'll
find the INSERT executes anyway! That's because EXPLAIN still calls
ExecutorEnd to clean up the execution state tree, and ExecutorEnd
doesn't realize it's not supposed to run any of the plan.

So we really need some refactoring here. I dislike adding another
fundamental step to the ExecutorStart/ExecutorRun/ExecutorEnd sequence,
but there may not be a better way. The only way I see to fix this
without changing that API is to have ExecutorRun do the cleanup
processing just after the top plan node returns a null tuple, and that
seems a bit ugly as well.

Thoughts?

regards, tom lane

In response to

Re: wCTE: why not finish sub-updates at the end, not the beginning? at 2011-02-25 16:03:46 from Marko Tiikkaja

Responses

Re: wCTE: why not finish sub-updates at the end, not the beginning? at 2011-02-26 08:19:23 from Heikki Linnakangas
Re: wCTE: why not finish sub-updates at the end, not the beginning? at 2011-02-26 13:31:16 from Greg Stark
Re: wCTE: why not finish sub-updates at the end, not the beginning? at 2011-02-26 13:41:43 from Dean Rasheed

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Bruce Momjian	2011-02-26 06:55:39	Re: TODO: You can alter it, but you can't view it
Previous Message	Robert Haas	2011-02-26 05:52:40	Re: wCTE: about the name of the feature