Quick Links

Re: Buffers from parallel workers not accumulated to upper nodes with gather merge

From:	Amit Kapila <amit(dot)kapila16(at)gmail(dot)com>
To:	Jehan-Guillaume de Rorthais <jgdr(at)dalibo(dot)com>
Cc:	PostgreSQL mailing lists <pgsql-bugs(at)lists(dot)postgresql(dot)org>
Subject:	Re: Buffers from parallel workers not accumulated to upper nodes with gather merge
Date:	2020-07-20 09:59:00
Message-ID:	CAA4eK1LQnp4UwfEXXKQQtxkHDDx7q83hbDN-BPBNP84qGjqpAQ@mail.gmail.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Thread:
Lists:	pgsql-bugs

On Mon, Jul 20, 2020 at 1:29 PM Jehan-Guillaume de Rorthais
<jgdr(at)dalibo(dot)com> wrote:
>
> On Mon, 20 Jul 2020 11:30:34 +0530
> Amit Kapila <amit(dot)kapila16(at)gmail(dot)com> wrote:
>
> > On Sat, Jul 18, 2020 at 7:32 PM Jehan-Guillaume de Rorthais
> > <jgdr(at)dalibo(dot)com> wrote:
> [...]
> > > The Merge node works correctly because it calls ExecShutdownGatherWorkers as
> > > soon as the workers are exhausted from gather_readnext. Because of this,
> > > buffers from workers are already accounted and propagated to upper nodes
> > > before the recursive call of ExecShutdownNode on each nodes. There's no
> > > similar call to ExecShutdownGatherMergeWorkers for Gather Merge. Adding a
> > > call to ExecShutdownGatherMergeWorkers in gather_merge_getnext when workers
> > > are exhausted seems to fix the issue, but I feel like this is the wrong
> > > place to fix this issue.
> >
> > Why do you think so?
>
> Because the fix seemed too specific to the Gather Merge node alone. This bug
> might exist for some other nodes (I didn't look for them) and the trap will
> stay for futur ones.
>
> The fix in ExecShutdownNode recursion have a broader impact on all present
> and futur nodes.
>
> > I think irrespective of whether we want to call
> > ExecShutdownGatherMergeWorkers in gather_merge_getnext (when we know
> > we are done aka binaryheap_empty) to fix this particular issue, it is
> > better to shutdown the workers as soon as we are done similar to what
> > we do for Gather node. It is good to release resources as soon as we
> > can.
>
> Indeed. I was focusing on the bug and I didn't thought about that. The patch I
> test was really just a quick test. I'll have a closer look at this, but I
> suppose this might be considered as a separate commit?
>

Good Question. Initially, I thought we will have it in a same commit,
but now on again thinking about it, we might want to keep this one for
the HEAD only and ExecShutdownNode related change in the
back-branches. BTW, can you please test if the problem exist in
back-branches, and does your change fix it there as well?

--
With Regards,
Amit Kapila.
EnterpriseDB: http://www.enterprisedb.com

In response to

Re: Buffers from parallel workers not accumulated to upper nodes with gather merge at 2020-07-20 07:59:27 from Jehan-Guillaume de Rorthais

Responses

Re: Buffers from parallel workers not accumulated to upper nodes with gather merge at 2020-07-20 12:09:24 from Jehan-Guillaume de Rorthais
Re: Buffers from parallel workers not accumulated to upper nodes with gather merge at 2020-07-21 15:34:57 from Jehan-Guillaume de Rorthais

Browse pgsql-bugs by date

	From	Date	Subject
Next Message	Francisco Olarte	2020-07-20 10:56:37	Re: Improvement for query planner? (no, not about count(*) again ;-))
Previous Message	Sandeep Thakkar	2020-07-20 09:18:47	Re: BUG #16460: Error when executing REFRESH MATERIALIZED VIEW WITH DATA;