Quick Links

Re: Using distinct in an aggregate prevents parallel execution?

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Thomas Kellerer <spam_eater(at)gmx(dot)net>
Cc:	pgsql-general(at)postgresql(dot)org
Subject:	Re: Using distinct in an aggregate prevents parallel execution?
Date:	2018-06-06 14:32:27
Message-ID:	6457.1528295547@sss.pgh.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Thread:
Lists:	pgsql-general

Thomas Kellerer <spam_eater(at)gmx(dot)net> writes:
> Is this a known limitation?

Yes, unless somebody has done radical restructuring of the aggregation
code while I wasn't looking.

agg(DISTINCT ...) is currently implemented inside the Agg plan node,
so it's an indivisible black box to everything else. That was a
simple, minimum-code-footprint method for implementing the feature
back when; but it's got lots of drawbacks, and one is that there's
no reasonable way to parallelize.

I'd anticipate that before we could even start to think of parallelizing,
we'd have to split out the distinct-ification processing into a separate
plan node.

agg(... ORDER BY ...) has got the same problem, and it'd likely be
advisable to fix that at the same time.

regards, tom lane

In response to

Using distinct in an aggregate prevents parallel execution? at 2018-06-06 12:41:25 from Thomas Kellerer

Responses

Re: Using distinct in an aggregate prevents parallel execution? at 2018-06-06 19:12:34 from Thomas Kellerer

Browse pgsql-general by date

	From	Date	Subject
Next Message	Joshua D. Drake	2018-06-06 15:34:23	Re: Code of Conduct plan
Previous Message	Adrian Klaver	2018-06-06 14:20:53	Re: Failover replication building a new master