Re: MergeAppend could consider sorting cheapest child path

From:	Andrei Lepikhov <lepihov(at)gmail(dot)com>
To:	Alexander Korotkov <aekorotkov(at)gmail(dot)com>
Cc:	Alexander Pyhalov <a(dot)pyhalov(at)postgrespro(dot)ru>, Andy Fan <zhihuifan1213(at)163(dot)com>, Bruce Momjian <bruce(at)momjian(dot)us>, PostgreSQL Hackers <pgsql-hackers(at)lists(dot)postgresql(dot)org>, Nikita Malakhov <HukuToc(at)gmail(dot)com>
Subject:	Re: MergeAppend could consider sorting cheapest child path
Date:	2025-06-03 13:53:50
Message-ID:	0b8a7527-8078-4c98-a987-7153b64ca4ab@gmail.com
Lists:	pgsql-hackers

On 3/6/2025 15:38, Alexander Korotkov wrote:
> On Tue, Jun 3, 2025 at 4:23 PM Andrei Lepikhov <lepihov(at)gmail(dot)com> wrote:
>> To establish a stable foundation for discussion, I conducted simple
>> tests - see, for example, a couple of queries in the attachment. As I
>> see it, Sort->Append works faster: in my test bench, it takes 1250ms on
>> average versus 1430ms, and it also has lower costs - the same for data
>> with and without massive numbers of duplicates. Playing with sizes of
>> inputs, I see the same behaviour.
>
> I ran your tests. For the Sort(Append()) case I got actual
> time=811.047..842.473. For the MergeAppend case I got
> actual time=723.678..967.004. That looks interesting. At some point
> we probably should teach our Sort node to start returning tuples before
> finishing the last merge stage.
>
> However, I think the costs do not reflect the timing. Our cost model
> predicts that the startup cost of MergeAppend is less than the startup
> cost of Sort(Append()). And that's correct. However, in fact the total
> time of MergeAppend is bigger than the total time of Sort(Append()). The
> differences in these two cases are comparable. I think we need to
> adjust our cost_sort() to reflect that.
Could you explain your idea? As I see it (and as I showed in the previous
message), the total cost of Sort->Append is lower than that of
MergeAppend->Sort.
Additionally, as I mentioned earlier, the primary reason for choosing
MergeAppend in the regression test was a slight total cost difference
that triggered the startup cost comparison.
Could you show the query that concerns you, along with its EXPLAIN
output?
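
To make the tie-break point concrete, here is a minimal Python sketch of a
fuzzy cost comparison: total cost decides, unless the two totals fall within
a small tolerance, in which case startup cost decides. It is a simplified
illustration, not the planner's code. The names `PathCost` and `pick_path`,
the 1% tolerance, and all cost numbers are assumptions made for the example.

```python
from dataclasses import dataclass

# Assumed tolerance: totals within 1% of each other count as a tie.
FUZZ_FACTOR = 1.01


@dataclass
class PathCost:
    """Hypothetical stand-in for a plan path with its two cost figures."""
    name: str
    startup: float
    total: float


def pick_path(a: PathCost, b: PathCost, fuzz: float = FUZZ_FACTOR) -> PathCost:
    """Prefer the lower total cost; on a fuzzy tie, prefer the lower startup cost."""
    if a.total > b.total * fuzz:
        return b
    if b.total > a.total * fuzz:
        return a
    # Totals are fuzzily equal, so startup cost becomes the deciding factor.
    if a.startup > b.startup * fuzz:
        return b
    if b.startup > a.startup * fuzz:
        return a
    # Fuzzily equal on both: fall back to the exact total cost.
    return a if a.total <= b.total else b


# Made-up costs: Sort->Append is slightly cheaper in total,
# MergeAppend is far cheaper to start.
sort_append = PathCost("Sort->Append", startup=1000.0, total=1005.0)
merge_append = PathCost("MergeAppend", startup=10.0, total=1010.0)

winner = pick_path(sort_append, merge_append)
print(winner.name)
```

With these made-up numbers, Sort->Append has the lower total cost, but the
totals differ by less than 1%, so the comparison falls through to startup
cost and MergeAppend is picked.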

--
regards, Andrei Lepikhov

In response to

Re: MergeAppend could consider sorting cheapest child path at 2025-06-03 13:38:47 from Alexander Korotkov

Responses

Re: MergeAppend could consider sorting cheapest child path at 2025-06-03 14:05:13 from Alexander Korotkov
