Re: [HACKERS] CLUSTER command progress monitor

From: Tatsuro Yamada <yamada(dot)tatsuro(at)lab(dot)ntt(dot)co(dot)jp>
To: Dmitry Dolgov <9erthalion6(at)gmail(dot)com>
Cc: pg(at)bowt(dot)ie, Robert Haas <robertmhaas(at)gmail(dot)com>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>, Jeff Janes <jeff(dot)janes(at)gmail(dot)com>, Thomas Munro <thomas(dot)munro(at)enterprisedb(dot)com>, Michael Paquier <michael(dot)paquier(at)gmail(dot)com>
Subject: Re: [HACKERS] CLUSTER command progress monitor
Date: 2018-12-03 09:22:20
Message-ID: bb3d3eb2-0464-34c0-e2cf-adab9d6c2a06@lab.ntt.co.jp
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On 2018/11/29 21:20, Dmitry Dolgov wrote:
>> On Fri, Aug 24, 2018 at 7:06 AM Tatsuro Yamada <yamada(dot)tatsuro(at)lab(dot)ntt(dot)co(dot)jp> wrote:
>>
>> On 2017/11/22 6:07, Peter Geoghegan wrote:
>>> On Mon, Oct 2, 2017 at 6:04 AM, Robert Haas <robertmhaas(at)gmail(dot)com> wrote:
>>>> Progress reporting on sorts seems like a tricky problem to me, as I
>>>> said before. In most cases, a sort is going to involve an initial
>>>> stage where it reads all the input tuples and writes out quicksorted
>>>> runs, and then a merge phase where it merges all the output tapes into
>>>> a sorted result. There are some complexities; for example, if the
>>>> number of tapes is really large, then we might need multiple merge
>>>> phases, only the last of which will produce tuples.
>>>
>>> This would ordinarily be the point at which I'd say "but you're very
>>> unlikely to require multiple passes for an external sort these days".
>>> But I won't say that on this thread, because CLUSTER generally has
>>> unusually wide tuples, and so is much more likely to be I/O bound, to
>>> require multiple passes, etc. (I bet the v10 enhancements
>>> disproportionately improved CLUSTER performance.)
>>
>> Hi,
>>
>> I came back to develop the feature for community.
>> V4 patch is corrected these following points:
>>
>> - Rebase on master (143290efd)
>> - Fix document
>> - Replace the column name scan_index_relid with cluster_index_relid.
>> Thanks to Jeff Janes!
>>
>> I'm now working on improving the patch based on Robert's comment related to
>> "Seqscan and Sort case" and also considering how to handle the "Index scan case".
>
> Thank you,
>
> Unfortunately, this patch has some conflicts now, could you rebase it? Also
> what's is the status of your work on improving it based on the
> provided feedback?
>
> In the meantime I'm moving it to the next CF.

Thank you for managing the CF and Sorry for the late reply.
I'll rebase it for the next CF and also I'll clear my head because the patch
needs design change to address the feedbacks, I guess. Therefore, the status is
reconsidering the design of the patch. :)

Regards,
Tatsuro Yamada
NTT Open Source Software Center

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Etsuro Fujita 2018-12-03 11:20:32 Re: postgres_fdw: oddity in costing aggregate pushdown paths
Previous Message Alexander Lakhin 2018-12-03 08:58:13 Re: make installcheck-world in a clean environment