Re: speedup COPY TO for partitioned table.

From: torikoshia <torikoshia(at)oss(dot)nttdata(dot)com>
To: jian he <jian(dot)universality(at)gmail(dot)com>
Cc: Kirill Reshke <reshkekirill(at)gmail(dot)com>, vignesh C <vignesh21(at)gmail(dot)com>, David Rowley <dgrowleyml(at)gmail(dot)com>, Melih Mutlu <m(dot)melihmutlu(at)gmail(dot)com>, PostgreSQL Hackers <pgsql-hackers(at)lists(dot)postgresql(dot)org>
Subject: Re: speedup COPY TO for partitioned table.
Date: 2025-06-30 07:57:30
Message-ID: 5769351f63b0d377535f92c925cec4ea@oss.nttdata.com
Views: Whole Thread | Raw Message | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On 2025-06-27 16:14, jian he wrote:
Thanks for updating the patch!

> On Thu, Jun 26, 2025 at 9:43 AM torikoshia <torikoshia(at)oss(dot)nttdata(dot)com>
> wrote:
>>
>> After applying the patch, blank lines exist between these statements
>> as
>> below. Do we really need these blank lines?
>>
>> ```
>> scan_rel = table_open(scan_oid,
>> AccessShareLock);
>>
>> CopyThisRelTo(cstate, scan_rel, cstate->rel,
>> &processed);
>>
>> table_close(scan_rel, AccessShareLock);
>> ``
>>
> we can remove these empty new lines.
> actually, I realized we don't need to use AccessShareLock here—we can
> use NoLock
> instead, since BeginCopyTo has already acquired AccessShareLock via
> find_all_inheritors.

That makes sense.
I think it would be helpful to add a comment explaining why NoLock is
safe here — for example:

/* We already got the needed lock */

In fact, in other places where table_open(..., NoLock) is used, similar
explanatory comments are often included(Above comment is one of them).

>> > +/*
>> > + * rel: the relation from which the actual data will be copied.
>> > + * root_rel: if not NULL, it indicates that we are copying partitioned
>> > relation
>> > + * data to the destination, and "rel" is the partition of "root_rel".
>> > + * processed: number of tuples processed.
>> > +*/
>> > +static void
>> > +CopyThisRelTo(CopyToState cstate, Relation rel, Relation root_rel,
>>
>> This comment only describes the parameters. Wouldn't it better to add
>> a
>> brief summary of what this function does overall?
>>
>
> what do you think the following
>
> /*
> * CopyThisRelTo:
> * This will scanning a single table (which may be a partition) and
> exporting
> * its rows to a COPY destination.
> *
> * rel: the relation from which the actual data will be copied.
> * root_rel: if not NULL, it indicates that we are copying partitioned
> relation
> * data to the destination, and "rel" is the partition of "root_rel".
> * processed: number of tuples processed.
> */
> static void
> CopyThisRelTo(CopyToState cstate, Relation rel, Relation root_rel,
> uint64 *processed)

I think it would be better to follow the style of nearby functions in
the same file. For example:

/*
* Scan a single table (which may be a partition) and export
* its rows to the COPY destination.
*/

Also, regarding the function name CopyThisRelTo() — I wonder if the
"This" is really necessary?
Maybe something simpler like CopyRelTo() is enough.

What do you think?

Regards,

--
Atsushi Torikoshi
Seconded from NTT DATA Japan Corporation to SRA OSS K.K.

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Rahila Syed 2025-06-30 07:59:22 Re: add function for creating/attaching hash table in DSM registry
Previous Message Dilip Kumar 2025-06-30 07:51:24 Re: pg_logical_slot_get_changes waits continously for a partial WAL record spanning across 2 pages