Re: Make COPY format extendable: Extract COPY TO format implementations

From: Sutou Kouhei <kou(at)clear-code(dot)com>
To: sawada(dot)mshk(at)gmail(dot)com
Cc: michael(at)paquier(dot)xyz, david(dot)g(dot)johnston(at)gmail(dot)com, tgl(at)sss(dot)pgh(dot)pa(dot)us, zhjwpku(at)gmail(dot)com, pgsql-hackers(at)postgresql(dot)org
Subject: Re: Make COPY format extendable: Extract COPY TO format implementations
Date: 2025-09-10 02:40:58
Message-ID: 20250910.114058.944005985178219874.kou@clear-code.com
Views: Whole Thread | Raw Message | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

Hi,

In <CAD21AoCidyfKcpf9-f2Np8kWgkM09c4TjnS1h1hcO_-CCbjeqw(at)mail(dot)gmail(dot)com>
"Re: Make COPY format extendable: Extract COPY TO format implementations" on Tue, 9 Sep 2025 13:15:43 -0700,
Masahiko Sawada <sawada(dot)mshk(at)gmail(dot)com> wrote:

>> I don't object your approach but we need a good way to
>> measure performance. If we use this approach, we can omit it
>> for now and we can revisit your approach later without
>> breaking compatibility. How about using this approach if we
>> can't find a good way to measure performance?
>
> I think it would be better to hear more opinions about this idea and
> then make a decision, rather than basing our decision on whether or
> not we can measure its performance, so we can be more confident in the
> idea we have chosen. While this idea has the above downside, it could
> make sense because we always allocate the entire CopyFrom/ToStateData
> even today in spite of some fields being not used at all in binary
> format and it requires less implementation costs to hide the
> for-core-only fields. On the other hand, another possible idea is that
> we have three different structs for categories 1 (core-only), 2 (core
> and extensions), and 3 (extension-only), and expose only 2 that has a
> void pointer to 3. The core can allocate the memory for 1 that embeds
> 2 at the beginning of the fields. While this design looks cleaner and
> we can minimize overheads due to indirect references, it would require
> more implementation costs. Which method we choose, I think we need
> performance measurements in several scenarios to check if performance
> regressions don't happen unexpectedly.

OK. So the next step is collecting more opinions, right?

Could you add key people in this area to Cc to hear their
opinions? I'm not familiar with key people in the PostgreSQL
community...

Thanks,
--
kou

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Jonathan S. Katz 2025-09-10 03:13:43 PostgreSQL 18 GA press release draft
Previous Message Zhijie Hou (Fujitsu) 2025-09-10 02:34:58 RE: Add support for specifying tables in pg_createsubscriber.