Re: COPY JSON: use trailing commas in FORCE_ARRAY output

From: Ayush Tiwari <ayushtiwari(dot)slg01(at)gmail(dot)com>
To: Chao Li <li(dot)evan(dot)chao(at)gmail(dot)com>
Cc: PostgreSQL Hackers <pgsql-hackers(at)lists(dot)postgresql(dot)org>
Subject: Re: COPY JSON: use trailing commas in FORCE_ARRAY output
Date: 2026-05-06 07:42:38
Message-ID: CAJTYsWUdfOCo=m6dWpX=uxXFKiTUO751ju3mVJidd9p5VqApFg@mail.gmail.com
Views: Whole Thread | Raw Message | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

Hi,

On Wed, 6 May 2026 at 12:11, Chao Li <li(dot)evan(dot)chao(at)gmail(dot)com> wrote:

> Hi,
>
> Another issue I found with COPY TO (FORMAT json, FORCE_ARRAY) is that it
> places the delimiter comma at the beginning of the next line, like this:
> ```
> evantest=# copy test_json_copy to stdout with (format json, force_array);
> [
>
> {"id":1,"name":"Alice","is_active":true,"tags":["dev","db"],"created_at":"2026-05-01"}
>
> ,{"id":2,"name":"Bob","is_active":false,"tags":["manager"],"created_at":"2026-05-02"}
> ,{"id":3,"name":"Charlie","is_active":null,"tags":null,"created_at":null}
> ,{"id":4,"name":"Special Case:
> \"Quotes\"","is_active":true,"tags":["a","b"],"created_at":"2026-05-04"}
> ]
> ```
>
> I was surprised by the comma placement. It is valid JSON, but it looks
> quite uncommon.
>
> For comparison, the existing json_agg() places commas at the end of the
> line:
> ```
> evantest=# select json_agg(t) from (select id, name from test_json_copy) t;
> json_agg
> ----------------------------------------------
> [{"id":1,"name":"Alice"}, +
> {"id":2,"name":"Bob"}, +
> {"id":3,"name":"Charlie"}, +
> {"id":4,"name":"Special Case: \"Quotes\""}]
> (1 row)
> ```
>
> If this feature had already been released, I would not think it worth
> changing just for formatting. But since "FORMAT json" is a new PG19 feature
> and has not been released yet, I think it is better to make the output to
> follow the more common style.
>
> This patch changes the output to place the comma at the end of the
> previous line instead. The fix only adjusts how commas and newlines are
> emitted. It does not buffer the whole result, so it should not have any
> performance impact.
>

The patch looks good to me. I applied and the focused 'copy'
regression test too passed.

I agree that since FORMAT json is new for PG19, it is reasonable to adjust
the formatting before release. The implementation still streams the output
and only changes where the separator/newline are emitted.

Regards,
Ayush

In response to

Browse pgsql-hackers by date

  From Date Subject
Next Message Chao Li 2026-05-06 07:47:21 Re: Bug in ALTER SUBSCRIPTION ... SERVER / ... CONNECTION with broken old server
Previous Message Peter Eisentraut 2026-05-06 07:32:23 Re: Proposal: tighten validation for legacy EUC encodings or document that accepted byte sequences may be unconvertible to UTF8