Re: parallel data loading for pgbench -i

From: Mircea Cadariu <cadariu(dot)mircea(at)gmail(dot)com>
To: lakshmi <lakshmigcdac(at)gmail(dot)com>
Cc: PostgreSQL Hackers <pgsql-hackers(at)lists(dot)postgresql(dot)org>, tomas(at)vondra(dot)me
Subject: Re: parallel data loading for pgbench -i
Date: 2026-01-29 11:19:05
Message-ID: 4c1d0b97-a5f8-472c-afdd-bdeb09b93f33@gmail.com
Views: Whole Thread | Raw Message | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

Hi Lakshmi,

On 19/01/2026 09:25, lakshmi wrote:
>
> Hi Mircea,
>
> I tested the patch on 19devel and it worked well for me.
> Before applying it, |-j| is rejected in pgbench initialization mode as
> expected. After applying the patch, |pgbench -i -s 100 -j 10| runs
> successfully and shows a clear speedup.
> On my system the total runtime dropped to about 9.6s, with client-side
> data generation around 3.3s.
> I also checked correctness after the run — row counts for
> pgbench_accounts, pgbench_branches, and pgbench_tellers all match the
> expected values.
>
> Thanks for working on this, the improvement is very noticeable.
>
> Best regards,
> lakshmi
>
Thanks for having a look and trying it out!

FYI this is one of Tomas Vondra's patch ideas from his blog [1].

I have attached a new version which now includes docs, tests, a proposed
commit message, and an attempt to fix the current CI failures (Windows).

[1] - https://vondra.me/posts/patch-idea-parallel-pgbench-i

--
Thanks,
Mircea Cadariu

Attachment Content-Type Size
v1-0001-Add-parallel-data-loading-support-to-pgbench.patch text/plain 22.7 KB

In response to

Browse pgsql-hackers by date

  From Date Subject
Next Message John Naylor 2026-01-29 11:25:50 Re: Undefined behavior detected by new clang's ubsan
Previous Message shveta malik 2026-01-29 11:16:10 Re: Proposal: Conflict log history table for Logical Replication