Quick Links

Re: pg_upgrade parallelism

From:	Jacob Champion <pchampion(at)vmware(dot)com>
To:	"jcasanov(at)systemguards(dot)com(dot)ec" <jcasanov(at)systemguards(dot)com(dot)ec>, "pgsql-hackers(at)postgresql(dot)org" <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: pg_upgrade parallelism
Date:	2021-11-17 20:04:41
Message-ID:	c026099a52d0c5afaf0074a5a301f296a4b09047.camel@vmware.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On Wed, 2021-11-17 at 14:44 -0500, Jaime Casanova wrote:
> I'm trying to add more parallelism by copying individual segments
> of a relfilenode in different processes. Does anyone one see a big
> problem in trying to do that? I'm asking because no one did it before,
> that could not be a good sign.

I looked into speeding this up a while back, too. For the use case I
was looking at -- Greenplum, which has huge numbers of relfilenodes --
spinning disk I/O was absolutely the bottleneck and that is typically
not easily parallelizable. (In fact I felt at the time that Andres'
work on async I/O might be a better way forward, at least for some
filesystems.)

But you mentioned that you were seeing disks that weren't saturated, so
maybe some CPU optimization is still valuable? I am a little skeptical
that more parallelism is the way to do that, but numbers trump my
skepticism.

> - why we read()/write() at all? is not a faster way of copying the file?
> i'm asking that because i don't actually know.

I have idly wondered if something based on splice() would be faster,
but I haven't actually tried it.

But there is now support for copy-on-write with the clone mode, isn't
there? Or are you not able to take advantage of it?

--Jacob

In response to

pg_upgrade parallelism at 2021-11-17 19:44:52 from Jaime Casanova

Responses

Re: pg_upgrade parallelism at 2022-01-12 04:51:07 from Jaime Casanova

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Andrew Dunstan	2021-11-17 20:24:09	Re: Granting SET and ALTER SYSTE privileges for GUCs
Previous Message	Jaime Casanova	2021-11-17 19:44:52	pg_upgrade parallelism