Quick Links

Re: parallelizing the archiver

From:	Julien Rouhaud <rjuju123(at)gmail(dot)com>
To:	"Bossart, Nathan" <bossartn(at)amazon(dot)com>
Cc:	"pgsql-hackers(at)postgresql(dot)org" <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: parallelizing the archiver
Date:	2021-09-10 05:08:21
Message-ID:	CAOBaU_bCAqkh3tJhLVvjGN_1oaMQw4MN9BWU3pB3KmSPwm3FaQ@mail.gmail.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On Fri, Sep 10, 2021 at 6:30 AM Bossart, Nathan <bossartn(at)amazon(dot)com> wrote:
>
> Thanks for chiming in. I'm planning to work on a patch next week.

Great news!

About the technical concerns:

> I'm currently thinking of
> something a bit like autovacuum_max_workers, but the archive workers
> would be created once and would follow a competing consumers model.

In this approach, the launched archiver workers would be kept as long
as the instance is up, or should they be stopped if they're not
required anymore, e.g. if there was a temporary write activity spike?
I think we should make sure that at least one worker is always up.

> Another approach I'm looking at is to use background worker processes,
> although I'm not sure if linking such a critical piece of
> functionality to max_worker_processes is a good idea. However, I do
> see that logical replication uses background workers.

I think that using background workers is a good approach, and the
various guc in that area should allow users to properly configure
archiving too. If that's not the case, it might be an opportunity to
add some new infrastructure that could benefit all bgworkers users.

In response to

Re: parallelizing the archiver at 2021-09-09 22:30:54 from Bossart, Nathan

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Amit Kapila	2021-09-10 05:10:22	Re: Added schema level support for publication.
Previous Message	Thomas Munro	2021-09-10 05:04:09	Re: stat() vs ERROR_DELETE_PENDING, round N + 1