Quick Links

Re: improve performance of pg_dump with many sequences

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Nathan Bossart <nathandbossart(at)gmail(dot)com>
Cc:	Michael Paquier <michael(at)paquier(dot)xyz>, Euler Taveira <euler(at)eulerto(dot)com>, pgsql-hackers <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: improve performance of pg_dump with many sequences
Date:	2025-12-29 17:26:01
Message-ID:	2885944.1767029161@sss.pgh.pa.us
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

Nathan Bossart <nathandbossart(at)gmail(dot)com> writes:
> Committed.

In the no-good-deed-goes-unpunished department: pg_dump's use
of pg_get_sequence_data() (nee pg_sequence_read_tuple()) is
evidently responsible for the complaint in bug #19365 [1]
that pg_dump can no longer survive concurrent sequence drops.

Given that that function already silently returns NULLs if the
sequence isn't readable for other reasons, I think it'd be
sane to make it silently return NULL if the sequence isn't
there anymore. Unfortunately, that looks like it'd require
nontrivial restructuring of init_sequence().

Or maybe we could make it not use init_sequence()? For the moment
a plain try_relation_open and check that it's a sequence should do,
but I'm not sure how that'd fit into people's plans for future
improvement of the sequence API.

There are other reasons not to like use of init_sequence in this
code path, too. pg_dump's session will build a SeqTable entry for
every sequence in the database, which there could be a lot of,
and it will acquire RowExclusiveLock on every sequence and hold
that to the end of the dump, which seems likely to be troublesome
from a concurrency standpoint. Since pg_get_sequence_data is a
read-only operation this lock level feels wrong.

BTW, I'm unconvinced that pg_dump behaves sanely when this function
does return nulls. I think the ideal thing would be for it to skip
issuing setval(), but right now it looks like it will issue one with
garbage values.

regards, tom lane

[1] https://www.postgresql.org/message-id/19365-6245240d8b926327%40postgresql.org

In response to

Re: improve performance of pg_dump with many sequences at 2024-07-31 15:19:01 from Nathan Bossart

Responses

Re: improve performance of pg_dump with many sequences at 2026-01-07 23:06:22 from Nathan Bossart

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Tom Lane	2025-12-29 18:08:39	Re: Fixing some ancient errors in hash join costing
Previous Message	Manni Wood	2025-12-29 17:03:17	Re: Speed up COPY FROM text/CSV parsing using SIMD