Re: Logical Replication of sequences

From: Amit Kapila <amit(dot)kapila16(at)gmail(dot)com>
To: Dilip Kumar <dilipbalaut(at)gmail(dot)com>
Cc: vignesh C <vignesh21(at)gmail(dot)com>, shveta malik <shveta(dot)malik(at)gmail(dot)com>, Shlok Kyal <shlok(dot)kyal(dot)oss(at)gmail(dot)com>, Chao Li <li(dot)evan(dot)chao(at)gmail(dot)com>, Masahiko Sawada <sawada(dot)mshk(at)gmail(dot)com>, "Hayato Kuroda (Fujitsu)" <kuroda(dot)hayato(at)fujitsu(dot)com>, Nisha Moond <nisha(dot)moond412(at)gmail(dot)com>, Peter Smith <smithpb2250(at)gmail(dot)com>, Peter Eisentraut <peter(at)eisentraut(dot)org>, PostgreSQL Hackers <pgsql-hackers(at)lists(dot)postgresql(dot)org>, Euler Taveira <euler(at)eulerto(dot)com>, Michael Paquier <michael(at)paquier(dot)xyz>, "Zhijie Hou (Fujitsu)" <houzj(dot)fnst(at)fujitsu(dot)com>, "Jonathan S(dot) Katz" <jkatz(at)postgresql(dot)org>
Subject: Re: Logical Replication of sequences
Date: 2025-10-07 12:16:04
Message-ID: CAA4eK1JvXHGDVS3BB6Og0CC67G-QYKO2xkmDW4pM3+YE_ty1fA@mail.gmail.com
Views: Whole Thread | Raw Message | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On Tue, Oct 7, 2025 at 4:56 PM Dilip Kumar <dilipbalaut(at)gmail(dot)com> wrote:
>
> On Tue, Oct 7, 2025 at 3:51 PM Amit Kapila <amit(dot)kapila16(at)gmail(dot)com> wrote:
> >
> > On Tue, Oct 7, 2025 at 2:21 PM Dilip Kumar <dilipbalaut(at)gmail(dot)com> wrote:
> > >
> > > On Mon, Oct 6, 2025 at 4:33 PM vignesh C <vignesh21(at)gmail(dot)com> wrote:
> > > >
> > > > On Mon, 6 Oct 2025 at 12:07, vignesh C <vignesh21(at)gmail(dot)com> wrote:
> > > > >
> > > > > On Sat, 4 Oct 2025 at 21:24, Amit Kapila <amit(dot)kapila16(at)gmail(dot)com> wrote:
> > > > > >
> > > > > > On Tue, Sep 30, 2025 at 9:55 PM vignesh C <vignesh21(at)gmail(dot)com> wrote:
> > > > > > >
> > > > > >
> > > > > > In the 0001 patch, pg_get_sequence_data() exposes two new fields
> > > > > > log_cnt and page_lsn. I see that the later subscriber-side patch uses
> > > > > > both, the first one in SetSequence(). It is not clear from the
> > > > > > comments or the commit message of 0001 why it is necessary to use
> > > > > > log_cnt when setting the sequence. Can you explain what the problem
> > > > > > will be if we don't use log_cnt during sequence sync?
> > > > >
> > > > > I thought to keep the log_cnt value the same value as the publisher.
> > > > > I have verified from the upgrade that we don't retain the log_cnt
> > > > > value after upgrade, even if we copy log_cnt, the value will not be
> > > > > retained. The attached
> > > > > v20251006-0001-Enhance-pg_get_sequence_data-function.patch has the
> > > > > changes to remove log_cnt.
> > > >
> > > > Here is the rebased remaining patches.
> > >
> > > While testing the patches with different combinations to make
> > > publications, I do not understand why we don't support ALL SEQUENCE
> > > with some table option, or is it future pending work.
> > >
> >
> > Yes, it is left for future similar to the cases like FOR SEQUENCE s1
> > or FOR SEQUENCES IN SCHEMA. The key idea was to first support the
> > cases required for upgrade and we can later extend the feature after
> > some user feedback or separate discussion with -hackers to see what
> > others think. Does that sound reasonable to you?
>
> Yeah that's correct, I think the main use case for sequence
> synchronization is upgrade and it makes sense to use ALL TABLES/ALL
> SEQUENCES for upgrade. However, if a user is using selective tables
> for upgrade for now they might not be able to use ALL SEQUENCE and
> that should be fine as we are going to provide add on functionality.
>

Yes, we can add the additional functionality for selective sequences
if required but do we have an option to allow upgrade of selective
tables?

> I have one more question: while testing the sequence sync, I found
> this behavior is documented as well[1], but what's the reasoning
> behind it? Why REFRESH PUBLICATION will synchronize only newly added
> sequences and need to use REFRESH PUBLICATION SEQUENCES to
> re-synchronize all sequences.
>

The idea is that REFRESH PUBLICATION should behave similarly for
tables and sequences. This means that this command is primarily used
to add/remove tables/sequences and copy their respective initial
contents. The new command REFRESH PUBLICATION SEQUENCES is to sync the
existing sequences, it shouldn't add any new sequences, now, if it is
too confusing we can discuss having a different syntax for it. In the
meantime, let's make the 0001 publication-side patch ready for commit.

--
With Regards,
Amit Kapila.

In response to

Browse pgsql-hackers by date

  From Date Subject
Next Message Matheus Alcantara 2025-10-07 12:40:51 Re: Optimize LISTEN/NOTIFY
Previous Message Tomas Vondra 2025-10-07 12:08:27 Re: Should we update the random_page_cost default value?