From: | shveta malik <shveta(dot)malik(at)gmail(dot)com> |
---|---|
To: | vignesh C <vignesh21(at)gmail(dot)com> |
Cc: | Shlok Kyal <shlok(dot)kyal(dot)oss(at)gmail(dot)com>, Chao Li <li(dot)evan(dot)chao(at)gmail(dot)com>, Masahiko Sawada <sawada(dot)mshk(at)gmail(dot)com>, "Hayato Kuroda (Fujitsu)" <kuroda(dot)hayato(at)fujitsu(dot)com>, Nisha Moond <nisha(dot)moond412(at)gmail(dot)com>, Dilip Kumar <dilipbalaut(at)gmail(dot)com>, Amit Kapila <amit(dot)kapila16(at)gmail(dot)com>, Peter Smith <smithpb2250(at)gmail(dot)com>, Peter Eisentraut <peter(at)eisentraut(dot)org>, PostgreSQL Hackers <pgsql-hackers(at)lists(dot)postgresql(dot)org>, Euler Taveira <euler(at)eulerto(dot)com>, Michael Paquier <michael(at)paquier(dot)xyz>, "Zhijie Hou (Fujitsu)" <houzj(dot)fnst(at)fujitsu(dot)com>, "Jonathan S(dot) Katz" <jkatz(at)postgresql(dot)org>, shveta malik <shveta(dot)malik(at)gmail(dot)com> |
Subject: | Re: Logical Replication of sequences |
Date: | 2025-09-25 06:53:40 |
Message-ID: | CAJpy0uDTDzH6Cj_Fxq2O3Pn9gRq-3LgDJcZwEiZqXVkA8bkZ-w@mail.gmail.com |
Views: | Whole Thread | Raw Message | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-hackers |
On Tue, Sep 23, 2025 at 6:39 PM vignesh C <vignesh21(at)gmail(dot)com> wrote:
>
> On Mon, 22 Sept 2025 at 12:03, shveta malik <shveta(dot)malik(at)gmail(dot)com> wrote:
> >
> > On Thu, Sep 18, 2025 at 4:07 PM vignesh C <vignesh21(at)gmail(dot)com> wrote:
> > >
> > > Thanks for the comments, these are handled in the attached patch.
> > >
> >
> > Please find a few comments:
> >
> >
> > patch005:
> > 1)
> > GetSubscriptionRelations:
> > + /* Skip sequences if they were not requested */
> > + if (!get_sequences && (relkind == RELKIND_SEQUENCE))
> > + continue;
> > +
> > + /* Skip tables if they were not requested */
> > + if (!get_tables && (relkind != RELKIND_SEQUENCE))
> > + continue;
> >
> > The use of negated conditions makes the logic harder to follow,
> > especially in the second if block.
> >
> > Can we write it as:
> > bool is_sequence = (relkind == RELKIND_SEQUENCE);
> >
> > /* Skip if the relation type is not requested */
> > if ((get_tables && is_sequence) ||
> > (get_sequences && !is_sequence))
> > continue;
> >
> > Or at-least:
> > /* Skip sequences if they were not requested */
> > if (get_tables && (relkind == RELKIND_SEQUENCE))
> > continue;
> >
> > /* Skip tables if they were not requested */
> > if (get_sequences && (relkind != RELKIND_SEQUENCE))
> > continue;
>
> I felt this would not work. Say we want both sequences and tables,
> won't it skip the sequence this way from:
> if (get_tables && (relkind == RELKIND_SEQUENCE))
> continue;
>
Okay, I see your point. In that case, could we reverse the conditions
instead? That seems like the more obvious choice in terms of
readability :
if ((relkind == RELKIND_SEQUENCE && !get_sequences))
continue;
if ((relkind != RELKIND_SEQUENCE && !get_tables))
continue;
For second if-block, more understandable conditions will be:
(relkind == RELKIND_RELATION || relkind == RELKIND_PARTITIONED_TABLE)
&& !get_tables
But here we will have to check relkind twice, so I leave the decision to you.
~~
Few comments on latest patch:
LogicalRepSyncSequences():
1)
+ seq_entry = hash_search(sequences_to_copy, &key,
+ HASH_ENTER, &found);
+ Assert(seq_entry != NULL);
Since we are using HASH_ENTER, it will be good to add Assert(!found) as well.
2)
+ sequences_to_copy = hash_create("Logical replication sequence sync
worker sequences",
+ 256, &ctl, HASH_ELEM | HASH_FUNCTION | HASH_COMPARE | HASH_CONTEXT);
+
The name of the hash-table looks odd. Shall it simply be 'Logical
replication sequences'. 'Logical Replication' is good enough to give
an indication that these are sequences being synchronized by seq-sync
worker.
~~
copy_sequences():
3)
+ if (batch_size >= MAX_SEQUENCES_SYNC_PER_BATCH ||
+ (current_index + batch_size == total_seqs))
+ break;
In the first condition, I think a better comparison will be equality
one (batch_size == MAX_SEQUENCES_SYNC_PER_BATCH). We are not letting
batch_size go beyond MAX_SEQUENCES_SYNC_PER_BATCH.
4)
+ if (batch_size == 0)
+ {
+ CommitTransactionCommand();
+ break;
+ }
I could not think of a scenario when this will be hit. The outer loop
condition has 'while (current_index < total_seqs)', so batch_size has
to be something. Even if we skip some entries due to
remote_seq_queried being set to true already, that should not make
batch_size as 0 as the entries with remote_seq_queried=true are
already accounted for in current_index. Or am I missing something
here?
~~
sequencesync_list_invalidate_cb():
5)
+ /* invalidate all entries */
+ hash_seq_init(&status, sequences_to_copy);
+ while ((entry = (LogicalRepSequenceInfo *) hash_seq_search(&status)) != NULL)
+ entry->entry_valid = false;
Can you please elaborate when this case can be hit? I see such logic
in all such invalidation functions registered with
CacheRegisterRelcacheCallback(), but could not find any relevant
comment.
~~
Reviewing further.
thanks
Shveta
From | Date | Subject | |
---|---|---|---|
Next Message | Shubham Khanna | 2025-09-25 07:07:24 | Re: Add support for specifying tables in pg_createsubscriber. |
Previous Message | Michael Paquier | 2025-09-25 06:45:21 | Re: Orphan page in _bt_split |