Re: Conflict detection for update_deleted in logical replication

From: shveta malik <shveta(dot)malik(at)gmail(dot)com>
To: "Zhijie Hou (Fujitsu)" <houzj(dot)fnst(at)fujitsu(dot)com>
Cc: Amit Kapila <amit(dot)kapila16(at)gmail(dot)com>, vignesh C <vignesh21(at)gmail(dot)com>, Masahiko Sawada <sawada(dot)mshk(at)gmail(dot)com>, Nisha Moond <nisha(dot)moond412(at)gmail(dot)com>, "Hayato Kuroda (Fujitsu)" <kuroda(dot)hayato(at)fujitsu(dot)com>, pgsql-hackers <pgsql-hackers(at)postgresql(dot)org>, shveta malik <shveta(dot)malik(at)gmail(dot)com>
Subject: Re: Conflict detection for update_deleted in logical replication
Date: 2025-06-19 11:04:38
Message-ID: CAJpy0uCK2L=w7GrvsE=RN4kMcJEE9tWMO1FSuQ+JMxYp66at2Q@mail.gmail.com
Views: Whole Thread | Raw Message | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

>
> Here is the V38 patch set which includes the following changes:
>

Thank You for the patches. Few comments:

1)
+ <para>
+ Note that commit timestamps and origin data retained by enabling the
+ <link linkend="sql-createsubscription-params-with-retain-conflict-info"><literal>retain_conflict_info</literal></link>
+ option will not be preserved during the upgrade. As a
+ result, the upgraded subscriber might be unable to detect conflicts or log
+ relevant commit timestamps and origins when applying changes from the
+ publisher occurring during the upgrade.
+ </para>

This statement is true even for changes pending from 'before' the
upgrade. So we shall change last line where we mention 'during the
upgrade'

2)
+ <para>
+ Note that the information for conflict detection cannot be purged if
+ the subscription is disabled; thus, the information will accumulate
+ until the subscription is enabled. To prevent excessive accumulation,
+ it is recommended to disable <literal>retain_conflict_info</literal>
+ if the subscription will be inactive for an extended period.
+ </para>

I think this can be put in WARNING or CAUTION tags as this is
something which if neglected can result in system bloat.

3)
postgres=# create subscription sub3 connection 'dbname=postgres
host=localhost user=shveta port=5433' publication pub2 WITH (failover
= true, retain_conflict_info = true);
WARNING: commit timestamp and origin data required for detecting
conflicts won't be retained
HINT: Consider setting "track_commit_timestamp" to true.
ERROR: subscription "sub3" already exists

In CreateSubscription(), we shall move CheckSubConflictInfoRetention()
after sub-duplicity check. Above WARNING with the existing-sub ERROR
looks odd.

4)
In check_new_cluster_replication_slots(), earlier we were not doing
any checks for 'count of logical slots on new cluster' if there were
no logical slots on old cluster (i.e. nslots_on_old is 0). Now we are
doing a 'nslots_on_new' related check even when 'nslots_on_old' is 0
for the case when RCI is enabled. Shouldn't we skip 'nslots_on_new'
check when 'nslots_on_old' is 0?

5)
We refer to 'update_deleted' in patch1's comment when the conflict is
not yet created. Is it okay?

thanks
Shveta

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Hayato Kuroda (Fujitsu) 2025-06-19 11:05:49 RE: Slot's restart_lsn may point to removed WAL segment after hard restart unexpectedly
Previous Message Aleksander Alekseev 2025-06-19 10:39:48 Re: [PATCH] Split varlena.c into varlena.c and bytea.c