| From: | Masahiko Sawada <sawada(dot)mshk(at)gmail(dot)com> |
|---|---|
| To: | Amit Kapila <amit(dot)kapila16(at)gmail(dot)com> |
| Cc: | Vitaly Davydov <v(dot)davydov(at)postgrespro(dot)ru>, Ashutosh Bapat <ashutosh(dot)bapat(dot)oss(at)gmail(dot)com>, PostgreSQL Hackers <pgsql-hackers(at)lists(dot)postgresql(dot)org> |
| Subject: | Re: Support logical replication of DDLs, take2 |
| Date: | 2026-04-30 20:40:47 |
| Message-ID: | CAD21AoC_HwTLRSWmD0d=E0Adftu0LE=Ep9FWJf2m1BJyq7x4Gw@mail.gmail.com |
| Views: | Whole Thread | Raw Message | Download mbox | Resend email |
| Thread: | |
| Lists: | pgsql-general pgsql-hackers |
On Wed, Apr 29, 2026 at 9:44 PM Amit Kapila <amit(dot)kapila16(at)gmail(dot)com> wrote:
>
> On Wed, Apr 29, 2026 at 3:19 AM Masahiko Sawada <sawada(dot)mshk(at)gmail(dot)com> wrote:
> >
> > On Mon, Apr 27, 2026 at 11:32 PM Amit Kapila <amit(dot)kapila16(at)gmail(dot)com> wrote:
> > >
> > > Yes, there will be a maintenance cost of JSON-based deparsing
> > > approach. But note that multiple senior people (Alvaro Herrera, Robert
> > > Haas) [1] seems to favor that approach. So, I am not sure we can
> > > conclude to abandon that approach without those people or some other
> > > senior people agreeing to abandon it. To be clear, I am not against
> > > considering a new/different approach for DDL replication but just that
> > > it is not clear that old/existing approach can be ruled out without
> > > more discussion on it,
> >
> > Thank you for pointing it out. Just to be clear, IIUC what they liked
> > was to use JSON string representation of DDLs, but not JSON string
> > representation of DDLs that are deparsed from parse nodes, no?
> >
>
> As per my understanding, we built deparsing stuff with a goal of
> supporting DDL replication and Alvaro was the original author of that
> work, see [1]. The benefit it provides flexibility in terms of
> filtering by decoding plugin, if any, or changing the DDL (like
> schema-mapping) during apply. It is not clear to me if we can achive
> similar level of flexibility with other approach.
I think we can generate the same JSON-string representation of DDLs
from catalog information, it would also require a lot of code, though.
It would be independent from parse nodes and if we implement it as an
option for pg_get_xxx_ddl() functionality it would be able to be
reused by other tools too.
>
> >
> > >
> > > We would need to maintain the JSON serialization code whenever
> > > > creating or modifying parse nodes, regardless of whether the changes
> > > > were related to DDL replication. IIUC, this was the primary reason the
> > > > feature didn't cross the finish line.
> > > >
> > > > Additionally, I think there is another design issue: it is not
> > > > output-plugin agnostic. Since the deparsed DDL was written by a
> > > > logical-replication-specific event trigger, third-party logical
> > > > decoding plugins cannot easily detect DDL events.
> > > >
> > >
> > > Why RmgrId like RM_LOGICALDDLMSG_ID and XLOG_LOGICAL_DDL_MESSAGE wal
> > > info is not sufficient for this? Decoder will add a message like
> > > REORDER_BUFFER_CHANGE_DDL which can be used to detect DDL message, no?
> >
> > Right, but I'm not sure this is a good developer experience that
> > additional steps are required to capture DDL events for other plugins
> > while changes of INSERT/UPDATE/DELETE/TRUNCATE are passed from the
> > logical decoding by default.
> >
>
> Yes, there could probably be additional steps for plugins but they
> must be doing a few things already which are defined at publication
> level like column lists, row filtering, something related to RI, etc.
I think those publication-level features operate at a somewhat
different layer than the fundamental mechanism of capturing DDL
events. Plugins filter rows or columns based on configuration, but the
logical decoding itself guarantees that the DML events are reliably
passed to them. Given that the TRUNCATE in logical replication already
works so, I guess DDL should have the same fundamental guarantee.
it's unclear to me how plugins could reliably manage these event
triggers. While a plugin might create an event trigger during the
startup callback if it doesn't exist, it cannot drop it during the
shutdown callback. We also cannot establish a dependency between an
event trigger and a logical replication slot. We would likely need to
invent a new plugin callback specifically invoked at slot drop time
just to clean it up. Also, if different plugins want to capture DDL
events, they could end up registering different event triggers,
emitting multiple DDL WAL records for the same DDL event.
> To reduce the plugin work, one naive idea is to let the event triggers
> be registered at first logical slot creation or may be at init time of
> plugin.
I think we need to note that replication slot creation and drop are
non-transactional operations. We need to make sure that both logical
slots and the event trigger are not orphaned in error or server crash
cases.
Regards,
--
Masahiko Sawada
Amazon Web Services: https://aws.amazon.com
| From | Date | Subject | |
|---|---|---|---|
| Next Message | Masahiko Sawada | 2026-05-01 18:40:33 | Re: Support logical replication of DDLs, take2 |
| Previous Message | Laurenz Albe | 2026-04-30 19:35:43 | Re: Confirmation on concurrent SELECT FOR UPDATE with ON CONFLICT DO NOTHING |
| From | Date | Subject | |
|---|---|---|---|
| Next Message | Erik Rijkers | 2026-04-30 20:52:36 | warning under gcc 16.1.0 |
| Previous Message | Tristan Partin | 2026-04-30 19:38:57 | Re: Add pg_stat_kind_info system view |