| From: | Xuneng Zhou <xunengzhou(at)gmail(dot)com> |
|---|---|
| To: | pgsql-hackers <pgsql-hackers(at)lists(dot)postgresql(dot)org> |
| Cc: | Noah Misch <noah(at)leadboat(dot)com>, Michael Paquier <michael(at)paquier(dot)xyz> |
| Subject: | Add WALRCV_CONNECTING state to walreceiver |
| Date: | 2025-12-12 04:51:00 |
| Message-ID: | CABPTF7X_Jgmyk1FBVNf3tyAOKqU55LLpLMzWkGtEAb_jQWVN=g@mail.gmail.com |
| Views: | Whole Thread | Raw Message | Download mbox | Resend email |
| Thread: | |
| Lists: | pgsql-hackers |
Hi Hackers,
Bug #19093 [1] reported that pg_stat_wal_receiver.status = 'streaming'
does not accurately reflect streaming health. In that discussion,
Noah noted that even before the reported regression, status =
'streaming' was unreliable because walreceiver sets it during early
startup, before attempting a connection. He suggested:
"Long-term, in master only, perhaps we should introduce another status
like 'connecting'. Perhaps enact the connecting->streaming status
transition just before tendering the first byte of streamed WAL to the
startup process. Alternatively, enact that transition when the startup
process accepts the
first streamed byte."
Michael and I also thought this could be a useful addition. This patch
implements that suggestion by adding a new WALRCV_CONNECTING state.
== Background ==
Currently, walreceiver transitions directly from STARTING to STREAMING
early in WalReceiverMain(), before any WAL data has been received.
This means status = 'streaming' can be observed even when:
- The connection to the primary has not been established
- No WAL data has actually been received or flushed
This makes it difficult for monitoring tools to distinguish between a
healthy streaming replica and one that is merely attempting to stream.
== Proposal ==
Introduce WALRCV_CONNECTING as an intermediate state between STARTING
and STREAMING:
- When walreceiver starts, it enters CONNECTING (instead of going
directly to STREAMING).
- The transition to STREAMING occurs in XLogWalRcvFlush(), inside the
existing spinlock-protected block that updates flushedUpto.
Feedbacks welcome.
[1] https://www.postgresql.org/message-id/flat/19093-c4fff49a608f82a0%40postgresql.org
--
Best,
Xuneng
| Attachment | Content-Type | Size |
|---|---|---|
| 0001-Add-WALRCV_CONNECTING-state-to-walreceiver.patch | application/x-patch | 4.5 KB |
| From | Date | Subject | |
|---|---|---|---|
| Next Message | Dilip Kumar | 2025-12-12 04:58:52 | Re: Proposal: Conflict log history table for Logical Replication |
| Previous Message | shveta malik | 2025-12-12 04:32:18 | Re: Proposal: Conflict log history table for Logical Replication |