Re: BF mamba failure

From: Kouber Saparev <kouber(at)gmail(dot)com>
To: Michael Paquier <michael(at)paquier(dot)xyz>
Cc: Bertrand Drouvot <bertranddrouvot(dot)pg(at)gmail(dot)com>, pgsql-hackers(at)lists(dot)postgresql(dot)org
Subject: Re: BF mamba failure
Date: 2025-09-11 13:35:01
Message-ID: CAN4RuQv3OTJ1Owqn2U+7ZbvL-N-nZYAUjF_NG8o12ayFN-=0GQ@mail.gmail.com
Views: Whole Thread | Raw Message | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

На чт, 11.09.2025 г. в 2:28 Michael Paquier <michael(at)paquier(dot)xyz> написа:

> As a start, are these failures only in the startup process? Has the
> startup process reached a consistent state when the problem happens
> because the replay code is too eager at removing the stats entries?
> Has it not reached a consistent state. These could be useful hints to
> extract a reproducible test case, looking for common patterns.
>

The pattern is the same, although I am not 100% sure that the same replica
is having this - it is a cascaded streaming replica, i.e. a replica of
another replica. Once we had this in October 2024, with version 15.4, then
in August 2025 with 17.3, and now in September again (17.3). The database
is working for month(s) perfectly fine in a heavy production workload (lots
of WALs etc.), and then all of a sudden it shuts down.

Thanks for the feedback, and let me know if I could provide any additional
info.

--
Kouber

In response to

Browse pgsql-hackers by date

  From Date Subject
Next Message Ashutosh Sharma 2025-09-11 13:59:44 Re: Clear logical slot's 'synced' flag on promotion of standby
Previous Message Jelte Fennema-Nio 2025-09-11 13:29:27 Re: Extension security improvement: Add support for extensions with an owned schema