Re: Fix primary crash continually with invalid checkpoint after promote

From: Nathan Bossart <nathandbossart(at)gmail(dot)com>
To: Zhao Rui <875941708(at)qq(dot)com>
Cc: PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>, pgsql-bugs <pgsql-bugs(at)lists(dot)postgresql(dot)org>
Subject: Re: Fix primary crash continually with invalid checkpoint after promote
Date: 2022-04-26 18:16:29
Message-ID: 20220426181629.GA3002456@nathanxps13
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-bugs pgsql-hackers

On Tue, Apr 26, 2022 at 03:16:13PM +0800, Zhao Rui wrote:
> In function CreateRestartPoint, control file is updated and old wals are removed. But in some situations, control file is not updated, old wals are still removed. Thus produces an invalid checkpoint with nonexistent wal. Crucial log: "invalid primary checkpoint record", "could not locate a valid checkpoint record".

I think this is the same issue tracked here: [0].

[0] https://postgr.es/m/20220316.102444.2193181487576617583.horikyota.ntt%40gmail.com

--
Nathan Bossart
Amazon Web Services: https://aws.amazon.com

In response to

Browse pgsql-bugs by date

  From Date Subject
Next Message Tom Lane 2022-04-26 19:47:13 Re: Fix primary crash continually with invalid checkpoint after promote
Previous Message Tom Lane 2022-04-26 14:11:10 Re: R: 14.1 immutable function, bad performance if check number = 'NaN'

Browse pgsql-hackers by date

  From Date Subject
Next Message Nathan Bossart 2022-04-26 18:33:49 Re: Possible corruption by CreateRestartPoint at promotion
Previous Message David Christensen 2022-04-26 18:15:05 Re: [PATCH] Teach pg_waldump to extract FPIs from the WAL