Re: Bug in MultiXact replay compat logic for older minor version after crash-recovery

From: Heikki Linnakangas <hlinnaka(at)iki(dot)fi>
To: Andrey Borodin <x4mmm(at)yandex-team(dot)ru>
Cc: 段坤仁(刻韧) <duankunren(dot)dkr(at)alibaba-inc(dot)com>, pgsql-hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: Bug in MultiXact replay compat logic for older minor version after crash-recovery
Date: 2026-03-20 15:14:04
Message-ID: 5b7f0a04-4a60-44bb-9d2c-8917af0b10fa@iki.fi
Lists: pgsql-hackers

On 20/03/2026 15:39, Andrey Borodin wrote:
>> On 20 Mar 2026, at 16:19, Heikki Linnakangas <hlinnaka(at)iki(dot)fi> wrote:
>>
>> Hmm, after startup, before we have zeroed any pages, it still works though. So I think my patch works, but it means that tracking the latest page we have zeroed is not merely an optimization to avoid excessive SimpleLruDoesPhysicalPageExist() calls, it's needed for correctness. Need to adjust the comments for that.
>
> If we are sure the buffers do not hold this page, we can detect it via the FS.
> Otherwise... nothing bad can happen, actually. We might get a false positive and zero the page once more.

Zeroing the page again is dangerous because the CREATE_ID records can be
out of order. The page might already contain some later multixids, and
zeroing will overwrite them.
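
To illustrate the hazard, here is a toy model in C. It is only a sketch under
stated assumptions, not the patch and not the real SLRU code: ToyOffsets,
toy_replay_create(), the trust_tracking flag and the tiny page size are all
hypothetical names and values made up for the example. It assumes that a
"false positive" means the record for the first multixid on a page zeroes the
page again, even though an earlier-replayed record for a later multixid has
already written to it.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>
#include <string.h>

#define TOY_ENTRIES_PER_PAGE 4	/* toy value, far smaller than a real page */
#define TOY_NUM_PAGES 4

/* Hypothetical stand-in for the offsets SLRU plus the tracking variable. */
typedef struct ToyOffsets
{
	uint32_t	offsets[TOY_NUM_PAGES][TOY_ENTRIES_PER_PAGE];
	int64_t		last_initialized_page;	/* -1: nothing zeroed since startup */
} ToyOffsets;

static void
toy_init(ToyOffsets *t)
{
	memset(t->offsets, 0, sizeof(t->offsets));
	t->last_initialized_page = -1;
}

/*
 * Replay one CREATE_ID-like record that stores 'offset' for 'multi'.
 *
 * With trust_tracking = true, the page is zeroed only if it lies beyond the
 * last page we zeroed.  With trust_tracking = false, the first entry on a
 * page additionally forces a zeroing, which models the false positive.
 */
static void
toy_replay_create(ToyOffsets *t, uint32_t multi, uint32_t offset,
				  bool trust_tracking)
{
	uint32_t	pageno = multi / TOY_ENTRIES_PER_PAGE;
	uint32_t	entryno = multi % TOY_ENTRIES_PER_PAGE;
	bool		init_needed;

	assert(pageno < TOY_NUM_PAGES);

	init_needed = (int64_t) pageno > t->last_initialized_page;
	if (!trust_tracking && entryno == 0)
		init_needed = true;		/* false positive: zero the page once more */

	if (init_needed)
	{
		/* Wipes every entry on the page, including later multixids. */
		memset(t->offsets[pageno], 0, sizeof(t->offsets[pageno]));
		t->last_initialized_page = pageno;
	}

	t->offsets[pageno][entryno] = offset;
}

/* Return the stored offset for 'multi'; 0 means "not set" in this toy. */
static uint32_t
toy_lookup(const ToyOffsets *t, uint32_t multi)
{
	return t->offsets[multi / TOY_ENTRIES_PER_PAGE][multi % TOY_ENTRIES_PER_PAGE];
}
```

In this model, replaying multixid 5 and then multixid 4 (both on toy page 1)
with trust_tracking = false leaves toy_lookup() for 5 at zero, because the
second record re-zeroes the page. With trust_tracking = true both offsets
survive.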

> If we got init_needed==false, maybe cache it for this page and set last_initialized_offsets_page = pageno?
> Or, perhaps, XLOG_MULTIXACT_ZERO_OFF_PAGE will do it for us anyway, but a bit later.

My patch does set last_initialized_offsets_page = pageno whenever it
initializes the page, so yeah, I think we're good there.

- Heikki
