Re: POC: make mxidoff 64 bits

From: Heikki Linnakangas <hlinnaka(at)iki(dot)fi>
To: Maxim Orlov <orlovmg(at)gmail(dot)com>, wenhui qiu <qiuwenhuifx(at)gmail(dot)com>
Cc: Alexander Korotkov <aekorotkov(at)gmail(dot)com>, Postgres hackers <pgsql-hackers(at)lists(dot)postgresql(dot)org>
Subject: Re: POC: make mxidoff 64 bits
Date: 2025-11-13 16:04:48
Message-ID: 54aa8f65-f0e4-4464-b543-e0399c1cab1e@iki.fi
Lists: pgsql-hackers

I realized that this issue was still outstanding:

On 01/04/2025 21:25, Heikki Linnakangas wrote:
> Thanks! I did some manual testing of this. I created a little helper
> function to consume multixids, to test the autovacuum behavior, and
> found one issue:
>
> If you consume a lot of multixid members space, by creating lots of
> multixids with a huge number of members in each, you can end up with a
> very bloated members SLRU, and autovacuum is in no hurry to clean it up.
> Here's what I did:
>
> 1. Installed attached test module
> 2. Ran "select consume_multixids(10000, 100000);" many times
> 3. Ran:
>
> $ du -h data/pg_multixact/members/
> 26G    data/pg_multixact/members/
>
> When I run "vacuum freeze; select * from pg_database;", I can see that
> 'datminmxid' for the current database is advanced. However, autovacuum
> is in no hurry to vacuum 'template0' and 'template1', so pg_multixact/
> members/ does not get truncated. Eventually, when
> autovacuum_multixact_freeze_max_age is reached, it presumably will, but
> you will run out of disk space before that.
>
> There is this check for members size at the end of SetOffsetVacuumLimit():
>
>>
>>     /*
>>      * Do we need autovacuum?    If we're not sure, assume yes.
>>      */
>>     return !oldestOffsetKnown ||
>>         (nextOffset - oldestOffset > MULTIXACT_MEMBER_AUTOVAC_THRESHOLD);
>
> And the caller (SetMultiXactIdLimit()) will in fact signal the
> autovacuum launcher after "vacuum freeze" because of that. But
> autovacuum launcher will look at the datminmxid / relminmxid values, see
> that they are well within autovacuum_multixact_freeze_max_age, and do
> nothing.
>
> This is a very extreme case, but clearly the code to signal autovacuum
> launcher, and the freeze age cutoff that autovacuum then uses, are not
> in sync.
>
> This patch removed MultiXactMemberFreezeThreshold(), per my suggestion,
> but we threw the baby out with the bathwater. We discussed that in this
> thread, but didn't come up with any solution. But ISTM we still need
> something like MultiXactMemberFreezeThreshold() to trigger autovacuum
> freezing if the members have grown too large.

Here's a new patch version that addresses the above issue. I resurrected
MultiXactMemberFreezeThreshold(), using the same logic as before, just
using pretty arbitrary thresholds of 1 and 2 billion offsets instead of
the safe/danger thresholds derived from MaxMultiOffset. That gives
roughly the same behavior wrt. calculating effective freeze age as before.
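
To illustrate the kind of calculation described above, here is a minimal standalone sketch. It is not the code from the patch: the function name effective_freeze_age(), the constant names, and reading "1 and 2 billion" as decimal billions are all assumptions made for this example. It follows the shape of the pre-existing MultiXactMemberFreezeThreshold() logic: below the lower threshold the configured freeze max age is used unchanged, and between the two thresholds the effective age shrinks in proportion to how far member usage has gone past the lower one.

```c
#include <assert.h>
#include <stdint.h>

/* Assumed values: "1 and 2 billion offsets", read as decimal billions. */
#define MEMBER_SAFE_THRESHOLD   UINT64_C(1000000000)
#define MEMBER_DANGER_THRESHOLD UINT64_C(2000000000)

/*
 * Hypothetical helper: compute an effective multixact freeze age.
 *
 * members        - number of member offsets currently in use
 * multixacts     - number of multixids currently in use
 * freeze_max_age - stands in for autovacuum_multixact_freeze_max_age
 */
int
effective_freeze_age(uint64_t members, uint32_t multixacts, int freeze_max_age)
{
    double   fraction;
    uint32_t victim_multixacts;
    uint32_t result;

    assert(freeze_max_age >= 0);

    /* Low member usage: no need to be more aggressive than configured. */
    if (members <= MEMBER_SAFE_THRESHOLD)
        return freeze_max_age;

    /* How far past the lower threshold are we, relative to the upper one? */
    fraction = (double) (members - MEMBER_SAFE_THRESHOLD) /
        (double) (MEMBER_DANGER_THRESHOLD - MEMBER_SAFE_THRESHOLD);

    /* At or beyond the upper threshold: freeze as aggressively as possible. */
    if (fraction >= 1.0)
        return 0;

    /* Try to get rid of a proportional share of the live multixids. */
    victim_multixacts = (uint32_t) (multixacts * fraction);
    result = multixacts - victim_multixacts;

    /* Never be less aggressive than the configured setting. */
    return (result < (uint32_t) freeze_max_age) ? (int) result : freeze_max_age;
}
```

Under these assumed thresholds, 1.5 billion members with 1000 live multixids is halfway between the two thresholds, so the sketch returns an effective freeze age of 500 instead of the configured maximum.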

Another change is that I removed the offset-based emergency vacuum
triggering. With 64-bit offsets, we never need to shut down the system
to prevent offset wraparound, so even if the offsets SLRU grows large,
it's not an "emergency" the same way that wraparound is. Consuming lots
of disk space could be a problem, of course, but we can let autovacuum
deal with that at the normal pace, like it deals with bloated tables.

The heuristics could surely be made better and/or more configurable, but
I think this is good enough for now.

I included these changes as a separate patch for review purposes, but it
ought to be squashed with the main patch before committing.

- Heikki

Attachment                                                       Content-Type  Size
v25-0001-Move-pg_multixact-SLRU-page-format-definitions-t.patch  text/x-patch  10.3 KB
v25-0002-Use-64-bit-multixact-offsets.patch                      text/x-patch  37.9 KB
v25-0003-Add-pg_upgrade-for-64-bit-multixact-offsets.patch       text/x-patch  30.9 KB
v25-0004-Remove-oldestOffset-oldestOffsetKnown-from-multi.patch  text/x-patch  6.1 KB
v25-0005-Reintroduce-MultiXactMemberFreezeThreshold.patch        text/x-patch  17.0 KB
v25-0006-TEST-bump-catversion.patch                              text/x-patch  798 bytes
v25-0007-TEST-Add-test-for-64-bit-mxoff-in-pg_resetwal.patch     text/x-patch  4.9 KB
v25-0008-TEST-Add-test-for-wraparound-of-next-new-multi-i.patch  text/x-patch  5.2 KB
v25-0009-TEST-Add-test-for-64-bit-mxoff-in-pg_upgrade.patch      text/x-patch  12.0 KB
v25-0010-TEST-add-consume_multixids-function.patch               text/x-patch  5.0 KB
