| From: | Heikki Linnakangas <hlinnaka(at)iki(dot)fi> |
|---|---|
| To: | Maxim Orlov <orlovmg(at)gmail(dot)com>, wenhui qiu <qiuwenhuifx(at)gmail(dot)com> |
| Cc: | Alexander Korotkov <aekorotkov(at)gmail(dot)com>, Postgres hackers <pgsql-hackers(at)lists(dot)postgresql(dot)org> |
| Subject: | Re: POC: make mxidoff 64 bits |
| Date: | 2025-11-13 16:04:48 |
| Message-ID: | 54aa8f65-f0e4-4464-b543-e0399c1cab1e@iki.fi |
| Views: | Whole Thread | Raw Message | Download mbox | Resend email |
| Thread: | |
| Lists: | pgsql-hackers |
I realized that this issue was still outstanding:
On 01/04/2025 21:25, Heikki Linnakangas wrote:
> Thanks! I did some manual testing of this. I created a little helper
> function to consume multixids, to test the autovacuum behavior, and
> found one issue:
>
> If you consume a lot of multixid members space, by creating lots of
> multixids with huge number of members in each, you can end up with a
> very bloated members SLRU, and autovacuum is in no hurry to clean it up.
> Here's what I did:
>
> 1. Installed attached test module
> 2. Ran "select consume_multixids(10000, 100000);" many times
> 3. ran:
>
> $ du -h data/pg_multixact/members/
> 26G data/pg_multixact/members/
>
> When I run "vacuum freeze; select * from pg_database;", I can see that
> 'datminmxid' for the current database is advanced. However, autovacuum
> is in no hurry to vacuum 'template0' and 'template1', so pg_multixact/
> members/ does not get truncated. Eventually, when
> autovacuum_multixact_freeze_max_age is reached, it presumably will, but
> you will run out of disk space before that.
>
> There is this check for members size at the end of SetOffsetVacuumLimit():
>
>>
>> /*
>> * Do we need autovacuum? If we're not sure, assume yes.
>> */
>> return !oldestOffsetKnown ||
>> (nextOffset - oldestOffset > MULTIXACT_MEMBER_AUTOVAC_THRESHOLD);
>
> And the caller (SetMultiXactIdLimit()) will in fact signal the
> autovacuum launcher after "vacuum freeze" because of that. But
> autovacuum launcher will look at the datminmxid / relminmxid values, see
> that they are well within autovacuum_multixact_freeze_max_age, and do
> nothing.
>
> This is a very extreme case, but clearly the code to signal autovacuum
> launcher, and the freeze age cutoff that autovacuum then uses, are not
> in sync.
>
> This patch removed MultiXactMemberFreezeThreshold(), per my suggestion,
> but we threw this baby with the bathwater. We discussed that in this
> thread, but didn't come up with any solution. But ISTM we still need
> something like MultiXactMemberFreezeThreshold() to trigger autovacuum
> freezing if the members have grown too large.
Here's a new patch version that addresses the above issue. I resurrected
MultiXactMemberFreezeThreshold(), using the same logic as before, just
using pretty arbitrary thresholds of 1 and 2 billion offsets instead of
the safe/danger thresholds derived from MaxMultiOffset. That gives
roughly the same behavior wrt. calculating effective freeze age as before.
Another change is that I removed the offset-based emergency vacuum
triggering. With 64-bit offsets, we never need to shut down the system
to prevent offset wraparound, so even if the offsets SLRU grows large,
it's not an "emergency" the same way that wraparound is. Consuming lots
of disk space could be a problem, of course, but we can let autovacuum
deal with that at the normal pace, like it deals with bloated tables.
The heuristics could surely be made better and/or more configurable, but
I think this good enough for now.
I included these changes as a separate patch for review purposes, but it
ought to be squashed with the main patch before committing.
- Heikki
| Attachment | Content-Type | Size |
|---|---|---|
| v25-0001-Move-pg_multixact-SLRU-page-format-definitions-t.patch | text/x-patch | 10.3 KB |
| v25-0002-Use-64-bit-multixact-offsets.patch | text/x-patch | 37.9 KB |
| v25-0003-Add-pg_upgrade-for-64-bit-multixact-offsets.patch | text/x-patch | 30.9 KB |
| v25-0004-Remove-oldestOffset-oldestOffsetKnown-from-multi.patch | text/x-patch | 6.1 KB |
| v25-0005-Reintroduce-MultiXactMemberFreezeThreshold.patch | text/x-patch | 17.0 KB |
| v25-0006-TEST-bump-catversion.patch | text/x-patch | 798 bytes |
| v25-0007-TEST-Add-test-for-64-bit-mxoff-in-pg_resetwal.patch | text/x-patch | 4.9 KB |
| v25-0008-TEST-Add-test-for-wraparound-of-next-new-multi-i.patch | text/x-patch | 5.2 KB |
| v25-0009-TEST-Add-test-for-64-bit-mxoff-in-pg_upgrade.patch | text/x-patch | 12.0 KB |
| v25-0010-TEST-add-consume_multixids-function.patch | text/x-patch | 5.0 KB |
| From | Date | Subject | |
|---|---|---|---|
| Next Message | Jacob Champion | 2025-11-13 16:23:43 | Re: Few untranslated error messages in OAuth |
| Previous Message | Daniel Gustafsson | 2025-11-13 15:53:14 | Re: pg_getaddrinfo_all() with hintp=NULL |