Re: vac_truncate_clog()'s bogus check leads to bogusness

From: Andres Freund <andres(at)anarazel(dot)de>
To: Noah Misch <noah(at)leadboat(dot)com>
Cc: pgsql-hackers(at)postgresql(dot)org, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Subject: Re: vac_truncate_clog()'s bogus check leads to bogusness
Date: 2023-06-22 16:45:18
Message-ID: 20230622164518.nsnhypphgogmdafg@awork3.anarazel.de
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

Hi,

On 2023-06-21 21:50:39 -0700, Noah Misch wrote:
> On Wed, Jun 21, 2023 at 03:12:08PM -0700, Andres Freund wrote:
> > When vac_truncate_clog() returns early
> ...
> > we haven't released the lwlock that we acquired earlier
>
> > Until there's some cause for the session to call LWLockReleaseAll(), the lock
> > is held. Until then neither the process holding the lock, nor any other
> > process, can finish vacuuming. We don't even have an assert against a
> > self-deadlock with an already held lock, oddly enough.
>
> I agree with this finding. Would you like to add the lwlock releases, or
> would you like me to?

Happy with either. I do have code and testcase, so I guess it would make
sense for me to do it?

> The bug has been in all released versions for 2.5 years, yet it escaped
> notice. That tells us something. Bogus values have gotten rare? The
> affected session tends to get lucky and call LWLockReleaseAll() soon?

I am not sure either. I suspect that part of it is that people couldn't even
pinpoint the problem when it happened. Process exit calls LWLockReleaseAll(),
which I assume would avoid the problem in many cases.

Greetings,

Andres Freund

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Tom Lane 2023-06-22 16:47:48 Re: [PATCH] Extend ALTER OPERATOR to support adding commutator, negator, hashes, and merges
Previous Message Andres Freund 2023-06-22 16:38:13 Re: Assert while autovacuum was executing