Re: Odd code around ginScanToDelete

From: Alexander Korotkov <aekorotkov(at)gmail(dot)com>
To: Andres Freund <andres(at)anarazel(dot)de>
Cc: pgsql-hackers(at)postgresql(dot)org
Subject: Re: Odd code around ginScanToDelete
Date: 2026-02-03 22:32:51
Message-ID: CAPpHfdt5BHkpvFnwg_RiMrCdD8W5FhOoahqY2Td-y3kb45UpZw@mail.gmail.com
Views: Whole Thread | Raw Message | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On Tue, Feb 3, 2026 at 6:26 PM Andres Freund <andres(at)anarazel(dot)de> wrote:
>
> While looking at converting more places to UnlockReleaseBuffer(), in the
> course of making UnlockReleaseBuffer() faster than the two separate
> operations, I found this code:
>
> static bool
> ginScanToDelete(GinVacuumState *gvs, BlockNumber blkno, bool isRoot,
> DataPageDeleteStack *parent, OffsetNumber myoff)
> ...
>
> if (!meDelete)
> {
> if (BufferIsValid(me->leftBuffer))
> UnlockReleaseBuffer(me->leftBuffer);
> me->leftBuffer = buffer;
> }
> else
> {
> if (!isRoot)
> LockBuffer(buffer, GIN_UNLOCK);
>
> ReleaseBuffer(buffer);
> }
>
> if (isRoot)
> ReleaseBuffer(buffer);
>
>
> Which sure looks like it'd release buffer twice if isRoot is set? I guess
> that's not reachable, because presumably the root page will always go down the
> !meDelete path. But it sure made me wonder if there's a hard to reach bug.

Yes, it's not possible to have meDelete set for root, because
me->leftBuffer is always InvalidBuffer for the root. So the branch
handling meDelete case should better do Assert(!isRoot).

> This code was introduced in
> commit e14641197a5
> Author: Alexander Korotkov <akorotkov(at)postgresql(dot)org>
> Date: 2019-11-19 23:07:36 +0300
>
> Fix deadlock between ginDeletePage() and ginStepRight()
>
> I didn't trace it further to see if it existed before that in some fashion.

Yes. I think generally this area needs to be reworked to become more
clear, and have vast more comments. It was wrong from my side trying
to fix bugs there without reworking it into something more
appropriate. I'm planning to put work on this during this week.

> There's another oddity here: ginScanToDelete() requires that the root page has
> been locked by the caller already, but will afaict re-read the root page? But
> then have code to avoid locking it again, because that would not have worked?
> Seems odd.

It seems a bit odd for me that caller already have locked buffer, but
passes BlockNumber making us re-read the buffer. But I'm not sure
that's the same as your point. Could you, please, elaborate more on
this?

------
Regards,
Alexander Korotkov
Supabase

In response to

Browse pgsql-hackers by date

  From Date Subject
Next Message Álvaro Herrera 2026-02-03 22:35:42 Re: Orphaned users in PG16 and above can only be managed by Superusers
Previous Message Álvaro Herrera 2026-02-03 22:32:22 Re: IPC::Run::time[r|out] vs our TAP tests