Re: One-off failure in "cluster" test

From: Thomas Munro <thomas(dot)munro(at)gmail(dot)com>
To: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc: pgsql-hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: One-off failure in "cluster" test
Date: 2020-08-17 01:27:57
Message-ID: CA+hUKG+bQeWgMymmFCYq9wh8Kjw0jm=ASAKJmsYCNAR+rDwtqw@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On Mon, Aug 17, 2020 at 1:20 PM Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us> wrote:
> Thomas Munro <thomas(dot)munro(at)gmail(dot)com> writes:
> > I wonder what caused this[1] one-off failure to see tuples in clustered order:
> > ...
> > I guess a synchronised scan could cause that, but I wouldn't expect one here.
>
> Looking at its configuration, chipmunk uses
>
> 'extra_config' => {
> ...
> 'shared_buffers = 10MB',
>
> which I think means that clstr_4 would be large enough to trigger a
> syncscan. Ordinarily that's not a problem since no other session would
> be touching clstr_4 ... but I wonder whether (a) autovacuum had decided
> to look at clstr_4 and (b) syncscan can trigger on vacuum-driven scans.
> (a) would explain the non-reproducibility.
>
> I kinda think that (b), if true, is a bad idea and should be suppressed.
> autovacuum would typically fail to keep up with other syncscans thanks
> to vacuum delay settings, so letting it participate seems unhelpful.

Yeah, I wondered that as well and found my way to historical
discussions concluding that autovacuum should not participate in sync
scans. Now I'm wondering if either table AM refactoring or parallel
vacuum refactoring might have inadvertently caused that to become a
possibility in REL_13_STABLE.

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Michael Paquier 2020-08-17 01:32:37 Re: Fix an old description in high-availability.sgml
Previous Message Tom Lane 2020-08-17 01:20:40 Re: One-off failure in "cluster" test