Quick Links

Re: Unstable tests for recovery conflict handling

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Andres Freund <andres(at)anarazel(dot)de>
Cc:	Thomas Munro <thomas(dot)munro(at)gmail(dot)com>, Melanie Plageman <melanieplageman(at)gmail(dot)com>, pgsql-hackers(at)lists(dot)postgresql(dot)org
Subject:	Re: Unstable tests for recovery conflict handling
Date:	2022-04-27 18:08:45
Message-ID:	447238.1651082925@sss.pgh.pa.us
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-committers pgsql-hackers

I wrote:
> It's been kind of hidden by other buildfarm noise, but
> 031_recovery_conflict.pl is not as stable as it should be [1][2][3][4].
> ...
> I think this is showing us a real bug, ie we sometimes fail to cancel
> the conflicting query.

After digging around in the code, I think this is almost certainly
some manifestation of the previously-complained-of problem [1] that
RecoveryConflictInterrupt is not safe to call in a signal handler,
leading the conflicting backend to sometimes decide that it's not
the problem. That squares with the observation that skink is more
prone to show this than other animals: you'd have to get the SIGUSR1
while the target backend isn't idle, so a very slow machine ought to
show it more. We don't seem to have that issue on the open items
list, but I'll go add it.

Not sure if the 'buffer pin conflict: stats show conflict on standby'
failure could trace to a similar cause.

regards, tom lane

[1] https://www.postgresql.org/message-id/flat/CA%2BhUKGK3PGKwcKqzoosamn36YW-fsuTdOPPF1i_rtEO%3DnEYKSg%40mail.gmail.com

In response to

Unstable tests for recovery conflict handling at 2022-04-27 16:45:14 from Tom Lane

Responses

Re: Unstable tests for recovery conflict handling at 2022-04-28 01:31:34 from Andres Freund
Re: Unstable tests for recovery conflict handling at 2022-07-26 17:57:53 from Tom Lane

Browse pgsql-committers by date

	From	Date	Subject
Next Message	Thomas Munro	2022-04-28 00:11:40	Re: pgsql: Add contrib/pg_walinspect.
Previous Message	Mark Dilger	2022-04-27 17:50:06	Re: Unstable tests for recovery conflict handling

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Nathan Bossart	2022-04-27 18:09:45	Re: Possible corruption by CreateRestartPoint at promotion
Previous Message	Mark Dilger	2022-04-27 17:50:06	Re: Unstable tests for recovery conflict handling