Re: Anti-critical-section assertion failure in mcxt.c reached by walsender

From: Noah Misch <noah(at)leadboat(dot)com>
To: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc: Thomas Munro <thomas(dot)munro(at)gmail(dot)com>, pgsql-hackers <pgsql-hackers(at)postgresql(dot)org>, Andrew Dunstan <andrew(at)dunslane(dot)net>
Subject: Re: Anti-critical-section assertion failure in mcxt.c reached by walsender
Date: 2021-05-08 00:14:18
Message-ID: 20210508001418.GA3076445@rfd.leadboat.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On Fri, May 07, 2021 at 01:18:19PM -0400, Tom Lane wrote:
> Realizing that 9989d37d prevents the assertion failure, I went
> to see if thorntail had shown EIO failures without assertions.
> Looking back 180 days, I found these:
>
> sysname | branch | snapshot | stage | l
> -----------+---------------+---------------------+--------------------+------------------------------------------------------------------------------------------------------------------------------------------------
> thorntail | HEAD | 2021-03-19 21:28:15 | recoveryCheck | 2021-03-20 00:48:48.117 MSK [4089174:11] 008_fsm_truncation.pl PANIC: could not fdatasync file "000000010000000000000002": Input/output error
> thorntail | HEAD | 2021-04-06 16:08:10 | recoveryCheck | 2021-04-06 19:30:54.103 MSK [3355008:11] 008_fsm_truncation.pl PANIC: could not fdatasync file "000000010000000000000002": Input/output error
> thorntail | REL9_6_STABLE | 2021-04-12 02:38:04 | pg_basebackupCheck | pg_basebackup: could not fsync file "000000010000000000000013": Input/output error
>
> So indeed the kernel-or-hardware problem is affecting other branches.

Having a flaky buildfarm member is bad news. I'll LD_PRELOAD the attached to
prevent fsync from reaching the kernel. Hopefully, that will make the
hardware-or-kernel trouble unreachable. (Changing 008_fsm_truncation.pl
wouldn't avoid this, because fsync=off doesn't affect syncs outside the
backend.)

Attachment Content-Type Size
never_sync.c text/plain 139 bytes

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message David Rowley 2021-05-08 00:38:33 Re: Binary search in ScalarArrayOpExpr for OR'd constant arrays
Previous Message Peter Lee 2021-05-07 23:22:22 Will Postgres12 installed on a RHEL 6 server continue to function after the server get O/S upgrade to RHEL 7?