Re: Anti-critical-section assertion failure in mcxt.c reached by walsender

From: Noah Misch <noah(at)leadboat(dot)com>
To: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc: Thomas Munro <thomas(dot)munro(at)gmail(dot)com>, pgsql-hackers <pgsql-hackers(at)postgresql(dot)org>, Andrew Dunstan <andrew(at)dunslane(dot)net>
Subject: Re: Anti-critical-section assertion failure in mcxt.c reached by walsender
Date: 2021-05-07 02:28:31
Message-ID: 20210507022831.GA2991266@rfd.leadboat.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On Thu, May 06, 2021 at 09:43:32PM -0400, Tom Lane wrote:
> 2. We evidently need to put a bit more effort into this error
> reporting logic. More generally, I wonder how we could audit
> the code for similar hazards elsewhere, because I bet there are
> some. (Or ... could it be sane to run functions included in
> the ereport's arguments in ErrorContext?)

Seems reasonable. I don't have good ideas for auditing; just making the
palloc work may be easier.

> 3. One might wonder why we're getting an fdatasync failure at
> all, when thorntail is configured to run with fsync = off.
> The answer to that one is that 008_fsm_truncation.pl takes it
> upon itself to force fsync = on, overriding the express wishes
> of the buildfarm owner, not to mention general project policy.
> AFAICT that was added with little if any thought in the initial
> creation of 008_fsm_truncation.pl, and I think we should take
> it out. There's certainly no visible reason for this one
> TAP script to be running with fsync on when no others do.

I've caught that one test taking ~10min due to its fsync use. If fsync=on
isn't important to the test, +1 for removing it.

In response to

Browse pgsql-hackers by date

  From Date Subject
Next Message Amit Kapila 2021-05-07 02:33:42 Re: Replication slot stats misgivings
Previous Message Tom Lane 2021-05-07 01:43:32 Re: Anti-critical-section assertion failure in mcxt.c reached by walsender