Re: Why is infinite_recurse test suddenly failing?

From: Andrew Dunstan <andrew(at)dunslane(dot)net>
To: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, Andres Freund <andres(at)anarazel(dot)de>
Cc: pgsql-hackers(at)lists(dot)postgresql(dot)org, mark(at)2ndQuadrant(dot)com
Subject: Re: Why is infinite_recurse test suddenly failing?
Date: 2019-05-10 21:26:43
Message-ID: 072a5de5-14d3-94de-6772-d9b9ed1e019d@dunslane.net
Lists: pgsql-hackers


On 5/10/19 3:35 PM, Tom Lane wrote:
> Andres Freund <andres(at)anarazel(dot)de> writes:
>> On 2019-05-10 11:38:57 -0400, Tom Lane wrote:
>>> I am wondering if, somehow, the stack depth limit seen by the postmaster
>>> sometimes doesn't apply to its children. That would be pretty wacko
>>> kernel behavior, especially if it's only intermittently true.
>>> But we're running out of other explanations.
>> I wonder if this is a SIGSEGV that actually signals an OOM
>> situation. Linux, if it can't actually extend the stack on-demand due to
>> OOM, sends a SIGSEGV. The signal has that information, but
>> unfortunately the buildfarm code doesn't print it. p $_siginfo would
>> show us some of that...
>> Mark, how tight is the memory on that machine? Does dmesg have any other
>> information (often segfaults are logged by the kernel with the code
>> IIRC).
> It does sort of smell like a resource exhaustion problem, especially
> if all these buildfarm animals are VMs running on the same underlying
> platform. But why would that manifest as "you can't have a measly two
> megabytes of stack" and not as any other sort of OOM symptom?
>
> Mark, if you don't mind modding your local copies of the buildfarm
> script, I think what Andres is asking for is a pretty trivial addition
> in PGBuild/Utils.pm's sub get_stack_trace:
>
> my $cmdfile = "./gdbcmd";
> my $handle;
> open($handle, '>', $cmdfile) || die "opening $cmdfile: $!";
> print $handle "bt\n";
> + print $handle "p $_siginfo\n";
> close($handle);
>
>

I think we'll need to write that as:

    print $handle 'p $_siginfo',"\n";

As you have it written, Perl will try to interpolate a variable called
$_siginfo inside the double-quoted string.
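
For illustration, here is a minimal standalone sketch of the two quoting forms that should keep the dollar sign literal. The sub name gdb_commands is hypothetical and is not part of PGBuild/Utils.pm; it only builds the text that would be written to the gdb command file. With the original double-quoted form, Perl treats $_siginfo as one of its own variables: under "use strict" that is a compile-time error, and without it the variable interpolates as an empty string.

```perl
use strict;
use warnings;

# Hypothetical helper, for illustration only: returns the text that
# would be written to the gdb command file.
sub gdb_commands
{
    my @lines;
    push @lines, "bt\n";

    # Single quotes: Perl does not interpolate, so gdb receives the
    # literal convenience variable name $_siginfo.
    push @lines, 'p $_siginfo' . "\n";

    return join('', @lines);
}

# Alternative with the same result: keep the double quotes but escape
# the dollar sign.
my $escaped = "p \$_siginfo\n";

print gdb_commands();
```

Either form works; the single-quoted one has the advantage that nothing inside it needs escaping.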

cheers

andrew
