Re: Why is parula failing?

From: Robins Tharakan <tharakan(at)gmail(dot)com>
To: "Tharakan, Robins" <tharar(at)amazon(dot)com>
Cc: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, David Rowley <dgrowleyml(at)gmail(dot)com>, "pgsql-hackers(at)lists(dot)postgresql(dot)org" <pgsql-hackers(at)lists(dot)postgresql(dot)org>
Subject: Re: Why is parula failing?
Date: 2024-04-13 13:31:49
Message-ID: CAEP4nAxQhdUM4qF4o_5sfP_TyTq09enzo5KCs4W2t7-VmDYnUQ@mail.gmail.com
Lists: pgsql-hackers

On Mon, 8 Apr 2024 at 21:25, Robins Tharakan <tharakan(at)gmail(dot)com> wrote:
>
>
> I'll keep an eye on this instance more often for the next few days.
> (Let me know if I could capture more if a run gets stuck again)

HEAD is stuck again in pg_sleep(), with no CPU usage for the past hour or so.
The stack trace looks similar to last time.

$ pstack 24930
#0 0x0000ffffb8280954 in epoll_pwait () from /lib64/libc.so.6
#1 0x0000000000843408 in WaitEventSetWaitBlock (nevents=1, occurred_events=<optimized out>, cur_timeout=600000, set=0x3b38dac0) at latch.c:1570
#2 WaitEventSetWait (set=0x3b38dac0, timeout=timeout@entry=600000, occurred_events=occurred_events@entry=0xfffffd1d66c8, nevents=nevents@entry=1, wait_event_info=wait_event_info@entry=150994946) at latch.c:1516
#3 0x00000000008437c4 in WaitLatch (latch=<optimized out>, wakeEvents=wakeEvents@entry=41, timeout=600000, wait_event_info=wait_event_info@entry=150994946) at latch.c:538
#4 0x000000000090c384 in pg_sleep (fcinfo=<optimized out>) at misc.c:406
#5 0x0000000000699350 in ExecInterpExpr (state=0x3b5a41a0, econtext=0x3b5a3f98, isnull=<optimized out>) at execExprInterp.c:764
#6 0x00000000006d1668 in ExecEvalExprSwitchContext (isNull=0xfffffd1d683f, econtext=0x3b5a3f98, state=<optimized out>) at ../../../src/include/executor/executor.h:356
#7 ExecProject (projInfo=<optimized out>) at ../../../src/include/executor/executor.h:390
#8 ExecResult (pstate=<optimized out>) at nodeResult.c:135
#9 0x00000000006ba26c in ExecProcNode (node=0x3b5a3e88) at ../../../src/include/executor/executor.h:274
#10 gather_getnext (gatherstate=0x3b5a3c98) at nodeGather.c:287
#11 ExecGather (pstate=0x3b5a3c98) at nodeGather.c:222
#12 0x000000000069d28c in ExecProcNode (node=0x3b5a3c98) at ../../../src/include/executor/executor.h:274
#13 ExecutePlan (execute_once=<optimized out>, dest=0x3b5ae8e0, direction=<optimized out>, numberTuples=0, sendTuples=<optimized out>, operation=CMD_SELECT, use_parallel_mode=<optimized out>, planstate=0x3b5a3c98, estate=0x3b5a3a70) at execMain.c:1646
#14 standard_ExecutorRun (queryDesc=0x3b59c250, direction=<optimized out>, count=0, execute_once=<optimized out>) at execMain.c:363
#15 0x00000000008720e4 in PortalRunSelect (portal=portal@entry=0x3b410fb0, forward=forward@entry=true, count=0, count@entry=9223372036854775807, dest=dest@entry=0x3b5ae8e0) at pquery.c:924
#16 0x0000000000873900 in PortalRun (portal=portal@entry=0x3b410fb0, count=count@entry=9223372036854775807, isTopLevel=isTopLevel@entry=true, run_once=run_once@entry=true, dest=dest@entry=0x3b5ae8e0, altdest=altdest@entry=0x3b5ae8e0, qc=qc@entry=0xfffffd1d6bf0) at pquery.c:768
#17 0x000000000086f5d4 in exec_simple_query (query_string=query_string@entry=0x3b391c90 "SELECT pg_sleep(0.1);") at postgres.c:1274
#18 0x0000000000870110 in PostgresMain (dbname=<optimized out>, username=<optimized out>) at postgres.c:4680
#19 0x000000000086b6a0 in BackendMain (startup_data=<optimized out>, startup_data_len=<optimized out>) at backend_startup.c:105
#20 0x00000000007c6268 in postmaster_child_launch (child_type=child_type@entry=B_BACKEND, startup_data=startup_data@entry=0xfffffd1d70b8 "", startup_data_len=startup_data_len@entry=4, client_sock=client_sock@entry=0xfffffd1d70c0) at launch_backend.c:265
#21 0x00000000007c9c50 in BackendStartup (client_sock=0xfffffd1d70c0) at postmaster.c:3593
#22 ServerLoop () at postmaster.c:1674
#23 0x00000000007cb8f8 in PostmasterMain (argc=argc@entry=8, argv=argv@entry=0x3b38d320) at postmaster.c:1372
#24 0x0000000000496e18 in main (argc=8, argv=0x3b38d320) at main.c:197
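
For anyone reading the raw integers in frames #2 and #3, here is a rough sketch of how they might be decoded. The bit masks and flag values are assumptions based on my reading of PostgreSQL's wait_event.h and latch.h and may differ between versions; the helper names are made up for illustration only.

```python
# Assumed layout (not verified against this exact build):
# the top byte of wait_event_info is the wait class, the low 16 bits the event ID.
PG_WAIT_CLASS_MASK = 0xFF000000
PG_WAIT_EVENT_ID_MASK = 0x0000FFFF
PG_WAIT_TIMEOUT = 0x09000000  # assumed value of the "Timeout" wait class

# Assumed WL_* flag bits from latch.h.
WL_FLAGS = {
    1 << 0: "WL_LATCH_SET",
    1 << 1: "WL_SOCKET_READABLE",
    1 << 2: "WL_SOCKET_WRITEABLE",
    1 << 3: "WL_TIMEOUT",
    1 << 4: "WL_POSTMASTER_DEATH",
    1 << 5: "WL_EXIT_ON_PM_DEATH",
}


def decode_wait_event_info(info):
    """Split wait_event_info into (wait class bits, event ID). Hypothetical helper."""
    return info & PG_WAIT_CLASS_MASK, info & PG_WAIT_EVENT_ID_MASK


def decode_wake_events(mask):
    """Return the names of the WL_* flags set in mask. Hypothetical helper."""
    return [name for bit, name in sorted(WL_FLAGS.items()) if mask & bit]


wait_class, event_id = decode_wait_event_info(150994946)
print(hex(wait_class), event_id)            # 0x9000000 2
print(wait_class == PG_WAIT_TIMEOUT)        # True
print(decode_wake_events(41))               # ['WL_LATCH_SET', 'WL_TIMEOUT', 'WL_EXIT_ON_PM_DEATH']
print(600000 / 1000 / 60, "minutes")        # 10.0 minutes
```

Under those assumptions, the backend is waiting in the Timeout class with a latch-or-timeout wait, and the timeout passed down is 600000 ms (10 minutes), even though the query text in frame #17 is pg_sleep(0.1).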

CPU% MEM% TIME+ Command
.
.
0.0 0.0 0:00.00 │ └─ /bin/sh -c cd /opt/postgres/build-farm-14 && PATH=/opt/gcc/home/ec2-user/proj/gcc/target/bin/
0.0 0.1 0:00.07 │ └─ /usr/bin/perl ./run_build.pl --config=build-farm.conf HEAD --verbose
0.0 0.0 0:00.00 │ └─ sh -c { cd pgsql.build/src/test/regress && make NO_LOCALE=1 check; echo $? > /opt/postg
0.0 0.0 0:00.00 │ └─ make NO_LOCALE=1 check
0.0 0.0 0:00.00 │ └─ /bin/sh -c echo "# +++ regress check in src/test/regress +++" && PATH="/opt/postg
0.0 0.0 0:00.10 │ └─ ../../../src/test/regress/pg_regress --temp-instance=./tmp_check --inputdir=.
0.0 0.0 0:00.01 │ ├─ psql -X -a -q -d regression -v HIDE_TABLEAM=on -v HIDE_TOAST_COMPRESSION=on
0.0 0.1 0:02.64 │ └─ postgres -D /opt/postgres/build-farm-14/buildroot/HEAD/pgsql.build/src/test
0.0 0.2 0:00.05 │ ├─ postgres: postgres regression [local] SELECT
0.0 0.0 0:00.06 │ ├─ postgres: logical replication launcher
0.0 0.1 0:00.36 │ ├─ postgres: autovacuum launcher
0.0 0.1 0:00.34 │ ├─ postgres: walwriter
0.0 0.0 0:00.32 │ ├─ postgres: background writer
0.0 0.3 0:00.05 │ └─ postgres: checkpointer

-
robins
