Re: soft lockup - CPU#16 stuck for 3124s! [postmaster:2273]

From: Ron Johnson <ronljohnsonjr(at)gmail(dot)com>
To: "pgsql-generallists(dot)postgresql(dot)org" <pgsql-general(at)lists(dot)postgresql(dot)org>
Subject: Re: soft lockup - CPU#16 stuck for 3124s! [postmaster:2273]
Date: 2024-03-22 17:31:43
Message-ID: CANzqJaAda267=Noy_bQceGdvHTVs+fQm=AixhRdiqe6rWfZ07g@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

On Fri, Mar 22, 2024 at 1:27 PM Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us> wrote:

> Matthias Apitz <guru(at)unixarea(dot)de> writes:
> > We have a PostgreSQL 15.1 server in production at a customer for some
> > weeks (migrated from an older version) on SuSE SLES 15.
>
> > The customer is facing machine locks and before the Linux server does
> > not respond any more (not even on SSH, only power-cycle reset helps to
> > get it back), short before the fault a lot of messages are in
> > /var/log/messages of the content:
>
> > # grep watchdog: /var/log/messages
> > ...
> > 2024-03-22T13:11:32.056154+01:00 sunrise kernel: [327844.313048][ C25]
> watchdog: BUG: soft lockup - CPU#25 stuck for 3069s! [migration/25:166]
> > 2024-03-22T13:12:28.056244+01:00 sunrise kernel: [327900.310267][ C16]
> watchdog: BUG: soft lockup - CPU#16 stuck for 3124s! [postmaster:2273]
> > 2024-03-22T13:12:28.056340+01:00 sunrise kernel: [327900.311052][ C25]
> watchdog: BUG: soft lockup - CPU#25 stuck for 3121s! [migration/25:166]
>
> Sounds like failing hardware to me :-(

Updating to 15.6 would rule out any bugs squashed in the last 15 months.

In response to

Responses

Browse pgsql-general by date

  From Date Subject
Next Message Nathan Bossart 2024-03-22 19:35:06 Re: Slow GRANT ROLE on PostgreSQL 16 with thousands of ROLEs
Previous Message Tom Lane 2024-03-22 17:27:15 Re: soft lockup - CPU#16 stuck for 3124s! [postmaster:2273]