Re: Properly handle OOM death?

From: Justin Pryzby <pryzby(at)telsasoft(dot)com>
To: pgsql-general(at)lists(dot)postgresql(dot)org
Cc: Christoph Berg <myon(at)debian(dot)org>, Devrim Gündüz <devrim(at)gunduz(dot)org>
Subject: Re: Properly handle OOM death?
Date: 2023-11-13 14:42:31
Message-ID: ZVI112aVNCHOQgfF@pryzbyj2023
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

On Mon, Mar 13, 2023 at 06:43:01PM +0100, Peter J. Holzer wrote:
> On 2023-03-13 09:21:18 -0800, Israel Brewster wrote:
> > I’m running a postgresql 13 database on an Ubuntu 20.04 VM that is a bit more
> > memory constrained than I would like, such that every week or so the various
> > processes running on the machine will align badly and the OOM killer will kick
> > in, killing off postgresql, as per the following journalctl output:
> >
> > Mar 12 04:04:23 novarupta systemd[1]: postgresql(at)13-main(dot)service: A process of this unit has been killed by the OOM killer.
> > Mar 12 04:04:32 novarupta systemd[1]: postgresql(at)13-main(dot)service: Failed with result 'oom-kill'.
> > Mar 12 04:04:32 novarupta systemd[1]: postgresql(at)13-main(dot)service: Consumed 5d 17h 48min 24.509s CPU time.
> >
> > And the service is no longer running.
>
> I might be misreading this, but it looks to me that systemd detects that
> *some* process in the group was killed by the oom killer and stops the
> service.

Yeah.

I found this old message on google. I'm surprised there aren't more,
similar complaints about this. It's as Peter said: it (sometimes)
causes systemd to actively *stop* the cluster after OOM, when it
would've come back online on its own if the init (supervisor) process
didn't interfere.

My solution was to set:
/usr/lib/systemd/system/postgresql(at)(dot)service OOMPolicy=continue

I suggest that the default unit files should do likewise.

--
Justin

In response to

Browse pgsql-general by date

  From Date Subject
Next Message b55white 2023-11-13 15:11:27 Re: Issue in compiling postgres on latest macOS 14.1.1
Previous Message David G. Johnston 2023-11-13 07:22:54 Re: Unique Primary Key Linked to Multiple Accounts