Skip site navigation (1) Skip section navigation (2)

Re: Excess idle processes, leaking WAL logs?

From: "Brandon Aiken" <BAiken(at)winemantech(dot)com>
To: "Greg Cox" <ratness(at)hotmail(dot)com>,<pgsql-novice(at)postgresql(dot)org>
Subject: Re: Excess idle processes, leaking WAL logs?
Date: 2007-01-03 22:26:52
Message-ID: (view raw, whole thread or download thread mbox)
Lists: pgsql-novice
I highly recommend moving to the latest 7.4.x.  7.4.6 is well over 2 years old, and the numerous security issues alone should be convincing enough to upgrade.  Updating this far appears to be a multi-step process, though.  See the release notes.

There was a race condition prior to 7.4.10 in transaction logging:

There's also a pg_clog race condition in 7.4.12, but that doesn't seem obviously related to me.

You didn't say if you're using clustering, but you may need to use this to correct this issue.

I am by no means an expert with PostgreSQL; however, it seems likely that at least some of the problems you're having have been identified and corrected.

Brandon Aiken
CS/IT Systems Engineer
-----Original Message-----
From: pgsql-novice-owner(at)postgresql(dot)org [mailto:pgsql-novice-owner(at)postgresql(dot)org] On Behalf Of Greg Cox
Sent: Wednesday, January 03, 2007 4:52 PM
To: pgsql-novice(at)postgresql(dot)org
Subject: [NOVICE] Excess idle processes, leaking WAL logs?

I'm a sysadmin, not a DBA, and I inherited a legacy app after the developer 
left the company.  It's 4 early mod_perl (1.29) + early DBI (1.43) app 
servers, going against postgres 7.4.6.

The DB just crapped itself a few days ago.  In the postmortem, we found out 
that the number of processes on the server had been climbing as the server 
ran, going from having some 100 processes, to about 350, which we believe 
were mostly idle postgres processes (someone else got the page-out for 
support).  The uptime was around 6 months.

In the 3 days since it died, I've been watching it and, once again, the 
processes are climbing slowly again.  They're idle postgres processes, 
fairly evenly distributed against the app servers, and more interestingly, 
if I do lsof|grep postgres, I see a large number of lines (now 55):

postmaste 24521 postgres   55u   REG       58,0   16777216   2899982 
/a/path/pdm/pgsql/data/pg_xlog/0000030F000000E2 (deleted)

I looked in the 7.4 docs about WAL.  checkpoint_timeout is 300, 
checkpoint_segments is 8, plenty of space in pg_xlog, but there's 18 files 
in there, some a few hours old, which I would suspect would not be the case 
if it were checkpointing properly.

Am I chasing up the right tree over these leaking processes/connections, or 
is the WAL logs thing just a red herring?  Sadly, I can't edit the app code, 
and I doubt I could upgrade the DB unless I can really business-justify that 
a minor change would do it (I'd probably have to stay in the 7.4 series due 
to timid managers), but if there's little baby tweaks I'm missing, or if I 
can say "yeah, this version of postgres leaks, plan for 3-month-reboots or 
moving to 7.4.x", it'd work for me.

Thanks for any help!

Dave vs. Carl: The Insignificant Championship Series.  Who will win?

---------------------------(end of broadcast)---------------------------
TIP 4: Have you searched our list archives?


In response to

pgsql-novice by date

Next:From: Tom LaneDate: 2007-01-03 23:13:27
Subject: Re: Excess idle processes, leaking WAL logs?
Previous:From: Greg CoxDate: 2007-01-03 21:51:55
Subject: Excess idle processes, leaking WAL logs?

Privacy Policy | About PostgreSQL
Copyright © 1996-2017 The PostgreSQL Global Development Group