Skip site navigation (1) Skip section navigation (2)

Re: Attempt to stop dead instance can stop a randomprocess?

From: "Kevin Grittner" <Kevin(dot)Grittner(at)wicourts(dot)gov>
To: "Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc: <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: Attempt to stop dead instance can stop a randomprocess?
Date: 2007-08-31 20:30:31
Message-ID: 46D83417.EE98.0025.0@wicourts.gov (view raw or flat)
Thread:
Lists: pgsql-hackers
>>> On Fri, Aug 31, 2007 at  3:10 PM, in message <1068(dot)1188591013(at)sss(dot)pgh(dot)pa(dot)us>,
Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us> wrote: 
> "Kevin Grittner" <Kevin(dot)Grittner(at)wicourts(dot)gov> writes:
>> Well, that's not due to a bug in PostgreSQL.  We're using a buggy LDAP
>> implementation (not my call) which can crash things.  The machine totally
>> locked up after logging distress messages from that daemon, and they cycled
>> power to get out of it.
> 
> Hmm.  Do I correctly grasp the picture that you've got several Postgres
> installations on the machine and they're all booted by startup scripts?
 
Several is an understatement.  This is the machine where we're running one
PostgreSQL instance per county in "warm standby" mode -- not to actually use
in recovery, but only to confirm that the backups are  flowing back and
applying cleanly.  So, 72 instances, on ports 5401 to 5472.
 
> In this situation, it's actually not a bad idea to run each one under a
> separate userid.
 
OK, I'll see about getting that set up.
 
> (Some people prefer to fix this by having a startup script that forcibly
> removes all the lockfiles before launching the postmasters.  I think
> that's kinda risky, although if it's done in a separate script that
> you'd have no reason to run by hand, it's probably OK.
 
I don't like that idea much.  I'd rather add 72 new OS users.
 
> BTW, I would imagine that some scenario like this preceded the problem
> that you actually reported, since had all the postmasters started
> successfully, they'd all have written correct lockfiles.
 
Quite likely.  Most of the action happened before I arrived for the day.
 
Thanks.
 
-Kevin
 


In response to

pgsql-hackers by date

Next:From: Tom LaneDate: 2007-08-31 21:10:41
Subject: Re: [PATCH] Lazy xid assingment V2
Previous:From: Tom LaneDate: 2007-08-31 20:10:13
Subject: Re: Attempt to stop dead instance can stop a random process?

Privacy Policy | About PostgreSQL
Copyright © 1996-2014 The PostgreSQL Global Development Group