Skip site navigation (1) Skip section navigation (2)

Re: Reliably determining whether the server came up

From: "H(dot) Hall" <hhall1001(at)reedyriver(dot)com>
To: Mischa Sandberg <mischa_sandberg(at)telus(dot)net>, pgsql-admin(at)postgresql(dot)org
Subject: Re: Reliably determining whether the server came up
Date: 2008-11-15 18:43:45
Message-ID: 491F1861.4060403@reedyriver.com (view raw or flat)
Thread:
Lists: pgsql-admin
Mischa Sandberg wrote:
> I've been trying to work out a reliable script to determine, 
> after pg_ctl start, that the server is done attempting
> to come up, and that it has either succeeded OR FAILED.
> This is for several hundred unattended appliance-type servers,
> currently on PG 8.0 but soon to be on 8.3
>   

Why don't you try to create a connection to a db on each server?

HH
> Haven't found anything in the archives.
> I want to determine success/failure without time-outs, since:
> the db is restarted every time a server gets an upgrade,
> and it can get several upgrades in a batch, and the cpu/disk load 
> during an upgrade is highly variable; a restart with no
> recovery may still require as much as a minute to get to 'ready'.
>
> We also need to restart the server several hundred times
> in our in-house system tests.
>
> So pg_ctl -w start is not an option, even if the timeout were
> configurable to under a minute.
>
> The best I have that doesn't involve modifying pg_ctl is:
>
> # Hand-compute $NEXT_LOG from postgresql.conf
> # parameters (log_directory) and (log_filename).
> # Replace %S format with a '??' wildcard (yech).
>
> $ TEMP_LOG=/tmp/pg.$PGPORT.log
> $ touch $NEXT_LOG >$TEMP_LOG
> $ FROM=`awk 'END {print NR+1}' $NEXT_LOG`
> $ pg_ctl start -s -l $TEMPLOG
> $ while tail +$FROM $NEXT_LOG | ! egrep -hw
> 'FATAL|PANIC|DETAIL|ready|shutting|^postmaster cannot' $TEMP_LOG -; do
> sleep 1; done
>
> The nasty cases are when the server fails (exits)
> without being able to create its std log file (e.g. 
> error in postgresql.conf). 
>
> So I'm down to patching start_postmaster in pg_ctl.c 
> to use popen("... & echo $!") instead of system("... &"),
> then make test_postmaster_connection do a kill(0,pid)
> if PQsetdbLogin fails.
>
> Any suggestions appreciated.
>   



In response to

Responses

pgsql-admin by date

Next:From: Mischa SandbergDate: 2008-11-15 20:29:05
Subject: Re: Reliably determining whether the server came up
Previous:From: Tom LaneDate: 2008-11-14 19:08:06
Subject: Re: question on warm standby

Privacy Policy | About PostgreSQL
Copyright © 1996-2014 The PostgreSQL Global Development Group