Quick Links

Re: production server down

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Joe Conway <mail(at)joeconway(dot)com>
Cc:	"Hackers (PostgreSQL)" <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: production server down
Date:	2004-12-15 06:10:21
Message-ID:	27366.1103091021@sss.pgh.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

Joe Conway <mail(at)joeconway(dot)com> writes:
> Any theories on how we screwed up?

I hesitate to suggest this, but maybe a cron job blindly copying data
from point A to point B?

I'm not sure that that could entirely explain the facts. My
recollection of the xlog.c logic is that the pg_control file is read
into shared memory during postmaster boot, and after that it's
write-only: at checkpoint times we update the file image in shared
memory and then write it out to pg_control.

Offhand my bets would revolve around (a) multiple postmasters trying to
run the same PGDATA directory (we have interlocks to protect against
this, but I have no faith that they work against an NFS-mounted data
directory), or (b) you somehow wiped a PGDATA directory and restored it
from backup tapes underneath a running postmaster.

regards, tom lane

In response to

Re: production server down at 2004-12-15 05:50:02 from Joe Conway

Responses

Re: production server down at 2004-12-15 06:25:15 from Joe Conway

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Joe Conway	2004-12-15 06:25:15	Re: production server down
Previous Message	Christopher Browne	2004-12-15 06:01:31	Re: V8.0rc1 On AIX.