Re: random system table corruption ...

From: Hans-Jürgen Schönig <postgres(at)cybertec(dot)at>
To: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc: PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>, eg(at)cybertec(dot)at
Subject: Re: random system table corruption ...
Date: 2005-09-11 16:41:36
Message-ID: 43245E40.40105@cybertec.at
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

Tom Lane wrote:
> =?ISO-8859-1?Q?Hans-J=FCrgen_Sch=F6nig?= <postgres(at)cybertec(dot)at> writes:
>
>>in the past we have faced a couple of problems with corrupted system
>>tables. this seems to be a version independent problem which occurs on
>>hackers' from time to time.
>>i have checked a broken file and i have seen that the corrupted page has
>>actually been zeroed out.
>
>
> That sounds to me like a hardware problem --- disk or disk controller
> momentarily writing zeroes instead of what it should write. Have you
> seen this on more than one physical machine? Do you have any evidence
> for the implication that it only happens to system tables and not user
> tables?
>
> Also, you don't have zero_damaged_pages turned on by any chance?
>
> regards, tom lane

tom,

well, there is some evidence that this is not a hardware related issue.
we have only seen this problem from time to time but it happened on
different machines. it cannot be reproduced. it can even happen when
somebody runs a script which has been called million times before.
in my current scenario the page header only consists of 0x00 bytes and
therefore the page checks fails when reading the system table.

i have never seen this in data files up to now (at least not when the
hardware was still intact).

did anybody face similar problems? maybe on sun?
by the way: currently the broken system is running PostgreSQL 7.4 but as
I said - we have also seen that on 8.0 once.

best regards,

hans

--
Cybertec Geschwinde & Schönig GmbH
Schöngrabern 134; A-2020 Hollabrunn
Tel: +43/1/205 10 35 / 340
www.postgresql.at, www.cybertec.at

In response to

Browse pgsql-hackers by date

  From Date Subject
Next Message Andrew Dunstan 2005-09-11 17:06:23 Re: initdb profiles
Previous Message Tom Lane 2005-09-11 16:33:00 Re: -fPIC