Re: Completely broken replica after PANIC: WAL contains references to invalid pages

From: Sergey Konoplev <gray(dot)ru(at)gmail(dot)com>
To: Andres Freund <andres(at)2ndquadrant(dot)com>
Cc: Simon Riggs <simon(at)2ndquadrant(dot)com>, pgsql-bugs <pgsql-bugs(at)postgresql(dot)org>, Maxim Boguk <maxim(dot)boguk(at)gmail(dot)com>, Максим Панченко <Panchenko(at)gw(dot)tander(dot)ru>, Толстенко Илья <tolstenko_iv(at)gw(dot)tander(dot)ru>, Сизов Сергей Павлович <sizov_sp(at)gw(dot)tander(dot)ru>, Соболев Виталий Анатольевич <sobolev_va(at)gw(dot)tander(dot)ru>
Subject: Re: Completely broken replica after PANIC: WAL contains references to invalid pages
Date: 2013-11-05 20:03:19
Message-ID: CAL_0b1uebnF4mKHzngihSWD5qfJVE40bfcPwZ7V=HO8dEx4ACw@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-bugs

On Mon, Nov 4, 2013 at 1:48 AM, Andres Freund <andres(at)2ndquadrant(dot)com> wrote:
> On 2013-11-01 00:35:18 -0700, Sergey Konoplev wrote:
>> On Tue, Apr 2, 2013 at 11:26 AM, Andres Freund <andres(at)2ndquadrant(dot)com> wrote:
>> > The attached patch fixes this although I don't like the way it knowledge of the
>> > point up to which StartupSUBTRANS zeroes pages is handled.
>>
>> So, after half a year the same failure has happened again on the same
>> replica, but now patched with the Andres' patch (9.2.4 + the patch)
>> that was supposed to fix it.
>
> I think that's actually a different bug - what the patch was fixing was
> the clog/subtrans errors, not the "reference to invalid pages".

Can I do something to help fixing this particular bug? I backed up the
relation and xlog files in case if someone need to dig it deeper.

> I've submitted a fix for at least one cause causing that independently
> from this bug which is included in 9.2.5
> (17fa4c321ccf9693de406faffe6b235e949aa25f). Note thought that that fix
> needed a fix iff using a cassert enabled build
> (4da24f12e63313b7dbb6b3e3d0317e04045df636) which is *not* in 9.2.5.

--
Kind regards,
Sergey Konoplev
PostgreSQL Consultant and DBA

http://www.linkedin.com/in/grayhemp
+1 (415) 867-9984, +7 (901) 903-0499, +7 (988) 888-1979
gray(dot)ru(at)gmail(dot)com

In response to

Responses

Browse pgsql-bugs by date

  From Date Subject
Next Message Maxy 2013-11-05 20:03:44 I: I: psql security fail?
Previous Message Alvaro Herrera 2013-11-05 19:49:35 Re: I: psql security fail?