Re: Archive recovery won't be completed on some situation.

From: Kyotaro HORIGUCHI <horiguchi(dot)kyotaro(at)lab(dot)ntt(dot)co(dot)jp>
To: hlinnakangas(at)vmware(dot)com
Cc: masao(dot)fujii(at)gmail(dot)com, pgsql-hackers(at)postgresql(dot)org
Subject: Re: Archive recovery won't be completed on some situation.
Date: 2014-03-20 06:56:36
Message-ID: 20140320.155636.100338288.horiguchi.kyotaro@lab.ntt.co.jp
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

Hello,

> On 03/19/2014 10:28 AM, Kyotaro HORIGUCHI wrote:
> > The*problematic* operation sequence I saw was performed by
> > pgsql-RA/Pacemaker. It stops a server already with immediate mode
> > and starts the Master as a Standby at first, then
> > promote. Focusing on this situation, there would be reasonable to
> > reset backup positions.
>
> Well, that's scary. I would suggest doing a fast shutdown instead. But
> if you really want to do an immediate shutdown, you should delete the
> backup_label file after the shutdown

"We"'d also said them the former thing on several occations. They
answered that they hate shutdown checkpoint to take long time
before shutdown is completed. The latter one has not come on my
mind and seems promising. Thank you for the suggestion.

> When restarting after immediate shutdown and a backup_label file is
> present, the system doesn't know if the system crashed during a
> backup, and it needs to perform crash recovery, or if you're trying
> restore from a backup. It makes a compromise, and starts recovery from
> the checkpoint given in the backup_label, as if it was restoring from
> a backup, but if it doesn't see a backup-end WAL record, it just
> starts up anyway (which would be wrong if you are indeed restoring
> from a backup). But if you create a recovery.conf file, that indicates
> that you are definitely restoring from a backup, so it's more strict
> and insists that the backup-end record must be replayed.
>
> > 9.4 canceles backup mode even on
> > immediate shutdown so the operation causes no problem, but 9.3
> > and before are doesn't.
>
> Hmm, I don't think we've changed that behavior in 9.4.

Now pmdie behaves in the similar manner between fast and
immediate shutdown after 82233ce7ea42d6ba519. It is an side
effect of a change on immediate shutdown which make it to wait
the children to die by SIGQUIT.

regards,

--
Kyotaro Horiguchi
NTT Open Source Software Center

In response to

Browse pgsql-hackers by date

  From Date Subject
Next Message Pavel Stehule 2014-03-20 06:56:46 Re: four minor proposals for 9.5
Previous Message Kyotaro HORIGUCHI 2014-03-20 06:43:00 Re: Archive recovery won't be completed on some situation.