Re: Another reason why the recovery tests take a long time

From: Andres Freund <andres(at)anarazel(dot)de>
To: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc: pgsql-hackers(at)postgreSQL(dot)org
Subject: Re: Another reason why the recovery tests take a long time
Date: 2017-06-26 18:01:04
Message-ID: 20170626180104.irip5bklcocgdy3q@alap3.anarazel.de
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On 2017-06-26 13:42:52 -0400, Tom Lane wrote:
> Andres Freund <andres(at)anarazel(dot)de> writes:
> > On 2017-06-26 12:32:00 -0400, Tom Lane wrote:
> >> ... But I wonder whether it's intentional that the old
> >> walreceiver dies in the first place. That FATAL exit looks suspiciously
> >> like it wasn't originally-designed-in behavior.
>
> > It's quite intentional afaik - I've complained about the bad error
> > message recently (we really shouldn't say "no COPY in progress), but
> > exiting seems quite reasonable. Otherwise we'd have add a separate
> > retry logic into the walsender, that reconnects without a new walsender
> > being started.
>
> Ah, I see. I'd misinterpreted the purpose of the infinite loop in
> WalReceiverMain() --- now I see that's for receiving requests from the
> startup proc for different parts of the WAL stream, not for reconnecting
> to the master.

Right. And if the connection fails, we intentionally (whether that's
good or bad is another question) switch to restore_command (or
pg_xlog...) based recovery, in which case we certainly do not want the
walsender around.

- Andres

In response to

Browse pgsql-hackers by date

  From Date Subject
Next Message Tom Lane 2017-06-26 18:06:56 Re: Another reason why the recovery tests take a long time
Previous Message Tom Lane 2017-06-26 17:42:52 Re: Another reason why the recovery tests take a long time