Quick Links

Re: Another reason why the recovery tests take a long time

From:	Andres Freund <andres(at)anarazel(dot)de>
To:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc:	pgsql-hackers(at)postgreSQL(dot)org
Subject:	Re: Another reason why the recovery tests take a long time
Date:	2017-06-26 18:01:04
Message-ID:	20170626180104.irip5bklcocgdy3q@alap3.anarazel.de
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On 2017-06-26 13:42:52 -0400, Tom Lane wrote:
> Andres Freund <andres(at)anarazel(dot)de> writes:
> > On 2017-06-26 12:32:00 -0400, Tom Lane wrote:
> >> ... But I wonder whether it's intentional that the old
> >> walreceiver dies in the first place. That FATAL exit looks suspiciously
> >> like it wasn't originally-designed-in behavior.
>
> > It's quite intentional afaik - I've complained about the bad error
> > message recently (we really shouldn't say "no COPY in progress), but
> > exiting seems quite reasonable. Otherwise we'd have add a separate
> > retry logic into the walsender, that reconnects without a new walsender
> > being started.
>
> Ah, I see. I'd misinterpreted the purpose of the infinite loop in
> WalReceiverMain() --- now I see that's for receiving requests from the
> startup proc for different parts of the WAL stream, not for reconnecting
> to the master.

Right. And if the connection fails, we intentionally (whether that's
good or bad is another question) switch to restore_command (or
pg_xlog...) based recovery, in which case we certainly do not want the
walsender around.

- Andres

In response to

Re: Another reason why the recovery tests take a long time at 2017-06-26 17:42:52 from Tom Lane

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Tom Lane	2017-06-26 18:06:56	Re: Another reason why the recovery tests take a long time
Previous Message	Tom Lane	2017-06-26 17:42:52	Re: Another reason why the recovery tests take a long time