Quick Links

Re: Suppressing useless wakeups in walreceiver

From:	Thomas Munro <thomas(dot)munro(at)gmail(dot)com>
To:	Nathan Bossart <nathandbossart(at)gmail(dot)com>
Cc:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, Kyotaro Horiguchi <horikyota(dot)ntt(at)gmail(dot)com>, alvherre(at)alvh(dot)no-ip(dot)org, bharath(dot)rupireddyforpostgres(at)gmail(dot)com, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Suppressing useless wakeups in walreceiver
Date:	2022-11-14 23:26:00
Message-ID:	CA+hUKGKEg53sAYhjjB3HE++bi9d5o4Dt1Hy-5rkU4WwAqSjNGA@mail.gmail.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On Tue, Nov 15, 2022 at 12:14 PM Nathan Bossart
<nathandbossart(at)gmail(dot)com> wrote:
> On Tue, Nov 15, 2022 at 09:42:26AM +1300, Thomas Munro wrote:
> > That works for 020_pg_receivewal. I wonder if there are also tests
> > that stream a bit of WAL first and then do wait_for_catchup that were
> > previously benefiting from the 100ms-after-startup message by
> > scheduling luck (as in, that was usually enough for replay)? I might
> > go and teach Cluster.pm to log how much time is wasted in
> > wait_for_catchup to get some observability, and then try to figure out
> > how to optimise it properly. We could perhaps put the 100ms duct tape
> > back temporarily though, if necessary.
>
> Oh, I see. Since we don't check the apply position when determining
> whether to send a reply, tests may need to wait a full
> wal_receiver_status_interval. FWIW with my patch, the runtime of the
> src/test/recovery tests seems to be back to what it was on my machine, but
> I certainly wouldn't rule out scheduling luck.

Yeah, what you have looks good in my experiments. Here are some
statistics on waits on my system. Times in seconds, first row is
before the new time logic went in, your patch is the last row.

name | count | sum | avg | stddev | min | max
---------------+-------+---------+-------+--------+-------+--------
before | 718 | 37.375 | 0.052 | 0.135 | 0.006 | 1.110
master | 718 | 173.100 | 0.241 | 1.374 | 0.006 | 10.004
initial-100ms | 718 | 37.169 | 0.052 | 0.126 | 0.006 | 0.676
initial-0ms | 718 | 35.276 | 0.049 | 0.123 | 0.006 | 0.661

The difference on master is explained by these 14 outliers:

Now that I can see what is going on I'm going to try to see how much I
can squeeze out of this...

In response to

Re: Suppressing useless wakeups in walreceiver at 2022-11-14 23:14:39 from Nathan Bossart

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Nathan Bossart	2022-11-14 23:40:04	Re: predefined role(s) for VACUUM and ANALYZE
Previous Message	Andres Freund	2022-11-14 23:24:07	Re: meson oddities