From:
Fujii Masao <masao(dot)fujii(at)gmail(dot)com>
To:
Heikki Linnakangas <hlinnakangas(at)vmware(dot)com>
Cc:
Amit kapila <amit(dot)kapila(at)huawei(dot)com>, "pgsql-bugs(at)postgresql(dot)org" <pgsql-bugs(at)postgresql(dot)org>, "pgsql-hackers(at)postgresql(dot)org" <pgsql-hackers(at)postgresql(dot)org>
Subject:
Re: BUG #7534: walreceiver takes long time to detect n/w breakdown
Date:
2012-10-01 16:57:34
Message-ID:
CAHGQGwEd34=Z7=t9q8Xf11pmQS5a216ug7NW4V6qpuawG1crOA@mail.gmail.com
Lists:
pgsql-bugs pgsql-hackers
On Mon, Oct 1, 2012 at 7:38 PM, Heikki Linnakangas
<hlinnakangas(at)vmware(dot)com> wrote:
> Hmm, I think we need to step back a bit. I've never liked the way
> replication_timeout works, where it's the user's responsibility to set
> wal_receiver_status_interval < replication_timeout. It's not very
> user-friendly. I'd rather not copy that same design to this walreceiver
> timeout. If there's two different timeouts like that, it's even worse,
> because it's easy to confuse the two.
Agreed.
I'd like to specify the replication timeout the way we specify TCP keepalives,
i.e., what about introducing something like the following parameters?
walsender_keepalives_idle
walsender_keepalives_interval
walsender_keepalives_count
walreceiver_keepalives_idle
walreceiver_keepalives_interval
walreceiver_keepalives_count
I believe many users are basically familiar with TCP keepalives and how to
specify them. So I think that this approach would be intuitive to users. Also
this approach covers your proposal. If you specify
walsender_keepalives_idle = walsender_timeout / 2
walsender_keepalives_interval = -1 (disabled; no further Ping is sent if there is no reply to the first Ping)
walsender_keepalives_count = 1
the replication timeout works as you proposed. But of course the downside
of this approach is that the number of parameters for the replication timeout
increases from two (replication_timeout and wal_receiver_status_interval)
to six, and those parameters are confusingly similar to the existing
tcp_keepalives parameters, which might confuse users further. One idea to
solve this problem is to use the existing tcp_keepalives parameter values for
the replication timeout.
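To make the idle/interval/count idea concrete, here is a rough sketch of when Pings would be sent under such settings. It is only an illustration: the names KeepaliveSettings and ping_schedule are hypothetical, the semantics are assumed to mirror TCP keepalives, and nothing here is taken from an actual patch. How long to wait after the last Ping before giving up on the connection is not modelled here.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class KeepaliveSettings:
    """Hypothetical container for the three proposed settings (seconds)."""
    idle: float      # silence allowed before the first Ping is sent
    interval: float  # gap between repeated Pings; negative means "never resend"
    count: int       # maximum number of unanswered Pings


def ping_schedule(s: KeepaliveSettings) -> List[float]:
    """Return the offsets, in seconds since the last activity, at which
    Pings would be sent if the peer never replies (assumed semantics)."""
    if s.count < 1:
        return []
    if s.interval < 0:
        # Resending is disabled: only the first Ping is ever sent.
        return [s.idle]
    return [s.idle + i * s.interval for i in range(s.count)]


if __name__ == "__main__":
    walsender_timeout = 60.0  # assumed example value
    # The special case above: idle = timeout / 2, interval = -1, count = 1.
    single = KeepaliveSettings(idle=walsender_timeout / 2, interval=-1, count=1)
    print(ping_schedule(single))   # [30.0]
    # A TCP-keepalive-like configuration with retries.
    retries = KeepaliveSettings(idle=30.0, interval=10.0, count=3)
    print(ping_schedule(retries))  # [30.0, 40.0, 50.0]
```

With interval = -1 and count = 1, exactly one Ping goes out at half the timeout, which is the case described above; the retrying configuration shows how the same three knobs would behave like ordinary TCP keepalive probing.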
Regards,
--
Fujii Masao