Quick Links

Re: Online enabling of checksums

From:	Andres Freund <andres(at)anarazel(dot)de>
To:	Alvaro Herrera <alvherre(at)2ndquadrant(dot)com>
Cc:	Michael Banck <michael(dot)banck(at)credativ(dot)de>, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, Daniel Gustafsson <daniel(at)yesql(dot)se>, Robert Haas <robertmhaas(at)gmail(dot)com>, Magnus Hagander <magnus(at)hagander(dot)net>, Andrey Borodin <x4mmm(at)yandex-team(dot)ru>, Tomas Vondra <tomas(dot)vondra(at)2ndquadrant(dot)com>, Heikki Linnakangas <hlinnaka(at)iki(dot)fi>, Greg Stark <stark(at)mit(dot)edu>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Online enabling of checksums
Date:	2018-08-01 16:30:19
Message-ID:	20180801163019.vekqsvdsu6s3bizg@alap3.anarazel.de
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On 2018-08-01 12:20:12 -0400, Alvaro Herrera wrote:
> Hello
>
> On 2018-Aug-01, Andres Freund wrote:
>
> > My problem isn't just that I shouldn't think this should be committed
> > without at least a firm committement to do better,
>
> I take "I think this shouldn't be committed" is what you meant.

Yep.

> I'm not sure I agree with this line of argument. The reality is that
> real life or diverging priorities preclude people from working on
> $stuff.

Right.

> This is a useful feature-1 we have here, and if we stall it
> until we have feature-2, we may not get either until a year later.
> That's not a great outcome. We didn't wait for partitioning, parallel
> query, DDL progress reporting, logical replication, JIT, wait events (to
> name but a few) to solve world's hunger in order to start getting
> committed. We move forward step by step, and that's a good thing.

But we asked that they implement something consistent, and rejected many
that were not deemed to be that.

> > my problem is that I think the "restart" approach is just using the
> > entirely wrong hammer to solve the problem at hand. At the very least
> > it's very problematic in respect to replicas, which need to know about
> > the setting too, and can have similar problems the restart on the
> > primary is supposed to prevent.
>
> If we define "restart" to mean taking all the servers down
> simultaneously, that can be planned.

It really can't realistically. That'd essentially mean breaking PITR.
You'd have to schedule the restart of any replicas to happen after a
specific record. And what if there's replicas that are on a delay? What
if there's data centers that are currently offline?

And again, this isn't hard to do properly. I don't get why we're talking
about an at least operationally complex workaround when the proper
solution isn't hard.

Greetings,

Andres Freund

In response to

Re: Online enabling of checksums at 2018-08-01 16:20:12 from Alvaro Herrera

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Andres Freund	2018-08-01 16:33:48	Re: Online enabling of checksums
Previous Message	Tomas Vondra	2018-08-01 16:25:48	Re: Online enabling of checksums