Quick Links

Re: PostgreSQL's handling of fsync() errors is unsafe and risks data loss at least on XFS

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Craig Ringer <craig(at)2ndquadrant(dot)com>
Cc:	PostgreSQL Hackers <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: PostgreSQL's handling of fsync() errors is unsafe and risks data loss at least on XFS
Date:	2018-03-28 03:53:08
Message-ID:	20431.1522209188@sss.pgh.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

Craig Ringer <craig(at)2ndquadrant(dot)com> writes:
> TL;DR: Pg should PANIC on fsync() EIO return.

Surely you jest.

> Retrying fsync() is not OK at
> least on Linux. When fsync() returns success it means "all writes since the
> last fsync have hit disk" but we assume it means "all writes since the last
> SUCCESSFUL fsync have hit disk".

If that's actually the case, we need to push back on this kernel brain
damage, because as you're describing it fsync would be completely useless.

Moreover, POSIX is entirely clear that successful fsync means all
preceding writes for the file have been completed, full stop, doesn't
matter when they were issued.

regards, tom lane

In response to

PostgreSQL's handling of fsync() errors is unsafe and risks data loss at least on XFS at 2018-03-28 02:23:46 from Craig Ringer

Responses

Re: PostgreSQL's handling of fsync() errors is unsafe and risks data loss at least on XFS at 2018-03-29 02:30:59 from Michael Paquier
Re: PostgreSQL's handling of fsync() errors is unsafe and risks data loss at least on XFS at 2018-03-29 05:58:45 from Craig Ringer

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Pavan Deolasee	2018-03-28 03:53:44	Re: Changing WAL Header to reduce contention during ReserveXLogInsertLocation()
Previous Message	Andrew Dunstan	2018-03-28 03:43:19	Re: Changing WAL Header to reduce contention during ReserveXLogInsertLocation()