Quick Links

Re: Let PostgreSQL's On Schedule checkpoint write buffer smooth spread cycle by tuning IsCheckpointOnSchedule?

From:	Heikki Linnakangas <hlinnaka(at)iki(dot)fi>
To:	digoal zhou <digoal(dot)zhou(at)gmail(dot)com>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Let PostgreSQL's On Schedule checkpoint write buffer smooth spread cycle by tuning IsCheckpointOnSchedule?
Date:	2015-05-12 12:14:35
Message-ID:	5551EEAB.4010202@iki.fi
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On 05/12/2015 03:27 AM, digoal zhou wrote:
> PostgreSQL (<=9.4) trend to smooth buffer write smooth in a
> checkpoint_completion_target (checkpoint_timeout or checkpoint_segments),
> but when we use synchronous_commit=off, there is a little problem for
> the checkpoint_segments
> target, because xlog write fast(for full page write which the first page
> write after checkpoint), so checkpointer cann't sleep and write buffer not
> smooth.
> ...
> I think we can add an condition to the IsCheckpointOnSchedule,
> if (synchronous_commit != SYNCHRONOUS_COMMIT_OFF)
> {
> recptr = GetInsertRecPtr();
> elapsed_xlogs = (((double) (recptr -
> ckpt_start_recptr)) / XLogSegSize) / CheckPointSegments;
>
> if (progress < elapsed_xlogs)
> {
> ckpt_cached_elapsed = elapsed_xlogs;
> return false;
> }
> }

This has nothing to do with asynchronous_commit, except that setting
asynchronous_commit=off makes your test case run faster, and hit the
problem harder.

I think the real problem here is that IsCheckpointOnSchedule assumes
that the rate of WAL generated is constant throughout the checkpoint
cycle, but in reality you generate a lot more WAL immediately after the
checkpoint begins, thanks to full_page_writes. For example, in the
beginning of the cycle, you quickly use up, say, 20% of the WAL space in
the first 10 seconds, and the scheduling thinks it's in a lot of hurry
to finish the checkpoint because it extrapolates that the rest of the
WAL will be used up in the next 40 seconds. But in reality, the WAL
consumption levels off, and you have many minutes left until
CheckPointSegments.

Can you try the attached patch? It modifies the above calculation to
take the full-page-write effect into account. I used X^1.5 as the
corrective function, which roughly reflects the typical WAL consumption
pattern. You can adjust the exponent, 1.5, to make the correction more
or less aggressive.

- Heikki

Attachment	Content-Type	Size
compensate-fpw-effect-on-checkpoint-scheduling-1.patch	application/x-patch	989 bytes

In response to

Let PostgreSQL's On Schedule checkpoint write buffer smooth spread cycle by tuning IsCheckpointOnSchedule? at 2015-05-12 00:27:51 from digoal zhou

Responses

Re: Let PostgreSQL's On Schedule checkpoint write buffer smooth spread cycle by tuning IsCheckpointOnSchedule? at 2015-05-13 08:35:28 from Heikki Linnakangas
Re: Let PostgreSQL's On Schedule checkpoint write buffer smooth spread cycle by tuning IsCheckpointOnSchedule? at 2015-06-26 13:43:26 from Heikki Linnakangas

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Andrew Dunstan	2015-05-12 12:35:36	Re: pg_basebackup vs. Windows and tablespaces
Previous Message	Stephen Frost	2015-05-12 11:11:32	Re: LOCK TABLE Permissions