Quick Links

CheckpointStartLock starvation

From:	Heikki Linnakangas <heikki(at)enterprisedb(dot)com>
To:	PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	CheckpointStartLock starvation
Date:	2007-04-02 19:00:03
Message-ID:	461152B3.5060404@enterprisedb.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

I'm seeing a problem on my benchmark machine: checkpoints stop happening
after the ramp-up period.

It looks like the bgwriter gets starved waiting on the
CheckpointStartLock. The CheckpointStartLock is held in shared mode over
an XLogFlush when committing, which on an extremely busy system like a
benchmark is always long enough to have a new transaction to acquire the
CheckpointStartLock again.

I'm running another test with more logging to confirm that's what's
happening, but I'm pretty sure that's it...

As a proposed fix, instead of acquiring the CheckpointStartLock in
RecordTransactionCommit, we set a flag in MyProc saying "commit in
progress". Checkpoint will scan through the procarray and make note of
any commit in progress transactions, after computing the new redo record
ptr, and wait for all of them to finish before flushing clog.

Unless someone has a better idea, I'll write a patch to do the above.

--
Heikki Linnakangas
EnterpriseDB http://www.enterprisedb.com

Responses

Re: CheckpointStartLock starvation at 2007-04-02 19:46:04 from Tom Lane
Re: CheckpointStartLock starvation at 2007-04-03 00:52:32 from ITAGAKI Takahiro

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Alvaro Herrera	2007-04-02 19:22:08	Is this portable?
Previous Message	Zdenek Kotala	2007-04-02 18:55:11	Questions about pid file creation code