Quick Links

Adjustment of spinlock sleep delays

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	pgsql-hackers(at)postgreSQL(dot)org
Subject:	Adjustment of spinlock sleep delays
Date:	2003-08-05 20:11:28
Message-ID:	12131.1060114288@sss.pgh.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

I've been thinking about Ludwig Lim's recent report of a "stuck
spinlock" failure on a heavily loaded machine. Although I originally
found this hard to believe, there is a scenario which makes it
plausible. Suppose that we have a bunch of recently-started backends
as well as one or more that have been running a long time --- long
enough that the scheduler has niced them down a priority level or two.
Now suppose that one of the old-timers gets interrupted while holding
a spinlock (an event of small but nonzero probability), and that before
it can get scheduled again, several of the newer, higher-priority
backends all start trying to acquire the same spinlock. The "acquire"
code looks like "try to grab the spinlock a few times, then sleep for
10 msec, then try again; give up after 1 minute". If there are enough
backends trying this that cycling through all of them takes at least
10 msec, then the lower-priority backend will never get scheduled, and
after a minute we get the dreaded "stuck spinlock".

To forestall this scenario, I'm thinking of introducing backoff into the
sleep intervals --- that is, after first failure to get the spinlock,
sleep 10 msec; after the second, sleep 20 msec, then 40, etc, with a
maximum sleep time of maybe a second. The number of iterations would be
reduced so that we still time out after a minute's total delay.

Comments?

regards, tom lane

Responses

Re: Adjustment of spinlock sleep delays at 2003-08-05 20:55:01 from Mike Mascari
Re: Adjustment of spinlock sleep delays at 2003-08-05 21:13:43 from Rod Taylor
Re: Adjustment of spinlock sleep delays at 2003-08-06 01:37:51 from Christopher Kings-Lynne
Re: Adjustment of spinlock sleep delays at 2003-08-06 11:23:54 from Mendola Gaetano

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Tom Lane	2003-08-05 20:22:54	Re: TODO: trigger features
Previous Message	Dag-Erling =?iso-8859-1?q?Sm=F8rgrav?=	2003-08-05 20:03:45	Re: AUTO_INCREMENT patch