| From: | Greg Smith <gsmith(at)gregsmith(dot)com> | 
|---|---|
| To: | pgsql-patches(at)postgresql(dot)org | 
| Subject: | Re: Expose checkpoint start/finish times into SQL. | 
| Date: | 2008-04-04 07:01:06 | 
| Message-ID: | Pine.GSO.4.64.0804040251110.9874@westnet.com | 
| Lists: | pgsql-patches | 

On Fri, 4 Apr 2008, Tom Lane wrote:

> The actual advice I'd give to a DBA faced with such a case is to
> kill -ABRT the bgwriter and send the stack trace to -hackers.

And that's a perfect example of where they're trying to get to.  They 
didn't notice the problem until after the crash.  The server didn't come 
back up (busy processing WAL files) and that downtime was caught by a 
monitoring system.  At that point it was too late to collect debugging 
information on what was wrong inside the server processes that might have 
given a clue what happened.

If they had noticed it while the server was up, perhaps because the 
"last checkpoint" value hadn't changed in a long time (which seems like it 
might be available via stats even if, as you say, the background writer is 
out of its mind at that point), they could have done such a kill and 
collected some actually useful information.  That's the theory, at 
least.
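As a rough sketch of the kind of check a monitoring system could run: this is not part of the patch, and the function name, the 30-minute threshold, and the sample timestamps are all made up for illustration. It assumes the "last checkpoint" time has already been fetched from wherever the server ends up exposing it.

```python
from datetime import datetime, timedelta


def checkpoint_is_stale(last_checkpoint, now, max_age=timedelta(minutes=30)):
    """Return True when the last checkpoint is older than max_age.

    last_checkpoint and now are datetime values; max_age is a hypothetical
    threshold that would need tuning against the checkpoint settings in use.
    """
    return now - last_checkpoint > max_age


# Hypothetical readings taken by a monitoring job.
last_seen = datetime(2008, 4, 4, 2, 0, 0)
print(checkpoint_is_stale(last_seen, datetime(2008, 4, 4, 2, 10, 0)))  # False
print(checkpoint_is_stale(last_seen, datetime(2008, 4, 4, 3, 0, 0)))   # True
```

A True result while the server is still up would be the cue to collect a stack trace before anything crashes.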
--
* Greg Smith gsmith(at)gregsmith(dot)com http://www.gregsmith.com Baltimore, MD