Quick Links

Re: db recovery after raid5 failure

From:	"Kevin Grittner" <Kevin(dot)Grittner(at)wicourts(dot)gov>
To:	<b_ki(at)hotmail(dot)com>,<pgsql-admin(at)postgresql(dot)org>, <qcor(at)vp(dot)pl>
Subject:	Re: db recovery after raid5 failure
Date:	2010-06-23 11:50:28
Message-ID:	4C21AEB40200002500032865@gw.wicourts.gov
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-admin

Balkrishna Sharma wrote:

>> average about two drive failures a month

> You must be having a real huge postgres setup with several hundreds
> of drives to have such high frequency of failure.

About 100 database servers with over 1000 drives spinning 24/7. Also
probably significant, management-set policy is to replace machines
after four years, and we don't always hit that. I haven't tried to
run numbers on it, but the pattern sure seems to match published
reports that we have some initial failures within the first few
months when a set of machines go in, it settles down for about three
years, then the failure rate starts to edge inexorably upward.

>> As a place to put "one more copy" it might make sense, as long as
>> it had strong encryption.

> I didn't expand but that's what I meant. The copy in cloud to be
> your final resort incase the LAN and the WAN copy both fail. You
> get an extra copy in a different geographic location for some
> catastrophic event.

OK, that makes sense.

-Kevin

Browse pgsql-admin by date

	From	Date	Subject
Next Message	Senthil Kumar G	2010-06-23 11:59:26	Re: Unable to start Statistics Collector
Previous Message	sivakumar krishnamurthy	2010-06-23 10:34:27	Re: Unable to start Statistics Collector