Reddit's latest failure & PG

From: Jeff <threshar(at)torgo(dot)978(dot)org>
To: pgsql-advocacy <pgsql-advocacy(at)postgresql(dot)org>
Subject: Reddit's latest failure & PG
Date: 2011-03-18 13:34:40
Message-ID: 327470E1-D276-441A-B7D9-EEBDB5A20699@torgo.978.org
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-advocacy

http://blog.reddit.com/2011/03/why-reddit-was-down-for-6-of-last-24.html

Reddit was down for a while yesterday and they had 2 failures - one
was EBS (they use Amazon EC2 and EBS) failing.

Then they had another failure where somehow their slave PG databases
got ahead of the master. They are using Londiste for replication and
the only thing I can think of is EBS must have been lying about fsync
on the master, so some transactions were lost there.

I don't see them posting on the lists much, maybe we should reach out
to them as Reddit is a rather popular site nowadays and it could be
some good exposure for PG. (They are also using Cassandra)

--
Jeff Trout <jeff(at)jefftrout(dot)com>
http://www.stuarthamm.net/
http://www.dellsmartexitin.com/

Responses

Browse pgsql-advocacy by date

  From Date Subject
Next Message Korry Douglas 2011-03-18 13:53:05 Re: Reddit's latest failure & PG
Previous Message Christophe Pettus 2011-03-18 05:56:08 Re: CMS, foreign keys, and the legacy of MySQL