Re: Partitioning / Clustering

From: "David Roussel" <pgsql-performance(at)diroussel(dot)xsmail(dot)com>
To: "Alex Stapleton" <alexs(at)advfn(dot)com>, pgsql-performance(at)postgresql(dot)org
Subject: Re: Partitioning / Clustering
Date: 2005-05-11 07:57:57
Message-ID: 1115798277.29223.233869472@webmail.messagingengine.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-performance

For an interesting look at scalability, clustering, caching, etc for a
large site have a look at how livejournal did it.
http://www.danga.com/words/2004_lisa/lisa04.pdf

They have 2.6 Million active users, posting 200 new blog entries per
minute, plus many comments and countless page views.

Although this system is of a different sort to the type I work on it's
interesting to see how they've made it scale.

They use mysql on dell hardware! And found single master replication did
not scale. There's a section on multimaster replication, not sure if
they use it. The main approach they use is to parition users into
spefic database clusters. Caching is done using memcached at the
application level to avoid hitting the db for rendered pageviews.

It's interesting that the solution livejournal have arrived at is quite
similar in ways to the way google is set up.

David

In response to

Responses

Browse pgsql-performance by date

  From Date Subject
Next Message Kim Bisgaard 2005-05-11 08:20:25 Re: full outer performance problem
Previous Message Neil Conway 2005-05-11 07:55:10 Re: Partitioning / Clustering