Hadoop backend?

From: Paul Sheer <paulsheer(at)gmail(dot)com>
To: pgsql-hackers(at)postgresql(dot)org
Subject: Hadoop backend?
Date: 2009-02-21 20:17:30
Message-ID: c67e3dc60902211217p66906a35pe2cabe2c832e7b2d@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

Hadoop backend for PostGreSQL....

A problem that my client has, and one that I come across often,
is that a database seems to always be associated with a particular
physical machine, a physical machine that has to be upgraded,
replaced, or otherwise maintained.

Even if the database is replicated, it just means there are two or
more machines. Replication is also a difficult thing to properly
manage.

With a distributed data store, the data would become a logical
object - no adding or removal of machines would affect the data.
This is an ideal that would remove a tremendous maintenance
burden from many sites ---- well, at least the one's I have worked
at as far as I can see.

Does anyone know of plans to implement PostGreSQL over Hadoop?

Yahoo seems to be doing this:
http://glinden.blogspot.com/2008/05/yahoo-builds-two-petabyte-postgresql.html

But they store tables column-ways for their performance situation.
If one is doing a lot of inserts I don't think this is most efficient - ?

Has Yahoo put the source code for their work online?

Many thanks for any pointers.

-paul

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message pi song 2009-02-22 02:37:29 Re: Hadoop backend?
Previous Message Tom Lane 2009-02-21 18:46:07 Okay to change TypeCreate() signature in back branches?