Re: PostgreSQL, OLAP, and Large Clusters

From: Chris Travers <chris(dot)travers(at)gmail(dot)com>
To: Stephen Frost <sfrost(at)snowman(dot)net>
Cc: Scott Marlowe <scott(dot)marlowe(at)gmail(dot)com>, Ryan Kelly <rpkelly22(at)gmail(dot)com>, pgsql-general(at)postgresql(dot)org
Subject: Re: PostgreSQL, OLAP, and Large Clusters
Date: 2012-09-27 06:57:10
Message-ID: CAKt_ZfvHbXSpUGabvDZtuPiLYiwXVWhkxht4H=qrmR9Y3p_mjw@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

Obviously OLAP performance depends to some extent on what you want to do
with it. My recommendation is to follow Scott's advice first (keep things
simple as long as you can), and then look at supplemental approaches when
you reach the limit there.

The big limit IMO is the lack of intraquery parallelism. This has
implications when trying to run OLAP queries on terabytes worth of data and
so at some point you may reach the limits of what a single instance can do.
At that point you can go with a proprietary vendor like Greenplum or you
can look at PostgresXC (which is open source). Keep in mind that solving
this issue has significant complexity costs and so you are usually better
off with tuning your existing instance first.

Best Wishes,
Chris Travers

In response to

Browse pgsql-general by date

  From Date Subject
Next Message Andres Freund 2012-09-27 06:59:09 Re: Memory issues
Previous Message ac@hsk.hk 2012-09-27 03:51:46 Re: Linux PowerPC 64bits issue