Re: Large Databases redux

From: Marti Raudsepp <marti(at)juffo(dot)org>
To: Martijn van Oosterhout <kleptog(at)svana(dot)org>, John R Pierce <pierce(at)hogranch(dot)com>, pgsql-general(at)postgresql(dot)org
Subject: Re: Large Databases redux
Date: 2012-03-22 21:52:49
Message-ID: CABRT9RDJeBHp2eTyOEPM8sheND9RUaobHQRFhdKf_-PpPNeB+w@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

On Thu, Mar 22, 2012 at 00:20, Martijn van Oosterhout <kleptog(at)svana(dot)org> wrote:
> That, and a good RAID controller with BBU cache will go a long way to
> relieving the pain of fsync.

Well a BBU cache RAID is helpful, but fsyncs are a minor problem in
data warehouse workloads, since inserts are done in large bulks and
commits are rare. And you can run with synchronous_commit=off, since
it's always possible to reload the last batch after a power failure.

On Wed, Mar 21, 2012 at 23:18, Jason Herr <jaherr(at)gmail(dot)com> wrote:
> Single selects on tables need to be 3ms

You've set yourself an impossible target, that's below the average
seek time of 15kRPM disks. For indexed single-row selects on
non-cached data, expect at least a few index page fetches and a heap
fetch, and potentially file system block map lookups. 20ms seems a
more plausible target. But with competing I/O activity, especially
other OLAP/DW queries and bulk data loads, that's still quite
optimistic.

If you have a high cached access correlation and lots of RAM, it might
be possible to keep the *average* below 3ms, but I don't know if you
can bet on that with 2TB of storage.

Regards,
Marti

In response to

Browse pgsql-general by date

  From Date Subject
Next Message Tom Lane 2012-03-22 22:09:25 Re: Index on System Table
Previous Message Fabrízio de Royes Mello 2012-03-22 20:04:59 Re: Index on System Table