Re: Setup for large database

From: "Jim C(dot) Nasby" <jnasby(at)pervasive(dot)com>
To: "mlartz(at)gmail(dot)com" <mlartz(at)gmail(dot)com>
Cc: pgsql-general(at)postgresql(dot)org
Subject: Re: Setup for large database
Date: 2006-04-21 20:02:17
Message-ID: 20060421200216.GQ49405@pervasive.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

On Thu, Apr 20, 2006 at 06:02:17PM -0700, mlartz(at)gmail(dot)com wrote:
> 1) How anal should I be about my hardware setup? I have about 15 300GB
> 10K RPM SCSI drives, 4 of which I can directly attach to the server and
> the rest one the FC array. Should I just put the OS and transaction
> logs on the direct attached storage and and then RAID10 the rest of
> them and be done, or would I significantly benefit from separating out
> the indexes and partitioning across tablespaces across drives? Would
> RAID5 across 10+ drives yield acceptable performance numbers?

My experience is more OLTP than OLAP, but for a warehouse envirenment
RAID5 can be a good solution since there's typically not a lot of
updating going on.

I've yet to see much gain from moving pg_xlog onto it's own seperate set
of drives; there's usually not enough traffic from the OS to justify it.
But it is possible that you could end up generating enough WAL traffic
that pg_xlog would become a performance limiter on only 2 drives, though
I suspect you'd have to have over 20-30 drives for data before that
happened.

> 3) I've currently installed RHEL4 AS for my OS, which I am very
> comfortable with. I was going to go with EXT3 on everything (noatime)
> ... sound good?

There's a data=writeback option for ext3 that can make a big performance
difference.

> 2) Assuming that my data is roughly evenly distributed among IP
> addresses, I figured that a naive partitioning based on the first octet

See my other reply...

> 3) I guess I don't quite understand Bizgres. At the moment, it seems
> to be just a development beta of Postgres ... is this true? I realize
> that the focus is on BI/ETL stuff, but the current improvements seem to
> benefit Postgres as a whole. Is there currently or can you imagine a
> case where a feature in Bizgres won't get integrated into Postgres?
> How significant is the fork between Bizgres and Postgres? I've also
> considered taking a look at Bizgres MPP. I know that its the wrong
> forum, but any comments?

You'd probably be better off asking on a bizgres list...

> 4) Not to start any sort of flame war, but my company has an Oracle
> license and there are a bunch of people wanting me to go that way.
> I've been doing just fine with Postgres at the moment and am quite
> comfortable with it, but am being pressured to go with our Oracle
> license. Cost (and prejudices) aside, do you think it would be wise to
> go with Oracle to begin with, considering the size of the database that
> I'm planning?

There's certainly people out there running multi-terrabyte databases on
PostgreSQL. Unless there's a sound technical reason to switch, I'd stick
with PostgreSQL, especially because migrating to Oracle from PostgreSQL
is fairly easy.
--
Jim C. Nasby, Sr. Engineering Consultant jnasby(at)pervasive(dot)com
Pervasive Software http://pervasive.com work: 512-231-6117
vcard: http://jim.nasby.net/pervasive.vcf cell: 512-569-9461

In response to

Browse pgsql-general by date

  From Date Subject
Next Message Jim C. Nasby 2006-04-21 20:23:03 Re: temp tables problem
Previous Message Jim C. Nasby 2006-04-21 19:55:37 Re: Setup for large database