Quick Links

Distributed/Parallel Computing

From:	Viji V Nair <viji(at)fedoraproject(dot)org>
To:	pgsql-performance(at)postgresql(dot)org
Subject:	Distributed/Parallel Computing
Date:	2009-10-05 19:11:07
Message-ID:	84c89ac10910051211j4b6ae882m8f7eb0ea7dc3506c@mail.gmail.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-performance

Hi Team,

This question may have asked many times previously also, but I could not
find a solution for this in any post. any help on the following will be
greatly appreciated.

We have a PG DB with PostGIS functions. There are around 100 tables in the
DB and almost all the tables contains 1 million records, around 5 table
contains more than 20 million records. The total DB size is 40GB running on
a 16GB, 2 x XEON 5420, RAID6, RHEL5 64bit machines, the questions is

1. The geometry calculations which we does are very complex and it is taking
a very long time to complete. We have optimised PG config to the best, now
we need a mechanism to distribute these queries to multiple boxes. What is
best recommended way for this distributed/parallel deployment. We have tried
PGPOOL II, but the performance is not satisfactory. Going for a try with
GridSQL

2. How we can distribute/split these large tables to multiple disks of
different nodes?

Thanks in advance

Viji

Responses

Re: Distributed/Parallel Computing at 2009-10-06 02:40:50 from Jeff Janes

Browse pgsql-performance by date

	From	Date	Subject
Next Message	Guy Rouillier	2009-10-05 19:35:05	Re: Query plan for NOT IN
Previous Message	Greg Smith	2009-10-05 18:58:43	Re: Speed while runnning large transactions.