Re: TPC-H Scaling Factors X PostgreSQL Cluster Command

From: Heikki Linnakangas <heikki(at)enterprisedb(dot)com>
To: Nelson Kotowski <nkotowski(at)gmail(dot)com>
Cc: pgsql-performance(at)postgresql(dot)org
Subject: Re: TPC-H Scaling Factors X PostgreSQL Cluster Command
Date: 2007-04-23 09:46:45
Message-ID: 462C8085.3020602@enterprisedb.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-performance

Nelson Kotowski wrote:
> So far, i need to do it in three different scale factors (1, 2 and 5GB
> databases).
>
> My build process comprehends creating the tables without any foreign keys,
> indexes, etc. - Running OK!
> Then, i load the data from the flat files generated through DBGEN software
> into these tables. - Running OK!
>
> Finally, i run a "optimize" script that does the following:
>
> - Alter the tables to add the mandatory foreign keys;
> - Create all mandatory indexes;
> - Cluster the orders table by the orders table index;
> - Cluster the lineitem table by the lineitem table index;
> - Vacuum the database;
> - Analyze statistics.

Cluster will completely rewrite the table and indexes. On step 2, you
should only create the indexes you're clustering on, and create the rest
of them after clustering.

Or even better, generate and load the data in the right order to start
with, so you don't need to cluster at all.

--
Heikki Linnakangas
EnterpriseDB http://www.enterprisedb.com

In response to

Responses

Browse pgsql-performance by date

  From Date Subject
Next Message henk de wit 2007-04-23 11:35:26 Re: Redundant sub query triggers slow nested loop left join
Previous Message Mario Weilguni 2007-04-23 08:53:38 Re: postgres: 100% CPU utilization