Large databases, performance

From: "Shridhar Daithankar" <shridhar_daithankar(at)persistent(dot)co(dot)in>
To: pgsql-hackers(at)postgresql(dot)org
Cc: pgsql-general <pgsql-general(at)postgresql(dot)org>, "pankaj M(dot) Tolani" <pankaj(at)pspl(dot)co(dot)in>
Subject: Large databases, performance
Date: 2002-10-03 12:36:10
Message-ID: 3D9C8712.9513.9C6521D@localhost
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general pgsql-hackers pgsql-performance pgsql-sql

Hi,

Today we concluded test for database performance. Attached are results and the
schema, for those who have missed earlier discussion on this.

We have (almost) decided that we will partition the data across machines. The
theme is, after every some short interval a burst of data will be entered in
new table in database, indexed and vacuume. The table(s) will be inherited so
that query on base table will fetch results from all the children. The
application has to consolidate all the data per node basis. If the database is
not postgresql, app. has to consolidate data across partitions as well.

Now we need to investigate whether selecting on base table to include children
would use indexes created on children table.

It's estimated that when entire data is gathered, total number of children
tables would be around 1K-1.1K across all machines.

This is in point of average rate of data insertion i.e. 5K records/sec and
total data size, estimated to be 9 billion rows max i.e. estimated database
size is 900GB. Obviously it's impossible to keep insertion rate on an indexed
table high as data grows. So partitioning/inheritance looks better approach.

Postgresql is not the final winner as yet. Mysql is in close range. I will keep
you guys posted about the result.

Let me know about any comments..

Bye
Shridhar

--
Price's Advice: It's all a game -- play it to have fun.

Attachment Content-Type Size
unknown_filename text/plain 1.3 KB
unknown_filename text/plain 557 bytes

Responses

Browse pgsql-general by date

  From Date Subject
Next Message Charles H. Woloszynski 2002-10-03 12:54:29 Re: Large databases, performance
Previous Message Diogo Biazus 2002-10-03 12:35:19 Re: Anyone want to assist with the translation of the Advocacy

Browse pgsql-hackers by date

  From Date Subject
Next Message Charles H. Woloszynski 2002-10-03 12:54:29 Re: Large databases, performance
Previous Message Diogo Biazus 2002-10-03 12:35:19 Re: Anyone want to assist with the translation of the Advocacy

Browse pgsql-performance by date

  From Date Subject
Next Message Charles H. Woloszynski 2002-10-03 12:54:29 Re: Large databases, performance
Previous Message Ron Johnson 2002-10-02 10:48:01 Re: Comparitive UPDATE speed

Browse pgsql-sql by date

  From Date Subject
Next Message Charles H. Woloszynski 2002-10-03 12:54:29 Re: Large databases, performance
Previous Message Thrasher 2002-10-03 08:18:30 Re: Updating from select