Quick Links

Re: Some ideas for comment

From:	Chris Hoover <revoohc(at)gmail(dot)com>
To:	Merlin Moncure <merlin(dot)moncure(at)rcsonline(dot)com>
Cc:	pgsql-performance(at)postgresql(dot)org
Subject:	Re: Some ideas for comment
Date:	2005-08-24 20:26:40
Message-ID:	1d219a6f05082413265e872134@mail.gmail.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Thread:
Lists:	pgsql-performance

On 8/24/05, Merlin Moncure <merlin(dot)moncure(at)rcsonline(dot)com> wrote:
> > Ok, there is always a lot of talk about tuning PostgreSQL on linux and
> > how PostgreSQL uses the linux kernel cache to cache the tables and
> > indexes.
> [...]
> >
> > 1. Implement a partition type layout using views and rules - This
> > will allow me to have one table in each view with the "active" data,
> > and the inactive data stored by year in other tables.
> >
> > So I would have the following (for each major table):
> >
> > Table View as
> > select * from active_table
> > union all
> > select * from table_2005
> > union all
> > select * from table_2004
> > etc.
>
> Linux does a pretty good job of deciding what to cache. I don't think
> this will help much. You can always look at partial indexes too.
>
Yes, but won't this help create the need to store less? If I have
1,000.000 rows in a table, but only 4,000 are active, if I move those
4 to another table and link the tables via a view, should that not
help keep the 9,996,000 rows out of the kernel cache (the majority of
the time at least)?

This would mean I have more room for other objects and hopefully less
turn over in the cache, and less disk i/o.

Yes?
[...]
> I would strongly consider adding more memory :).
Unfortunately, it looks like 12GB is all our Dell servers can handle. :(

>
> > I don't have real numbers to give you, but we know that our systems
> > are hurting i/o wise and we are growing by about 2GB+ per week (net).
> > We actually grow by about 5GB/week/server. However, when I run my
> > weekly maintenance of vacuum full, reindex, and the vacuum analyze, we
> > end up getting about 3GB back. Unfortunately, I do not have the i/o
> > bandwidth to vacuum during the day as it causes major slowdowns on our
> > system. Each night, I do run a vacuum analyze across all db's to try
> > and help. I also have my fsm parameters set high (8000000 fsm pages,
> > and 5000 fsm relations) to try and compensate.
>
> Generally, you can reduce data turnover for the same workload by
> normalizing your database. IOW, try and make your database more
> efficient in the way it stores data.
>
That's the ultimate goal, but this database structure was developed
and released into production before I started work here. I'm trying
to slowly change it into a better db, but it is a slow process.
Normalization does not make it at the top of the priority list,
unfortunately.

> > Right now, we are still on 7.3.4. However, these ideas would be
> > implemented as part of an upgrade to 8.x (plus, we'll initialize the
> > new clusters with a C locale).
> > > 2. I am also thinking of recommending we collapse all databases in a
> > cluster into one "mega" database. I can then use schema's and views
> > to control database access and ensure that no customer can see another
> > customers data.
>
> hm. keep in mind views are tightly bound to the tables they are created
> with (views can't 'float' over tables in different schemas). pl/pgsql
> functions can, though. This is a more efficient use of server
> resources, IMO, but not a windfall.

This I know. Each schema would have to have a "custom" set of views
replacing the tables with the view programmed to only return that
customers data.

I was thinking all of the tables in schema my_tables and the views all
querying the tables stored in the my_tables schema. I would add an
identifying column to each table so that I can differentiate the data.

Chris

In response to

Re: Some ideas for comment at 2005-08-24 18:59:12 from Merlin Moncure

Responses

Re: Some ideas for comment at 2005-08-25 07:18:22 from Jens-Wolfhard Schicke

Browse pgsql-performance by date

	From	Date	Subject
Next Message	Mark Fox	2005-08-24 20:43:51	Performance indexing of a simple query
Previous Message	Jignesh Shah	2005-08-24 20:20:34	Re: Read/Write block sizes