When/if to Reindex

From: "Steven Flatt" <steven(dot)flatt(at)gmail(dot)com>
To: pgsql-performance(at)postgresql(dot)org
Subject: When/if to Reindex
Date: 2007-07-18 17:08:30
Message-ID: 357fa7590707181008k65ab2f20l55bebb3c14f52770@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-performance

We're using Postgres 8.2.4.

I'm trying to decide whether it's worthwhile to implement a process that
does periodic reindexing. In a few ad hoc tests, where I've tried to set up
data similar to how our application does it, I've noticed decent performance
increases after doing a reindex as well as the planner being more likely to
choose an index scan.

Some background: we make extensive use of partitioned tables. In fact, I'm
really only considering reindexing partitions that have "just closed". In
our simplest/most general case, we have a table partitioned by a timestamp
column, each partition 24 hours wide. The partition will have an index on
the timestamp column as well as a few other indexes including a primary key
index (all b-tree). Is there a programmatic way I can decide, upon the
"closing" of a partition, which, if any, of these indexes will benefit from
a reindex? Can I determine things like average node density, node depth, or
any other indication as to the quality of an index? Will pg_class.relpages
be any help here?

Is it a simple matter of running some queries, reindexing the table, then
running the queries again to determine overall performance change? If so,
what queries would exercise this best?

Just trying to determine if the extra cost of reindexing newly closed
partitions will be worth the performance benefit of querying the data.
Reindexing a table with a day's worth of data is taking on the order of a
few hours (10s of millions of rows).

The docs say that:

"...for B-tree indexes a freshly-constructed index is somewhat faster to
access than one that has been updated many times, because logically adjacent
pages are usually also physically adjacent in a newly built index... It
might be worthwhile to reindex periodically just to improve access speed."

Thanks,
Steve

Responses

Browse pgsql-performance by date

  From Date Subject
Next Message Jim C. Nasby 2007-07-18 18:48:16 Re: ionice to make vacuum friendier?
Previous Message Erik Jones 2007-07-18 17:06:34 Re: How to use a trigger to write rows to a remote server