Re: Table maintenance: order of operations important?

From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: Jeff Boes <mur(at)qtm(dot)net>
Cc: pgsql-admin(at)postgresql(dot)org
Subject: Re: Table maintenance: order of operations important?
Date: 2004-05-20 17:52:12
Message-ID: 3030.1085075532@sss.pgh.pa.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-admin

Jeff Boes <mur(at)qtm(dot)net> writes:
> DELETE FROM foo WHERE date_expires < now();
> VACUUM ANALYZE foo;
> CLUSTER foo;
> REINDEX TABLE foo;

> How would you choose to order these (under 7.4.x) for fastest
> turn-around? Does it matter?

If you are going to CLUSTER then the VACUUM and the REINDEX are both
utterly redundant. The ANALYZE is still useful but should be done after
CLUSTER since its physical-order-correlation stats will be quite wrong
if done beforehand. In other words there is only one sane way to do
this and it is

DELETE FROM foo WHERE date_expires < now();
CLUSTER foo;
ANALYZE foo;

You could possibly make a case for

DELETE FROM foo WHERE date_expires < now();
CLUSTER foo;
VACUUM ANALYZE foo;

The VACUUM won't do anything useful in terms of reclaiming space (there
being none to reclaim just after a CLUSTER) but it would ensure that all
rows in the table are marked as committed-good, rather than leaving that
work to be done by the first transaction that happens to hit each row.

regards, tom lane

In response to

Responses

Browse pgsql-admin by date

  From Date Subject
Next Message Kris Kiger 2004-05-20 18:30:53 Clustering Postgres
Previous Message Tom Lane 2004-05-20 17:46:14 Re: cannot connect to postgres db, strange error!