Re: Autovaccum with cost_delay does not complete on one solaris 5.10 machine

From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: Josh Berkus <josh(at)agliodbs(dot)com>
Cc: Alvaro Herrera <alvherre(at)commandprompt(dot)com>, pgsql-performance(at)postgresql(dot)org
Subject: Re: Autovaccum with cost_delay does not complete on one solaris 5.10 machine
Date: 2010-04-15 22:48:17
Message-ID: 26178.1271371697@sss.pgh.pa.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-performance

Josh Berkus <josh(at)agliodbs(dot)com> writes:
>> But it shouldn't
>> be sleeping after each page with normal cost_delay parameters, should it?

> Right, that's why I find this puzzling. If the problem was easier to
> reproduce it would be easier to analyze.

The behavior would be explained if VacuumCostLimit were getting set to
zero (or some unreasonably small value) in the autovac worker process.
I looked at the autovac code that manages that, and it seems complicated
enough that a bug wouldn't surprise me in the least.

I especially note that wi_cost_limit is explicitly initialized to zero,
rather than something sane; and that table_recheck_autovac falls back to
setting vac_cost_limit from the previous value of VacuumCostLimit
... which is NOT constant but in general is left over from the
previously processed table. One should also keep in mind that SIGHUP
processing might reload VacuumCostLimit from GUC values. So I think
that area needs a closer look.

Josh, are you sure that both servers are identical in terms of both
GUC-related and per-table autovacuum settings?

regards, tom lane

In response to

Responses

Browse pgsql-performance by date

  From Date Subject
Next Message Josh Berkus 2010-04-15 22:52:09 Re: Autovaccum with cost_delay does not complete on one solaris 5.10 machine
Previous Message Greg Smith 2010-04-15 22:45:03 Re: 8.3.9 - latency spikes with Linux (and tuning for consistently low latency)