Re: intermittant performance problem

From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: Mike Charnoky <noky(at)nextbus(dot)com>
Cc: pgsql-general(at)postgresql(dot)org
Subject: Re: intermittant performance problem
Date: 2009-03-10 00:00:20
Message-ID: 18232.1236643220@sss.pgh.pa.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

Mike Charnoky <noky(at)nextbus(dot)com> writes:
> The sampling query which runs really slow on some days looks something
> like this:

> INSERT INTO sampled_data
> (item_name, timestmp, ... )
> SELECT item_name, timestmp, ... )
> FROM raw_data
> WHERE timestmp >= ? and timestmp < ?
> AND item_name=?
> AND some_data_field NOTNULL
> ORDER BY random()
> LIMIT ?;

Hmph, I'd expect that that would run pretty slowly *all* the time :-(.
There's no good way to optimize "ORDER BY random()". However, it seems
like the first thing you should do is modify the program so that it
issues an EXPLAIN for that right before actually doing the query, and
then you could see if the plan is different on the slow days.

> We have done a great deal of PG tuning, including the autovacuum for the
> "raw_data" table. Autovacuum kicks like clockwork every day on that
> table after the sampling process finishes (after one day's worth of data
> is deleted from "raw_data" table, a roughly 7% change in size).

Also, are you sure you have ruled out the possibility that the problem
comes from autovac kicking in *while* the update is running?

regards, tom lane

In response to

Responses

Browse pgsql-general by date

  From Date Subject
Next Message JohnD 2009-03-10 00:09:39 Re: Connection Refused Error
Previous Message Joshua D. Drake 2009-03-09 23:54:08 Re: Connection Refused Error