Planner statistics, correlations

From: Tobias Brox <tobias(at)nordicbet(dot)com>
To: pgsql-performance(at)postgresql(dot)org
Subject: Planner statistics, correlations
Date: 2007-01-12 08:16:31
Message-ID: 20070112081631.GA10636@oppetid.no
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-performance

We have a table with a timestamp attribute (event_time) and a state flag
which usually changes value around the event_time (it goes to 4). Now
we have more than two years of events in the database, and around 5k of
future events.

It is important to frequently pick out "overdue events", say:

select * from events where state<>4 and event_time<now()

This query would usually yield between 0 and 100 rows - however, the
planner doesn't see the correlation betewen state and event_time - since
most of the events have event_time<now, the planner also assumes most of
the events with state<>4 has event_time<now, so the expected number of
rows is closer to 5k. This matters, because I have a query with joins,
and I would really benefit from nested loops.

(I've tried replacing "now()" above with different timestamps from the
future and the past. I'm using pg 8.2)

Any suggestions?

Responses

Browse pgsql-performance by date

  From Date Subject
Next Message Peter Childs 2007-01-12 08:56:54 Re: Planner statistics, correlations
Previous Message Tom Lane 2007-01-12 02:51:39 Re: [HACKERS] unusual performance for vac following 8.2upgrade