I've got a table with about 10 million events in it.
Each has a user_id (about 1000 users) and a event_time timestamp
covering a 4 year period with about 50% of the events being in the last
year. Some users have only dozens of events. A few have hundreds of
The queries usually are in the form of, where "user_id = something and
event_time between something and something".
Half of my queries index off of the user_id and half index off the
event_time. I was thinking this would be a perfect opportunity to use a
dual index of (user_id,event_time) but I'm confused as to weather this
will help considering the size of this index given that there very few
tuples that have the exact same timestamp as another and I'm not sure
which order to put the user_id/event_time as I don't know what is meant
when people on this list ask which is more selective.
Also, would it make sense for me to raise my ANALYZE value and how would
I go about doing this?
Thanks for the help.
Orion Henry <orion(at)trustcommerce(dot)com>
pgsql-performance by date
|Next:||From: Andrew Sullivan||Date: 2004-01-23 19:01:48|
|Subject: Re: High Performance/High Reliability File system on SuSE64|
|Previous:||From: Greg Spiegelberg||Date: 2004-01-23 18:53:21|
|Subject: Linux Cluster File Systems|