Skip site navigation (1) Skip section navigation (2)

Forcing index scan on query produces 16x faster

From: Christian Brink <cbrink(at)r-stream(dot)com>
To: pgsql-performance(at)postgresql(dot)org
Subject: Forcing index scan on query produces 16x faster
Date: 2010-03-17 21:25:35
Message-ID: 4BA148CF.5080701@r-stream.com (view raw or flat)
Thread:
Lists: pgsql-performance
I am running into a problem with a particular query. The execution plan 
cost shows that the Seq Scan is a better bet (cost=54020.49..54020.55) 
over the forced index 'enable_seqscan =  false' 
(cost=1589703.87..1589703.93). But when I run the query both ways I get 
a vastly different result (below). It appears not to want to bracket the 
salesitems off of the 'id' foreign_key unless I force it.

Is there a way to rewrite or hint the planner to get me the better plan 
without resorting to 'enable_seqscan' manipulation (or am I missing 
something)?

postream=> select version();
                                                          version
-------------------------------------------------------------------------------------------------------------------------
  PostgreSQL 8.0.3 on i386-redhat-linux-gnu, compiled by GCC 
i386-redhat-linux-gcc (GCC) 4.0.0 20050505 (Red Hat 4.0.0-4)


postream=> SET enable_seqscan = false;
SET
postream=> EXPLAIN ANALYZE
postream-> SELECT si.group1_id as name, sum(si.qty) as count, 
sum(si.amt) as amt
postream->   FROM salesitems si, sales s, sysstrings
postream->  WHERE si.id = s.id
postream->    AND si.group1_id != ''
postream->    AND si.group1_id IS NOT NULL
postream->    AND NOT si.void
postream->    AND NOT s.void
postream->    AND NOT s.suspended
postream->    AND s.tranzdate >= (cast('2010-02-15' as date) + 
cast(sysstrings.data as time))
postream->    AND s.tranzdate < ((cast('2010-02-15' as date) + 1) + 
cast(sysstrings.data as time))
postream->    AND sysstrings.id='net/Console/Employee/Day End Time'
postream->  GROUP BY name;
                                                                                              QUERY PLAN
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
  HashAggregate  (cost=1589703.87..1589703.93 rows=13 width=35) (actual 
time=33.414..33.442 rows=12 loops=1)
    ->  Nested Loop  (cost=0.01..1588978.22 rows=96753 width=35) (actual 
time=0.284..22.115 rows=894 loops=1)
          ->  Nested Loop  (cost=0.01..2394.31 rows=22530 width=4) 
(actual time=0.207..4.671 rows=225 loops=1)
                ->  Index Scan using sysstrings_pkey on sysstrings  
(cost=0.00..5.78 rows=1 width=175) (actual time=0.073..0.078 rows=1 loops=1)
                      Index Cond: (id = 'net/Console/Employee/Day End 
Time'::text)
                ->  Index Scan using sales_tranzdate_index on sales s  
(cost=0.01..1825.27 rows=22530 width=12) (actual time=0.072..3.464 
rows=225 loops=1)
                      Index Cond: ((s.tranzdate >= ('2010-02-15'::date + 
("outer".data)::time without time zone)) AND (s.tranzdate < 
('2010-02-16'::date + ("outer".data)::time without time zone)))
                      Filter: ((NOT void) AND (NOT suspended))
          ->  Index Scan using salesitems_pkey on salesitems si  
(cost=0.00..70.05 rows=30 width=39) (actual time=0.026..0.052 rows=4 
loops=225)
                Index Cond: (si.id = "outer".id)
                Filter: ((group1_id <> ''::text) AND (group1_id IS NOT 
NULL) AND (NOT void))
  Total runtime: 33.734 ms
(12 rows)

postream=> SET enable_seqscan = true;
SET
postream=> EXPLAIN ANALYZE
postream-> SELECT si.group1_id as name, sum(si.qty) as count, 
sum(si.amt) as amt
postream->   FROM salesitems si, sales s, sysstrings
postream->  WHERE si.id = s.id
postream->    AND si.group1_id != ''
postream->    AND si.group1_id IS NOT NULL
postream->    AND NOT si.void
postream->    AND NOT s.void
postream->    AND NOT s.suspended
postream->    AND s.tranzdate >= (cast('2010-02-15' as date) + 
cast(sysstrings.data as time))
postream->    AND s.tranzdate < ((cast('2010-02-15' as date) + 1) + 
cast(sysstrings.data as time))
postream->    AND sysstrings.id='net/Console/Employee/Day End Time'
postream->  GROUP BY name;
                                                                                                 QUERY PLAN
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
  HashAggregate  (cost=54020.49..54020.55 rows=13 width=35) (actual 
time=5564.929..5564.957 rows=12 loops=1)
    ->  Hash Join  (cost=2539.63..53294.84 rows=96753 width=35) (actual 
time=5502.324..5556.262 rows=894 loops=1)
          Hash Cond: ("outer".id = "inner".id)
          ->  Seq Scan on salesitems si  (cost=0.00..30576.60 
rows=885215 width=39) (actual time=0.089..3099.453 rows=901249 loops=1)
                Filter: ((group1_id <> ''::text) AND (group1_id IS NOT 
NULL) AND (NOT void))
          ->  Hash  (cost=2394.31..2394.31 rows=22530 width=4) (actual 
time=3.329..3.329 rows=0 loops=1)
                ->  Nested Loop  (cost=0.01..2394.31 rows=22530 width=4) 
(actual time=0.217..2.749 rows=225 loops=1)
                      ->  Index Scan using sysstrings_pkey on 
sysstrings  (cost=0.00..5.78 rows=1 width=175) (actual time=0.077..0.085 
rows=1 loops=1)
                            Index Cond: (id = 'net/Console/Employee/Day 
End Time'::text)
                      ->  Index Scan using sales_tranzdate_index on 
sales s  (cost=0.01..1825.27 rows=22530 width=12) (actual 
time=0.074..1.945 rows=225 loops=1)
                            Index Cond: ((s.tranzdate >= 
('2010-02-15'::date + ("outer".data)::time without time zone)) AND 
(s.tranzdate < ('2010-02-16'::date + ("outer".data)::time without time 
zone)))
                            Filter: ((NOT void) AND (NOT suspended))
  Total runtime: 5565.262 ms
(13 rows)


-- 
Christian Brink



Responses

pgsql-performance by date

Next:From: David WilsonDate: 2010-03-17 21:56:55
Subject: Re: Forcing index scan on query produces 16x faster
Previous:From: Hannu KrosingDate: 2010-03-17 21:18:47
Subject: Re: Building multiple indexes concurrently

Privacy Policy | About PostgreSQL
Copyright © 1996-2014 The PostgreSQL Global Development Group