BUG #7509: x NOT IN (select x from z) extremely slow in compare to select x from y except select x from z;

From: stefan(at)konink(dot)de
To: pgsql-bugs(at)postgresql(dot)org
Subject: BUG #7509: x NOT IN (select x from z) extremely slow in compare to select x from y except select x from z;
Date: 2012-08-29 10:31:27
Message-ID: E1T6fYV-0001eM-S3@wrigleys.postgresql.org
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-bugs

The following bug has been logged on the website:

Bug reference: 7509
Logged by: Stefan de Konink
Email address: stefan(at)konink(dot)de
PostgreSQL version: 9.1.5
Operating system: Linux
Description:

The following is relatively fast:

bag-2012-aug=# explain select count(*) from (select kvk from kvk_normal
except select kvk from bag_kvk) as x;
QUERY PLAN

----------------------------------------------------------------------------------------------------------
Aggregate (cost=1110465.88..1110465.89 rows=1 width=0)
-> Subquery Scan on x (cost=1042163.45..1102413.23 rows=3221060
width=0)
-> SetOp Except (cost=1042163.45..1070202.63 rows=3221060
width=8)
-> Sort (cost=1042163.45..1056183.04 rows=5607836 width=8)
Sort Key: "*SELECT* 1".kvk
-> Append (cost=0.00..183539.72 rows=5607836
width=8)
-> Subquery Scan on "*SELECT* 1"
(cost=0.00..122902.20 rows=3221060 width=8)
-> Seq Scan on kvk_normal
(cost=0.00..90691.60 rows=3221060 width=8)
-> Subquery Scan on "*SELECT* 2"
(cost=0.00..60637.52 rows=2386776 width=8)
-> Seq Scan on bag_kvk
(cost=0.00..36769.76 rows=2386776 width=8)

The 'normal' case basically doesn't finish:

bag-2012-aug=# explain select count(*) from (select kvk_normal.kvk from
kvk_normal where kvk_normal.kvk not in (select bag_kvk.kvk from bag_kvk)) as
x;
QUERY PLAN

-------------------------------------------------------------------------------------
Aggregate (cost=103065293697.97..103065293697.98 rows=1 width=0)
-> Seq Scan on kvk_normal (cost=0.00..103065289671.65 rows=1610530
width=0)
Filter: (NOT (SubPlan 1))
SubPlan 1
-> Materialize (cost=0.00..58027.64 rows=2386776 width=8)
-> Seq Scan on bag_kvk (cost=0.00..36769.76 rows=2386776
width=8)
(6 rows)

Table size is 3.2mil rows in adres, and 2.3mil rows in bag_kvk.

Responses

Browse pgsql-bugs by date

  From Date Subject
Next Message Rajeev rastogi 2012-08-29 13:13:51 Re: [HACKERS] BUG #6572: The example of SPI_execute is bogus
Previous Message MirrorX 2012-08-29 09:16:19 hot standby lagging vs warm that is up-to-date