Quick Links

Re: Looking for an efficient way to replace efficient NOT IN when landling very large data

From:	David Rowley <dgrowleyml(at)gmail(dot)com>
To:	Shaozhong SHI <shishaozhong(at)gmail(dot)com>
Cc:	pgsql-sql <pgsql-sql(at)lists(dot)postgresql(dot)org>
Subject:	Re: Looking for an efficient way to replace efficient NOT IN when landling very large data
Date:	2023-04-11 09:33:42
Message-ID:	CAApHDvox+8heOrYyD5Y8gpz6+wcj0w4Th2gT9+msOaH__4eFGQ@mail.gmail.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-sql

On Tue, 11 Apr 2023 at 21:28, Shaozhong SHI <shishaozhong(at)gmail(dot)com> wrote:
>
> Select a.id, a.name, b.id, b.name from a_large_table a, definitive b where (a.id, b.name) not in
> (select b.id, b.name from definitive b)
>
> is very slow.
>
> Is there a faster way to do so?

It depends on what your exact requirements are for the NULL handling
that NOT IN provides. Do you need the query to return 0 rows if b.id
and b.name are null? This question is moot if none of the columns or
either table allow NULLs.

If you don't require that, then you'll give the planner more
flexibility to choose a more efficient plan if you use NOT EXISTS
instead.

David

In response to

Looking for an efficient way to replace efficient NOT IN when landling very large data at 2023-04-11 09:27:59 from Shaozhong SHI

Responses

Re: Looking for an efficient way to replace efficient NOT IN when landling very large data at 2023-04-11 09:44:02 from Shaozhong SHI

Browse pgsql-sql by date

	From	Date	Subject
Next Message	Shaozhong SHI	2023-04-11 09:44:02	Re: Looking for an efficient way to replace efficient NOT IN when landling very large data
Previous Message	Shaozhong SHI	2023-04-11 09:27:59	Looking for an efficient way to replace efficient NOT IN when landling very large data