Quick Links

Re: GIN improvements part2: fast scan

From:	Jesper Krogh <jesper(at)krogh(dot)cc>
To:	Tomas Vondra <tv(at)fuzzy(dot)cz>
Cc:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: GIN improvements part2: fast scan
Date:	2014-02-03 13:59:29
Message-ID:	52EFA0C1.7060103@krogh.cc
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On 03/02/14 02:44, Tomas Vondra wrote:
> (2) The question is whether the new patch works fine on rare words. See
> this for comparison of the patches against HEAD:
>
> http://www.fuzzy.cz/tmp/gin/3-rare-words.png
> http://www.fuzzy.cz/tmp/gin/3-rare-words-new.png
>
> and this is the comparison of the two patches:
>
> http://www.fuzzy.cz/tmp/gin/patches-rare-words.png
>
> That seems fine to me - some queries are slower, but we're talking
> about queries taking 1 or 2 ms, so the measurement error is probably
> the main cause of the differences.
>
> (3) With higher numbers of frequent words, the differences (vs. HEAD or
> the previous patch) are not that dramatic as in (1) - the new patch
> is consistently by ~20% faster.
Just thinking, this is about one algorithm is being better or frequent words
and another algorithm being better at rare words... we do have
this information (at least or tsvector) in the statistics, would
it be possible to just call the "consistent" function more often if the
statistics gives signs that it actually is a frequent word?

Jesper - heavily dependent on tsvector-searches, with both frequent and
rare words.

In response to

Re: GIN improvements part2: fast scan at 2014-02-03 01:44:18 from Tomas Vondra

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Tom Lane	2014-02-03 14:05:04	Re: narwhal and PGDLLIMPORT
Previous Message	Rajni Baliyan	2014-02-03 12:59:32	Re: postgres FDW cost estimation options unrecognized in 9.3-beta1