Re: For full text indexing, which is better, tsearch2 or

From: Christopher Kings-Lynne <chriskl(at)familyhealth(dot)com(dot)au>
To: Steve Atkins <steve(at)blighty(dot)com>
Cc: pgsql-performance(at)postgresql(dot)org
Subject: Re: For full text indexing, which is better, tsearch2 or
Date: 2003-11-27 04:41:59
Message-ID: 3FC58097.8090803@familyhealth.com.au
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-performance

> Does anyone have any metrics on how fast tsearch2 actually is?
>
> I tried it on a synthetic dataset of a million documents of a hundred
> words each and while insertions were impressively fast I gave up on
> the search after 10 minutes.
>
> Broken? Unusable slow? This was on the last 7.4 release candidate.

I just created a 1.1million row dataset by copying one of our 30000 row
production tables and just taking out the txtidx column. Then I
inserted it into itself until it had 1.1 million rows.

Then I created the GiST index - THAT took forever - seriously like 20
mins or half an hour or something.

Now, to find a word:

select * from tsearchtest where ftiidx ## 'curry';
Time: 9760.75 ms

The AND of two words:
Time: 103.61 ms

The AND of three words:
select * from tsearchtest where ftiidx ## 'curry&green&thai';
Time: 61.86 ms

And now a one word query now that buffers are cached:
select * from tsearchtest where ftiidx ## 'curry';
Time: 444.89 ms

So, I have no idea why you think it's slow? Perhaps you forgot the
'create index using gist' step?

Also, if you use the NOT (!) operand, you can get yourself into a really
slow situation.

Chris

In response to

Responses

Browse pgsql-performance by date

  From Date Subject
Next Message Steve Atkins 2003-11-27 05:12:30 Re: For full text indexing, which is better, tsearch2 or
Previous Message Steve Atkins 2003-11-27 03:28:31 Re: For full text indexing, which is better, tsearch2 or