Re: full text indexing

From: Gilles DAROLD <gilles(at)darold(dot)net>
To: "Poul L(dot) Christiansen" <poulc(at)cs(dot)auc(dot)dk>
Cc: pgsql-general(at)postgresql(dot)org
Subject: Re: full text indexing
Date: 2000-09-28 08:25:33
Message-ID: 39D3007D.8B383C81@darold.net
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

Hi,

After taking a look to many way from fast search into full text (flat
field with regrep, UDMSearch,
and other stuff like contrib FTI) i have developped my own indexer and
fast search.
It's still specific to my database but can be easily ported to any other
database structure.
It also use stop word

The first load is very slow for big database but I have had incremental
indexing and then
following load are speeder.

Here is the usage of the indexer :

Usage: ./lincat-indexer.pl [-c -i file -h host -p port] -t table -f field
-l language -u user -s file -d dbname
-c => drop search tables and indexes, reset all indexing
-i file => do incremental index with file
-v => print a dot for each word processed
-h host => database hostname [default: localhost]
-p port => database service port [default: 5432]
-t table => table to index
-f field => field to index
-l fr => language to use (can be fr, en, ...)
-u user => postgres user who need read grant on search tables
-s file => file with a list of all words not to be indexed
-d dbname => name of the database
-q => quiet mode, do not print anything else than error
message

I also have a search script that can be use to test the index and from you
can cut and past
some code to add to your programme.

usage is ./lincat-search.pl "essai test".

I don't remember exactly what it does with search (AND/OR) but this depend
on what you want
as result, so hack it.

Of course it's Perl... Search is very fast !
And in any way you can easily hack it to match your needs.

Let me know if your are interested I will post it to the list.

Regards,

Gilles DAROLD

"Poul L. Christiansen" wrote:

> Hi
>
> In my PostgreSQL database I have a lot of newspaper articles (size:
> 100mb now, growing beyond 1gb within few months).
> I wan't to use full text indexing so that users can search the articles
> with a keyword and have the results in less than one second.
>
> How do I accomplish that?
>
> Does PostgreSQL have this feature?
>
> Which 3rd party indexing tools are available that easily interoperate
> with PostgreSQL?
>
> Thanks,
> Poul L. Christiansen
> Dynamic Paper

In response to

Responses

Browse pgsql-general by date

  From Date Subject
Next Message Stefan Huber 2000-09-28 08:37:39 Re: Postgres ODBC woes
Previous Message Steve Heaven 2000-09-28 06:55:16 Re: warning - virus on the loose.