Tsearch2 custom dictionaries

From: psql-mail(at)freeuk(dot)com
To: pgsql-general(at)postgresql(dot)org
Subject: Tsearch2 custom dictionaries
Date: 2003-08-07 14:09:46
Message-ID: E19klSc-000OPk-00@buckaroo.freeuk.net
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

Part1.

I have created a dictionary called 'webwords' which checks all words
and curtails them to 300 chars (for now)

after running
make
make install

I then copied the lib_webwords.so into my $libdir

I have run

psql mybd < dict_webwords.sql

The tutorial shows how to install the intdict for integer types. How
should i install my custom dictionary?

Part2.

The dictionary I am trying to create is to be used for searching
multilingual text. My aim is to have fast search over all text, but
ignore binary encoded data which is also present. (i will probably move
to ignoring long words in the text eventually).
What is the best approach to tackle this problem?
As the text can be multilingual I don't think stemming is possible?
I also need to include many none-standard words in the index such as
urls and message ID's contained in the text.

I get the feeling that building these indexs will by no means be an
easy task so any suggestions will be gratefully recieved!

Thanks...

--

Responses

Browse pgsql-general by date

  From Date Subject
Next Message Jonathan Bartlett 2003-08-07 14:21:22 Re: ext3 block size
Previous Message Tom Lane 2003-08-07 13:48:23 Re: compile error of 7.4 beta1 on AIX 4.3.3