From: | psql-mail(at)freeuk(dot)com |
---|---|
To: | pgsql-general(at)postgresql(dot)org |
Subject: | Tsearch2 custom dictionaries |
Date: | 2003-08-07 14:09:46 |
Message-ID: | E19klSc-000OPk-00@buckaroo.freeuk.net |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-general |
Part1.
I have created a dictionary called 'webwords' which checks all words
and curtails them to 300 chars (for now)
after running
make
make install
I then copied the lib_webwords.so into my $libdir
I have run
psql mybd < dict_webwords.sql
The tutorial shows how to install the intdict for integer types. How
should i install my custom dictionary?
Part2.
The dictionary I am trying to create is to be used for searching
multilingual text. My aim is to have fast search over all text, but
ignore binary encoded data which is also present. (i will probably move
to ignoring long words in the text eventually).
What is the best approach to tackle this problem?
As the text can be multilingual I don't think stemming is possible?
I also need to include many none-standard words in the index such as
urls and message ID's contained in the text.
I get the feeling that building these indexs will by no means be an
easy task so any suggestions will be gratefully recieved!
Thanks...
--
From | Date | Subject | |
---|---|---|---|
Next Message | Jonathan Bartlett | 2003-08-07 14:21:22 | Re: ext3 block size |
Previous Message | Tom Lane | 2003-08-07 13:48:23 | Re: compile error of 7.4 beta1 on AIX 4.3.3 |