Tsearch2 slovak UTF-8

From: "Jan Sunavec" <jan(dot)sunavec(at)b4b(dot)sk>
To: pgsql-general(at)postgresql(dot)org
Subject: Tsearch2 slovak UTF-8
Date: 2008-01-21 10:05:33
Message-ID: op.t49ybjyetuteoe@akela-pc
Lists: pgsql-general

Hi All

I have PostgreSQL 8.2.6 running on Windows. I tried to install a Slovak
dictionary for tsearch2.

INSERT INTO pg_ts_dict
VALUES('ispell_slovak','spell_init(internal)','DictFile="C:/slovak_utf8.dict",
AffFile="C:/slovak_utf8.aff", StopFile="C:/slovak_utf8.stop"',
'spell_lexize(internal,internal,integer)','Slovak ISpell. UTF8 Encoding');

INSERT INTO pg_ts_cfg VALUES('utf8_slovak', 'default', 'Slovak_Slovakia');

INSERT INTO pg_ts_cfgmap
SELECT 'utf8_slovak',tok_alias,dict_name
FROM pg_ts_cfgmap WHERE ts_name='utf8_russian';

UPDATE pg_ts_cfgmap SET dict_name='{ispell_slovak,simple}'
WHERE ('ru_stem'=ANY(dict_name) OR 'ru_stem_utf8'=ANY(dict_name) OR
'en_stem' = ANY(dict_name))
AND ts_name='utf8_slovak';

There are a few problems.
1. It's not possible to use the path
DictFile="C:/Program Files/PostgreSQL/8.2/slovak_utf8.dict"
2. I always get "ERROR: Affix parse error at 63 line

In statement:
SELECT to_tsvector('utf8_slovak', 'žltá vody');"

The second problem is the most important. That line contains the first
UTF-8 character. I read that it's necessary to install the patch
tsearch_snowball_82.gz, but I am running on Windows and I am not skilled
enough to compile on Windows. I thought that the latest PostgreSQL version
had this patch included.
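
For anyone trying to reproduce this, the queries below are a rough sketch of how the setup can be inspected. They assume the tsearch2 contrib module is loaded in the current database and that the dictionary and configuration names are the ones created above. They only show the current state and are not a fix for the affix error.

```sql
-- Encoding of the current database and of the client session.
SHOW server_encoding;
SHOW client_encoding;

-- Server locale settings; the locale column of pg_ts_cfg is meant to
-- correspond to the server locale.
SHOW lc_ctype;
SHOW lc_collate;
SELECT ts_name, prs_name, locale FROM pg_ts_cfg;

-- Dictionary options exactly as they were stored by the INSERT.
SELECT dict_name, dict_initoption
FROM pg_ts_dict
WHERE dict_name = 'ispell_slovak';

-- Token type to dictionary mapping for the new configuration.
SELECT tok_alias, dict_name
FROM pg_ts_cfgmap
WHERE ts_name = 'utf8_slovak';

-- Call the dictionary directly, bypassing the parser and the mapping.
-- This should raise the same affix error if the .aff file cannot be loaded.
SELECT lexize('ispell_slovak', 'vody');
```

If the last query fails in the same way as to_tsvector, the problem is in loading the dictionary files rather than in the configuration mapping.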

Thanks for your help.

Jan Sunavec
