BUG #10589: hungarian.stop file spelling error

From: zsoros(at)gmail(dot)com
To: pgsql-bugs(at)postgresql(dot)org
Subject: BUG #10589: hungarian.stop file spelling error
Date: 2014-06-10 08:19:36
Message-ID: 20140610081936.2599.96998@wrigleys.postgresql.org
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-bugs

The following bug has been logged on the website:

Bug reference: 10589
Logged by: Sörös Zoltán
Email address: zsoros(at)gmail(dot)com
PostgreSQL version: 9.3.4
Operating system: Linux
Description:

Hi!
The 'hungarian.stop' file (for tsearch, located in
src/backend/snowball/stopwords in the source tarball) contains the õ
('otilde' in HTML) character instead of the correct 'ő' character. (There
are 7 occuerences in this file.)

Our database uses latin2 encoding, where we use the correct 'ő' characters.
Here's an excerpt from today's log:

< 2014-06-10 08:49:24.416 CEST >ERROR: character with byte sequence 0xc3
0xb5 in encoding "UTF8" has no equivalent in encoding "LATIN2"
< 2014-06-10 08:49:24.416 CEST >CONTEXT: line 58 of configuration file
"/usr/pgsql-9.3/share/tsearch_data/hungarian.stop"

After I replaced the tilde-capped letters in hungarian.stop file, the
problem vanished, and tsearch works fine.
I'm sorry, I can't give you the utf8 byte sequence for 'ő', but I can send
the corrected hungarian.stop file if needed.

Please fix this file in the next release.

Thanks in advance,
Zoltán Sörös

Responses

Browse pgsql-bugs by date

  From Date Subject
Next Message Serge Negodyuck 2014-06-10 09:26:27 Re: BUG #8673: Could not open file "pg_multixact/members/xxxx" on slave during hot_standby
Previous Message zhangzq 2014-06-10 02:03:55 Re: Many processes blocked at ProcArrayLock!