Re: 8.3 can't convert cyrillic text from 'iso-8859-5' to other cyrillic 8-bit encoding

From: Sergey Burladyan <eshkinkot(at)gmail(dot)com>
To: pgsql-bugs(at)postgresql(dot)org
Subject: Re: 8.3 can't convert cyrillic text from 'iso-8859-5' to other cyrillic 8-bit encoding
Date: 2008-03-18 00:28:49
Message-ID: 200803180328.49937.eshkinkot@gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-bugs

Hi, all !

i'm find the problem.

src/backend/utils/mb/conversion_procs/cyrillic_and_mic/cyrillic_and_mic.c
does not have cyrillic letter 'IO' in ISO-8859-5 to mule internal code
translation table (function iso2mic(const unsigned char *l, unsigned char *p,
int len)). this is bug, because it is widely used and it is main letter like
A, B or C in english :) and it is exist in all russian cyrillic's encoding
(koi8-r, iso-8859-5, windows-1251, cp866).
for example, in russian, words 'all', 'hedgehog', 'Christmas-tree' and many
other must be written with it.

here is the patch for add it to ISO-8859-5 to mule internal code translation
table. i am don't know is this ok and do not brake any internal rule or
code ?

By the way, as i can understand you are using koi8-r encoding for internal
representation of cyrillic charsets - this is have also another problem. the
second "widely" used char is <U2116> NUMERO SIGN (many accountants and
managers use it :) in cyrillic windows world) and it is exist in
windows-1251, cp866 and iso-8859-5 encoding, but not in koi8-r...

---

Attachment Content-Type Size
postgresql-8.3.0-add-IO-to-iso-trans-table.patch text/x-diff 1.5 KB

In response to

Responses

Browse pgsql-bugs by date

  From Date Subject
Next Message Alvaro Herrera 2008-03-18 00:47:49 Re: BUG #3983: pgxs files still missing in win32 install (8.3.1)
Previous Message Chan, Keith (LESA IT) 2008-03-18 00:23:04 Re: BUG #4041: error in Application Stack Builder