Re: Unicode mapping scripts cleanup

From: Tatsuo Ishii <ishii(at)postgresql(dot)org>
To: peter_e(at)gmx(dot)net
Cc: pgsql-hackers(at)postgresql(dot)org
Subject: Re: Unicode mapping scripts cleanup
Date: 2015-09-01 23:59:42
Message-ID: 20150902.085942.569773620042330572.t-ishii@sraoss.co.jp
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

> I discovered that some of the source files that one is supposed to
> download don't exist anymore or are labeled obsolete. Also, running the
> scripts produces slight differences in the output. So apparently, the
> CJK to Unicode mappings are still evolving and should be updated
> occasionally. Next steps would be to commit some or all of these
> differences after additional verification, and then update the scripts
> to use whatever the non-obsolete mapping sources are supposed to be.

Some of maps were "hand tweaked" from the output of the script, for
example utf8_to_sjis.map. See git log for more details. This is due to
part of the source file was not incomplete or inappropriate. Also we
needed to compromise while creating a mapping between some local
encodings (for example SJIS) and Unicode, because in the source
mapping file round trip conversion is not guaranteed.

Best regards,
--
Tatsuo Ishii
SRA OSS, Inc. Japan
English: http://www.sraoss.co.jp/index_en.php
Japanese:http://www.sraoss.co.jp

In response to

Browse pgsql-hackers by date

  From Date Subject
Next Message Bruce Momjian 2015-09-02 00:20:47 Re: Shouldn't we document "don't use a mountpoint as $PGDATA"?
Previous Message Bruce Momjian 2015-09-01 23:52:23 Re: Anyone working on the TOAST items on the TODO list?