Re: UNICODE/UTF-8 on win32

From: Tatsuo Ishii <t-ishii(at)sra(dot)co(dot)jp>
To: mha(at)sollentuna(dot)net
Cc: tgl(at)sss(dot)pgh(dot)pa(dot)us, pgsql-hackers-win32(at)postgresql(dot)org, pgsql-hackers(at)postgresql(dot)org
Subject: Re: UNICODE/UTF-8 on win32
Date: 2005-01-03 00:48:51
Message-ID: 20050103.094851.123968739.t-ishii@sra.co.jp
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers pgsql-hackers-win32

> >I do understand the problem, but don't undertstand the decision you
> >guys made. The fact that UPPER/LOWER and some other functions does not
> >work in win32 is surely a problem for some languages, but not a
> >problem for otheres. For example, Japanese (and probably Chinese and
> >Korean) does not have a concept upper/lower. So the fact UPPER/LOWER
> >does not work with UTF-8/win32 is not problem for Japanese (and for
> >some other languages). Just using C locale with UTF-8 is enough in
> >this case.
>
> The main issue is not with upper/lower, it's with ORDER BY (and doesn't
> that affect indexes as well). This affects Japanese as well, no?

As long as used with C locale, indexes should be ok. ORDER BY is not
perfect but we can live with it. Since Japanese is an ideogram, we
cannot rely on ORDER BY character codes to sort Japanese characters
anyway. I believe same thing can be said to Chinese.

> I didn't consider the C locale. Do you know for a fact that it works
> there on win32 as well, or is that an assumption? (I don't know either
> way)

I have not tested 8.0 on win32, but I think it should work with C
locale since I know PowerGres, which is based on 7.4, works.

> >In summary, I think you guys are going to overkill the multibyte
> >support functionality on UTF-8/win32 because of the fact that some
> >langauges do not work.
>
> I was under the impression that *no* languages worked. If some do work,
> then we definitly should not kill it.
>
> It would be good to have some way of detecting if it worked or not at
> the time of creation of the database. But I have no idea on how to do
> that in a reasonable way.
--
Tatsuo Ishii

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Marc G. Fournier 2005-01-03 01:20:31 Re: PostgreSQL 8.0.0 Release Candidate 3
Previous Message Tatsuo Ishii 2005-01-03 00:48:02 Re: UNICODE/UTF-8 on win32

Browse pgsql-hackers-win32 by date

  From Date Subject
Next Message Jonathan Barnhart 2005-01-03 12:16:39 Any chance of a merge module?
Previous Message Tatsuo Ishii 2005-01-03 00:48:02 Re: UNICODE/UTF-8 on win32