Re: Unicode upper() bug still present

From: Hannu Krosing <hannu(at)tm(dot)ee>
To: Karel Zak <zakkr(at)zf(dot)jcu(dot)cz>
Cc: Peter Eisentraut <peter_e(at)gmx(dot)net>, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, Tatsuo Ishii <t-ishii(at)sra(dot)co(dot)jp>, "pgsql-hackers(at)postgresql(dot)org" <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: Unicode upper() bug still present
Date: 2003-10-21 08:44:54
Message-ID: 1066725893.7917.5.camel@fuji.krosing.net
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

Karel Zak kirjutas T, 21.10.2003 kell 10:50:
> On Mon, Oct 20, 2003 at 10:58:00PM +0200, Peter Eisentraut wrote:
>
> > (Note that I say Unicode a lot here because those people do a lot of
> > research and standardization in this area, which is available for free,
> > but this does not constrain the result to work only with the Unicode
> > character set.)
>
> Why cannot do PostgreSQL as 100% pure Unicode system? We can do
> conversion from/to others encodings as client/server communication
> extension, but internaly in BE we can use only pure Unicode data. I
> think a lot of things will more simple...

I've heard that some far-east languages have had some issues with 16-bit
UNICODE, but the 32-bit should have fixed it.

I would also support a move to UNICODE (store as SCSU, process as 16 or
32 bit wchars, i/o as UTF-8) for NCHAR/NVARCHAR/NTEXT and pure 7-bit
byte-value ordered ASCII for CHAR/VARCHAR/TEXT.

But this would surely have some issues with backward compatibility.

------------
Hannu

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Tatsuo Ishii 2003-10-21 09:07:13 Re: Unicode upper() bug still present
Previous Message Karel Zak 2003-10-21 07:50:30 Re: Unicode upper() bug still present