Quick Links

Re: Unicode upper() bug still present

From:	Hannu Krosing <hannu(at)tm(dot)ee>
To:	Karel Zak <zakkr(at)zf(dot)jcu(dot)cz>
Cc:	Peter Eisentraut <peter_e(at)gmx(dot)net>, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, Tatsuo Ishii <t-ishii(at)sra(dot)co(dot)jp>, "pgsql-hackers(at)postgresql(dot)org" <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Unicode upper() bug still present
Date:	2003-10-21 08:44:54
Message-ID:	1066725893.7917.5.camel@fuji.krosing.net
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

Karel Zak kirjutas T, 21.10.2003 kell 10:50:
> On Mon, Oct 20, 2003 at 10:58:00PM +0200, Peter Eisentraut wrote:
>
> > (Note that I say Unicode a lot here because those people do a lot of
> > research and standardization in this area, which is available for free,
> > but this does not constrain the result to work only with the Unicode
> > character set.)
>
> Why cannot do PostgreSQL as 100% pure Unicode system? We can do
> conversion from/to others encodings as client/server communication
> extension, but internaly in BE we can use only pure Unicode data. I
> think a lot of things will more simple...

I've heard that some far-east languages have had some issues with 16-bit
UNICODE, but the 32-bit should have fixed it.

I would also support a move to UNICODE (store as SCSU, process as 16 or
32 bit wchars, i/o as UTF-8) for NCHAR/NVARCHAR/NTEXT and pure 7-bit
byte-value ordered ASCII for CHAR/VARCHAR/TEXT.

But this would surely have some issues with backward compatibility.

------------
Hannu

In response to

Re: Unicode upper() bug still present at 2003-10-21 07:50:30 from Karel Zak

Responses

Automatic conversion from Unicode at 2003-10-21 14:50:54 from Michael Brusser

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Tatsuo Ishii	2003-10-21 09:07:13	Re: Unicode upper() bug still present
Previous Message	Karel Zak	2003-10-21 07:50:30	Re: Unicode upper() bug still present