Re: Unicode problems on IRC

From: Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
To: Christopher Kings-Lynne <chriskl(at)familyhealth(dot)com(dot)au>
Cc: pgsql-hackers(at)postgresql(dot)org
Subject: Re: Unicode problems on IRC
Date: 2005-04-09 22:17:48
Message-ID: 200504092217.j39MHmq28772@candle.pha.pa.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

Christopher Kings-Lynne wrote:
> Hey guys,
>
> The 'Unicode characters above 0x10000' issue keeps rearing its ugly head
> in the IRC channel. I propose that it be fixed, even backported...
>
> This is John Hansen's most recent patch to fix it:
>
> http://archives.postgresql.org/pgsql-patches/2004-11/msg00259.php
>
> And from what I can tell it was committed, then reverted because it
> wasn't a "bug". It was going to go in for 8.1.
>
> We on the channel are starting to think that it is in fact a bug. There
> are are people with legitimately utf-8 encoded XML documents that they
> cannot store in PostgreSQL. Apparently in the distant past, Unicode was
> limited to 0x10000, but then was extended.
>
> Perhaps we can reopen this case...

Uh, I thought we fixed this another way, buy not using Unicode-aware
functions for upper/lower/initcap when the locale is "C" or "POSIX".
That is backpatched to 8.0.X. Does that not fix the problem reported?

--
Bruce Momjian | http://candle.pha.pa.us
pgman(at)candle(dot)pha(dot)pa(dot)us | (610) 359-1001
+ If your life is a hard drive, | 13 Roberts Road
+ Christ can be your backup. | Newtown Square, Pennsylvania 19073

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Andrew - Supernews 2005-04-10 00:03:36 Re: Unicode problems on IRC
Previous Message juan 2005-04-09 19:02:34 Case Sensitivity