Re: UNICODE characters above 0x10000

From: "John Hansen" <john(at)geeknet(dot)com(dot)au>
To: <olly(at)lfix(dot)co(dot)uk>
Cc: "Hackers" <pgsql-hackers(at)postgresql(dot)org>, "Patches" <pgsql-patches(at)postgresql(dot)org>
Subject: Re: UNICODE characters above 0x10000
Date: 2004-08-07 23:31:18
Message-ID: 5066E5A966339E42AA04BA10BA706AE56175@rodrick.geeknet.com.au

> -----Original Message-----
> From: Oliver Elphick [mailto:olly(at)lfix(dot)co(dot)uk]
> Sent: Sunday, August 08, 2004 7:43 AM
> To: Tom Lane
> Cc: John Hansen; Hackers; Patches
> Subject: Re: [HACKERS] UNICODE characters above 0x10000
>
> On Sat, 2004-08-07 at 07:10, Tom Lane wrote:
> > Oliver Elphick <olly(at)lfix(dot)co(dot)uk> writes:
> > > glibc provides various routines (mb...) for handling Unicode. How
> > > many of our supported platforms don't have these?
> >
> > Every one that doesn't use glibc. Don't bother proposing a glibc-only
> > solution (and that's from someone who works for a glibc-only company;
> > you don't even want to think about the push-back you'll get from other
> > quarters).
>
> No, that's not what I was proposing. My suggestion was to
> use these routines if they are sufficiently widely
> implemented, and our own routines where standard ones are not
> available.
>
> The man page for mblen says
> "CONFORMING TO
> ISO/ANSI C, UNIX98"
>
> Is glibc really the only C library to conform?
>
> If using the mb... routines isn't feasible, IBM's ICU library
> (http://oss.software.ibm.com/icu/) is available under the X
> licence, which is compatible with BSD as far as I can see.
> Besides character conversion, ICU can also do collation in
> various locales and encodings.
> My point is, we shouldn't be writing a new set of routines to
> do half a job if there are already libraries available to do
> all of it.
>

This sounds like a brilliant move, if anything.
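
As a rough illustration of the approach quoted above (use the standard mb... routines where they exist, our own where they don't), here is a minimal sketch. It is not code from this thread or from the PostgreSQL tree: utf8_seq_len, char_len and the HAVE_MBLEN macro are hypothetical names, and the fallback only classifies the lead byte without validating continuation bytes.

```c
#include <stddef.h>
#include <stdlib.h>   /* mblen */

/*
 * Hypothetical fallback: byte length of the UTF-8 sequence that starts
 * at s, derived from the lead byte only. Returns -1 for a byte that
 * cannot start a sequence. Continuation bytes are NOT validated here.
 */
static int
utf8_seq_len(const unsigned char *s)
{
    if (s[0] < 0x80)
        return 1;                       /* ASCII */
    if (s[0] >= 0xC2 && s[0] <= 0xDF)
        return 2;                       /* U+0080 .. U+07FF */
    if (s[0] >= 0xE0 && s[0] <= 0xEF)
        return 3;                       /* U+0800 .. U+FFFF */
    if (s[0] >= 0xF0 && s[0] <= 0xF4)
        return 4;                       /* U+10000 .. U+10FFFF */
    return -1;                          /* continuation or invalid lead byte */
}

/*
 * Hypothetical wrapper: call the C library's mblen() when the build
 * declares it usable (HAVE_MBLEN is an assumed configure-style macro),
 * otherwise fall back to the routine above.
 */
static int
char_len(const char *s, size_t remaining)
{
#ifdef HAVE_MBLEN
    /* Result depends on the current locale's LC_CTYPE setting. */
    return mblen(s, remaining);
#else
    int len = utf8_seq_len((const unsigned char *) s);

    /* Reject invalid lead bytes and sequences that run past the buffer. */
    if (len < 0 || (size_t) len > remaining)
        return -1;
    return len;
#endif
}
```

Note that the two branches are not exact equivalents: mblen() follows the process locale rather than a fixed encoding, and it returns 0 for a null byte where the fallback returns 1. That difference is part of what would have to be settled before relying on the library routines.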

> --
> Oliver Elphick
> olly(at)lfix(dot)co(dot)uk
> Isle of Wight
> http://www.lfix.co.uk/oliver
> GPG: 1024D/A54310EA 92C8 39E7 280E 3631 3F0E 1EC0 5664 7A2F
> A543 10EA
> ========================================
> "Be still before the LORD and wait patiently for him;
> do not fret when men succeed in their ways, when they
> carry out their wicked schemes."
> Psalms 37:7
>
>
>

Kind Regards,

John Hansen
