Re: Encoding, Unicode, locales, etc.

From: Karsten Hilbert <Karsten(dot)Hilbert(at)gmx(dot)net>
To: pgsql-general(at)postgresql(dot)org
Subject: Re: Encoding, Unicode, locales, etc.
Date: 2006-11-01 22:52:06
Message-ID: 20061101225206.GA5129@merkur.hilbert.loc
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

On Wed, Nov 01, 2006 at 08:50:30PM +0100, Martijn van Oosterhout wrote:

> > Could this paragraph be put into the docs and/or the FAQ,
> > please ? Along with the recommendation that if you require
> > multiple encodings for your databases you better had your OS
> > locale configured properly for UTF8 and use UNICODE
> > databases or do initdb with the C-locale.
>
> Err, multiple encodings don't work full-stop.
Well, yes, I was thinking of multiple client encodings which
can be supported either via a C-locale-initdb with the
databases set to the encoding you require (but sorting/etc
won't work, I know) or by doing a unicode-initdb and using
unicode databases. In each case the client encodings can be
"multiple" ones - as long as conversion is possible. Sorting
etc may still be wrong, but at least the proper characters
are going in and coming back.

> Any particular locale (as
> defined by POSIX) is only really designed to work with one encoding.
Sure. What I meant is that if you have a unicode database
you can use several client encodings and get back the
properly encoded characters.

> The fact that the C locale produces an order when sorting UTF8 text is
> really just luck.
Yes.

> > Here are a few data points from my Debian/Testing system in
> > favour of not worrying too much about installed ICU size as
> > it is being used by other packages anyways:
>
> We'd need a suitable patch first before we start worrying about that. I
> think diskspace is less of an issue now.
Well, size did come up in a "recent" discussion so I thought
I'd mention the above facts.

Karsten
--
GPG key ID E4071346 @ wwwkeys.pgp.net
E167 67FD A291 2BEA 73BD 4537 78B9 A9F9 E407 1346

In response to

Browse pgsql-general by date

  From Date Subject
Next Message Oleg Bartunov 2006-11-01 23:22:01 Re: [HACKERS] Index greater than 8k
Previous Message Alvaro Herrera 2006-11-01 22:16:37 Re: [HACKERS] Index greater than 8k