Re: Collate order on Mac OS X, text with diacritics in UTF-8

From: Greg Stark <gsstark(at)mit(dot)edu>
To: Craig Ringer <craig(at)postnewspapers(dot)com(dot)au>
Cc: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, Martin Flahault <martin(at)billjobs(dot)com>, pgsql-general(at)postgresql(dot)org
Subject: Re: Collate order on Mac OS X, text with diacritics in UTF-8
Date: 2010-01-16 21:10:53
Message-ID: 407d949e1001161310m3c2ff9d6nc4b3a498f75465c1@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

On Thu, Jan 14, 2010 at 6:15 AM, Craig Ringer
<craig(at)postnewspapers(dot)com(dot)au> wrote:
>>> Out of interest: Why not?
>>
>> There's plenty of discussion in the archives about it, but basically
>> ICU would represent a pretty enormous dependency and would lock us in
>> to having no other backend encoding but UTF8.
>
> Thanks. You're right - I should've just STFA ;-) so I appreciate the
> brief explanation.

There's also the question of whether being inconsistent with the rest
of the system is really the right thing at all. If a programmer does a
sort in the database and then writes application code using the same
collation on the same system which depends on the data being sorted
it's nice that that works. Or if an admin presorts the data using sort
before doing a data load he might reasonable expect the table to be
effectively clustered.

Switching to ICU means trading our current inconsistency from platform
to platform for a different inconsistency which would be better in
some cases and worse in others.

--
greg

In response to

Responses

Browse pgsql-general by date

  From Date Subject
Next Message Joel Alpers 2010-01-16 21:44:35 Trouble connecting to database using PQconnectdb (C/libpq)
Previous Message Ivan Sergio Borgonovo 2010-01-16 20:57:03 Re: OR tsquery