Re: The dangers of streaming across versions of glibc: A cautionary tale

From: Peter Geoghegan <peter(dot)geoghegan86(at)gmail(dot)com>
To: Tatsuo Ishii <ishii(at)postgresql(dot)org>
Cc: Bruce Momjian <bruce(at)momjian(dot)us>, Matthew Kelly <mkelly(at)tripadvisor(dot)com>, PGSQL Mailing List <pgsql-general(at)postgresql(dot)org>, Matthew Spilich <mspilich(at)tripadvisor(dot)com>
Subject: Re: The dangers of streaming across versions of glibc: A cautionary tale
Date: 2014-08-07 01:46:49
Message-ID: CAEYLb_WvdCzuL=Cyf1xyzjwn-1CVo6kZEaWMKbxTS3jPhtjOig@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

On Wed, Aug 6, 2014 at 6:30 PM, Tatsuo Ishii <ishii(at)postgresql(dot)org> wrote:
> Another idea could be having our own collation data to isolate any
> changes from outside world. I vaguley recall this had been discussed
> before.

That's probably the best solution. It would not be the first time that
we decided to stop relying on the operating system's facilities due to
various problems (e.g. we used to use the C standard library qsort()
until about 2006). The only problem is that it's a lot of work. One
possible solution that has been proposed is to adopt ICU [1]. That
might allow us to say "this is the official way that PostgreSQL 9.6
sorts Japanese; you may use the old way if you want, but it's
incompatible with the new way". ICU would give us a standard
versioning interface [2]. They seem to take this seriously, and are
aware of our considerations around B-Tree indexes on text.

[1] https://wiki.postgresql.org/wiki/Todo:ICU
[2] http://userguide.icu-project.org/collation/architecture#TOC-Versioning
--
Regards,
Peter Geoghegan

In response to

Responses

Browse pgsql-general by date

  From Date Subject
Next Message Phoenix Kiula 2014-08-07 01:52:03 Re: Reindex taking forever, and 99% CPU
Previous Message Tatsuo Ishii 2014-08-07 01:30:48 Re: The dangers of streaming across versions of glibc: A cautionary tale