Re: langauges, locales, regex, LIKE

From: Joel Matthew <rees(at)ddcom(dot)co(dot)jp>
To: pgsql-general(at)postgresql(dot)org
Subject: Re: langauges, locales, regex, LIKE
Date: 2004-06-24 06:35:16
Message-ID: 20040624150226.FD0A.REES@ddcom.co.jp
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

> If I've read everything right, in order to get:
>
> multiple languages on a site
>
> with the functionality of ALL of:
>
> REGEX
> LIKE
> Correctly sorted text
>
> ...

You might want to look at this:

http://www.unicode.org/reports/tr10/

to get an idea of where things stand at the Unicode Consortium.

But, for sorting mixed content, what sort order should take effect
between "now" and 「現時点」 (the latter being Japanese)?

A little more down-to-earth, would you want "genjiten" and the hiragana
equivalent, 「げんじてん」 to fold together in the collation? You
definitely would need a Japanese locale for that, if it ever could work.
(I'm not sure it could work, unless you had some way to specify the
method of romanization in a sublocale or something.)

And what happens when you mix "genjiten" with "genealogy"? Or the US
English name "Moench" with the German name using the correct character
for the "oe", with the romanized Japanese "moeru", and the same (「もえ
る」) in hiragana?

--
Joel <rees(at)ddcom(dot)co(dot)jp>
Just ranting, ignore me.

In response to

Browse pgsql-general by date

  From Date Subject
Next Message Scott Marlowe 2004-06-24 06:41:25 Re: clusters and database instances
Previous Message Dennis Gearon 2004-06-24 06:10:38 and here is a free OSS library to do ....