Re: sort order (ORDER BY) hu_HU.UTF-8 locale with UTF-8 encoding is not working correctly on FreeBSD

From: Adam PAPAI <adam(dot)papai(at)bsdsupportservice(dot)hu>
To: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc: pgsql-bugs(at)postgresql(dot)org
Subject: Re: sort order (ORDER BY) hu_HU.UTF-8 locale with UTF-8 encoding is not working correctly on FreeBSD
Date: 2011-02-06 16:38:59
Message-ID: 4D4ECEA3.5040203@bsdsupportservice.hu
Lists: pgsql-bugs

Tom Lane wrote:
> Adam PAPAI <adam(dot)papai(at)bsdsupportservice(dot)hu> writes:
>> [ hu_HU.UTF-8 locale doesn't sort as expected on ]
>> FreeBSD titanium 8.1-RELEASE FreeBSD 8.1-RELEASE #0: Mon Jul 19 02:36:49
>> UTC 2010 root(at)mason(dot)cse(dot)buffalo(dot)edu:/usr/obj/usr/src/sys/GENERIC amd64
>
> Do you get the expected sort ordering from the "sort" command either?
> I'm betting not. This is not something Postgres controls -- we just
> rely on the results of strcoll().
>
> I don't know the state of affairs on FreeBSD for sure, but I do know
> that the UTF-8 locales are fairly broken on Mac OS X, which is a BSD
> derived platform too. It looks like nobody in BSD-land has bothered to
> put in the work to make multibyte characters sort properly.
>
> You might have better luck with hu_HU using a suitable LATINn character
> set instead of UTF8. If that still doesn't do what you want, you might
> have no choice but to move off BSD :-(
>
> regards, tom lane

Hi Tom,

I've tested the "sort" command. It fails too.

So the problem probably really is with *BSD and not with PostgreSQL.
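
For anyone who wants to repeat the check outside of both PostgreSQL and
"sort", here is a minimal sketch, assuming Python 3 is installed. It sorts
a list through the C library's strcoll(), the same call Tom mentions above.
The helper name sort_with_locale is made up for this illustration, and it
returns None when the requested locale is not installed.

```python
import functools
import locale


def sort_with_locale(words, locale_name):
    """Sort words with the C library's strcoll() under locale_name.

    Hypothetical helper for illustration only. Returns None if the
    locale is not installed on this system.
    """
    # Remember the current collation setting so it can be restored.
    saved = locale.setlocale(locale.LC_COLLATE)
    try:
        locale.setlocale(locale.LC_COLLATE, locale_name)
    except locale.Error:
        return None
    try:
        # locale.strcoll() defers to the platform's collation routine,
        # so the result reflects the OS locale data, not Python's own rules.
        return sorted(words, key=functools.cmp_to_key(locale.strcoll))
    finally:
        locale.setlocale(locale.LC_COLLATE, saved)
```

Calling sort_with_locale(words, "hu_HU.UTF-8") and
sort_with_locale(words, "C") on a list of accented Hungarian words and
comparing the two results should show whether the platform's UTF-8 locale
collates any differently from plain byte order.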

Thanks,

--
Adam PAPAI
BSD Support Service
http://www.bsdsupportservice.hu
E-mail: adam(dot)papai(at)bsdsupportservice(dot)hu
Phone: +36 30 33-55-735 (Hungary)
