Re: UTF-8 and LIKE vs =

From: Ian Barwick <barwick(at)gmail(dot)com>
To: David Wheeler <david(at)kineticode(dot)com>
Cc: pgsql-general(at)postgresql(dot)org
Subject: Re: UTF-8 and LIKE vs =
Date: 2004-08-24 00:07:38
Message-ID: 1d581afe04082317071dfca8ab@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

On Mon, 23 Aug 2004 16:50:04 -0700, David Wheeler <david(at)kineticode(dot)com> wrote:
> On Aug 23, 2004, at 4:34 PM, Ian Barwick wrote:
>
> > wild speculation in need of a Korean speaker, but:
> >
> > ian(at)linux:~/tmp> cat j.txt
> > テスト
> > 환경설
> > 전검색
> > 웹문서
> > 국방비
> > 북한의
> > てすと
> > ian(at)linux:~/tmp> uniq j.txt
> > テスト
> > 환경설
> > てすと
> >
> > All but the first and last lines are random Korean (Hangul)
> > characters. Evidently our respective locales think all Hangul strings
> > of the same length are identical, which is very probably not the
> > case...
>
> Does this go away if you change your locale to C?

Yes.

Ian Barwick

In response to

Responses

Browse pgsql-general by date

  From Date Subject
Next Message David Wheeler 2004-08-24 00:13:03 Re: UTF-8 and LIKE vs =
Previous Message David Wheeler 2004-08-23 23:51:32 Re: UTF-8 and LIKE vs =