Re: BUG #1931: ILIKE and LIKE fails on Turkish locale

From: Bruce Momjian <bruce(at)momjian(dot)us>
To: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc: Victor Snezhko <snezhko(at)indorsoft(dot)ru>, Volkan YAZICI <yazicivo(at)ttnet(dot)net(dot)tr>, pgsql-bugs(at)postgresql(dot)org
Subject: Re: BUG #1931: ILIKE and LIKE fails on Turkish locale
Date: 2006-10-02 21:23:25
Message-ID: 200610022123.k92LNPC25998@momjian.us
Lists: pgsql-bugs pgsql-tr-genel


Has this been addressed?

---------------------------------------------------------------------------

Tom Lane wrote:
> Victor Snezhko <snezhko(at)indorsoft(dot)ru> writes:
> > My FreeBSD lists a whole heck of characters:
>
> > character 0x85 is a space
> > character 0xa0 is a space
> > character 0xaa is alphabetical
> > character 0xb5 is alphabetical
> > character 0xba is alphabetical
> > character 0xc0 is alphabetical
> > ... 0xc1-0xfe is alphabetical
> > character 0xff is alphabetical
>
> Hm. I'm still thinking that this behavior is wrong for UTF8 encoding,
> but it would be reasonable in LATINn and related encodings, so we
> probably ought to do something about it.
>
> After further thought, it's not so much that we can't tolerate
> locale-dependent behavior of isspace() in general, as that in this
> particular case we are expecting it to match the scanner's idea
> of a space: scan.l has
>
> space [ \t\n\r\f]
>
> which obviously is not locale-aware. I think we need convert_ident to
> use a plpgsql_isspace() that accepts these and only these as spaces.
> Any high-bit-set byte is part of an identifier according to scan.l's
> rules, and convert_ident must have the same behavior regardless of locale.
>
> There may be related risks in and around the other flex scanners
> ... will look.
>
> regards, tom lane
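
For reference, the locale-independent check proposed in the quoted message could look roughly like the following. This is only a sketch under assumptions: the name plpgsql_isspace() comes from the message, but the signature and body here are hypothetical and are not taken from the PostgreSQL sources. It accepts exactly the characters in scan.l's "space" class and nothing else, so the result does not depend on the current locale.

```c
#include <stdbool.h>

/*
 * Hypothetical sketch, not the committed PostgreSQL code.
 *
 * Returns true only for the characters in scan.l's "space" class,
 * [ \t\n\r\f]. Unlike isspace(), it never consults the locale, so
 * high-bit-set bytes such as 0x85 or 0xa0 are never treated as spaces
 * and remain available as identifier bytes.
 */
bool
plpgsql_isspace(unsigned char c)
{
    return c == ' ' || c == '\t' || c == '\n' || c == '\r' || c == '\f';
}
```

With a check like this, the bytes 0x85 and 0xa0 that the FreeBSD isspace() reports as spaces would be rejected, and so would '\v', which isspace() accepts even in the C locale but which is not in scan.l's class.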

--
Bruce Momjian bruce(at)momjian(dot)us
EnterpriseDB http://www.enterprisedb.com

+ If your life is a hard drive, Christ can be your backup. +
