Re: \w doesn't match non-ASCII letters

From: Markus Bertheau <twanger(at)bluetwanger(dot)de>
To: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc: Peter Eisentraut <peter_e(at)gmx(dot)net>, pgsql-bugs(at)postgresql(dot)org
Subject: Re: \w doesn't match non-ASCII letters
Date: 2004-06-14 16:36:49
Message-ID: 1087231009.2649.1.camel@teetnang
Lists: pgsql-bugs

On Mon, 14.06.2004, at 17:25, Tom Lane wrote:
> Peter Eisentraut <peter_e(at)gmx(dot)net> writes:
> > Markus Bertheau wrote:
> >> oocms=# select 'ф' ~ '^\\w$';
> >> ?column?
> >> ----------
> >> f
> >> (1 row)
>
> > What locale are you using for LC_COLLATE? If it's C or POSIX, you need
> > to change it and re-initdb.
>
> Another likely cause of trouble is that the regexp character
> classification stuff is presently based on <ctype.h> functions and thus
> cannot work in multibyte encodings.

This is a UTF-8 database, so yes, these are multibyte characters. Are
there any plans to support UTF-8 in regexps?
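
To illustrate why byte-wise classification fails here, below is a rough
sketch in Python. It is only an analogy and not PostgreSQL's regex code: it
assumes that a bytes pattern in Python's re module, which classifies one
ASCII byte at a time, behaves roughly like a <ctype.h>-based \w, and that a
str pattern stands in for classification by whole character. The variable
names are made up for the example.

```python
import re

ch = "ф"                    # CYRILLIC SMALL LETTER EF, U+0444
raw = ch.encode("utf-8")    # two bytes in UTF-8: 0xD1 0x84

# Byte-oriented matching: \w in a bytes pattern only knows [a-zA-Z0-9_],
# and the letter is two bytes long, so a single \w cannot cover it.
bytewise_match = re.fullmatch(rb"\w", raw) is not None

# Even looking at each byte on its own, neither one counts as a word character.
per_byte = [re.fullmatch(rb"\w", bytes([b])) is not None for b in raw]

# Character-oriented matching: the decoded string is one code point,
# and \w in a str pattern uses Unicode character classification.
codepoint_match = re.fullmatch(r"\w", ch) is not None

print(raw, bytewise_match, per_byte, codepoint_match)
```

The byte-wise checks reject the letter while the character-wise check
accepts it, which is the same shape as the 'f' result quoted above.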

--
Markus Bertheau <twanger(at)bluetwanger(dot)de>
