BUG #5766: regexp \y doesn't work properly when a word starts on ends with a UTF-8 char

From: "Grzegorz Daniluk" <gdaniluk(at)gmail(dot)com>
To: pgsql-bugs(at)postgresql(dot)org
Subject: BUG #5766: regexp \y doesn't work properly when a word starts on ends with a UTF-8 char
Date: 2010-11-24 16:43:43
Message-ID: 201011241643.oAOGhhB5083127@wwwmaster.postgresql.org
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-bugs


The following bug has been logged online:

Bug reference: 5766
Logged by: Grzegorz Daniluk
Email address: gdaniluk(at)gmail(dot)com
PostgreSQL version: 9.0.1
Operating system: Windows 7 64-bit
Description: regexp \y doesn't work properly when a word starts on
ends with a UTF-8 char
Details:

select regexp_replace('Foo Pasaż Bar', E'\\yPasaż\\y', '');

Above query doesn't replace the word 'Pasaż'. It returns full 'Foo Pasaż
Bar' string, when the correct behavior is to return 'Foo Bar'.

When the 'ż' is replaced with normal ASCII character like 'z',
regexp_replace works as expected.

My db details:
ENCODING = 'UTF8'
LC_COLLATE = 'Polish_Poland.1250'
LC_CTYPE = 'Polish_Poland.1250'

Responses

Browse pgsql-bugs by date

  From Date Subject
Next Message Tom Lane 2010-11-24 17:57:12 Re: BUG #5766: regexp \y doesn't work properly when a word starts on ends with a UTF-8 char
Previous Message Kevin Grittner 2010-11-24 15:30:36 Re: BUG #5764: installation error (incomplete process)