Re: [9.1beta1] UTF-8/Regex Word-Character Definition excluding accented letters

From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: "David Johnston" <polobo(at)yahoo(dot)com>
Cc: pgsql-general(at)postgresql(dot)org
Subject: Re: [9.1beta1] UTF-8/Regex Word-Character Definition excluding accented letters
Date: 2011-05-31 02:39:48
Message-ID: 15451.1306809588@sss.pgh.pa.us
Lists: pgsql-general
"David Johnston" <polobo(at)yahoo(dot)com> writes:
> PostgreSQL 9.1beta1, compiled by Visual C++ build 1500, 64-bit (EnterpriseDB
> Install Executable)

> CREATE DATABASE betatest 
>                 TEMPLATE template0 
>                 ENCODING 'UTF8'
>                 LC_COLLATE 'C' 
>                 LC_CTYPE 'C';

> CREATE DOMAIN idcode AS text
>                 NOT NULL CHECK (VALUE ~* '^\w[-:\w]*$')
> ;

> SELECT 'AAAAAaaaaa'::idcode; // -> SQL Error: ERROR:  value for domain
> idcode violates check constraint "idcode_check" (note the accented e
> between all the As)

AFAICS that's correct behavior.  C locale should not think that é is
a letter.

> This is running just fine against a 9.0 install on the same machine.

We made some strides towards getting locale-sensitive stuff to work as
it "should" in 9.1.  In particular, platform-specific creative
interpretations of what C locale means shouldn't happen anymore ...

			regards, tom lane
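
A minimal sketch of one possible workaround, assuming the database keeps LC_CTYPE 'C' as in the report: since \w then covers only ASCII letters, digits, and underscore, any accented letters the domain should accept have to be listed explicitly in the bracket expressions. The domain name idcode_explicit and the choice of é/É are hypothetical, picked only for illustration; this is not something proposed in the thread itself.

```sql
-- Probe how the current database classifies a character.
-- Per the explanation above, under LC_CTYPE 'C' in 9.1 the accented
-- letter is not expected to count as a word character.
SELECT 'e' ~ '^\w$' AS plain_e_is_word_char,
       'é' ~ '^\w$' AS accented_e_is_word_char;

-- Hypothetical variant of the domain that names the extra letters
-- explicitly instead of relying on the locale's idea of \w.
-- Both cases are listed because case folding of non-ASCII letters
-- may not apply under C locale.
CREATE DOMAIN idcode_explicit AS text
    NOT NULL CHECK (VALUE ~* '^[\wéÉ][-:\wéÉ]*$');

-- A value with an accented e should now satisfy the check.
SELECT 'AéA'::idcode_explicit;
```

The other route, if the accented letters really are meant to be letters, is to create the database with an LC_CTYPE that matches the data rather than 'C'.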