Re: Concerning about Unicode-aware string handling

From: Vincas Dargis <vindrg(at)gmail(dot)com>
To: pgsql-general(at)postgresql(dot)org, pierce(at)hogranch(dot)com
Subject: Re: Concerning about Unicode-aware string handling
Date: 2012-05-21 13:31:56
Message-ID: CAPNCXk268raC5nHe0PyXHB6gE_XSWjTVnZu+KBkEPQnkNM8xaw@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

Sorry I have to write "manual" replay since I've messed up mailing
list settings (got "Partial Digest"...).

John R Pierce wrote:
> your database encoding is UTF8 ? the language or environment you're using to generate those strings such as 'acząčž' is also UTF8 ?

Database created using:
initdb -D ../data -E utf-8 -U postgres

I have to take back statements about "lower()" and "~*" - they do not
work only for (strange) clusters created with "--no-locale".

But regexp_replace issue is still there. Regexp "\w" understands only
as "ascii word character" ?

Responses

Browse pgsql-general by date

  From Date Subject
Next Message Luca Ferrari 2012-05-21 13:35:55 help understanding the bitmap heap scan costs
Previous Message Vincas Dargis 2012-05-21 13:30:52