Re: [rfc] unicode escapes for extended strings

From:	Marko Kreen <markokr(at)gmail(dot)com>
To:	Peter Eisentraut <peter_e(at)gmx(dot)net>
Cc:	Postgres Hackers <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: [rfc] unicode escapes for extended strings
Date:	2009-09-23 19:46:46
Message-ID:	e51f66da0909231246m2cb4276aq3e147c098ea5fd32@mail.gmail.com
Lists:	pgsql-hackers

On 9/23/09, Peter Eisentraut <peter_e(at)gmx(dot)net> wrote:
> On Wed, 2009-09-09 at 18:26 +0300, Marko Kreen wrote:
> > Unicode escapes for extended strings.
>
> Committed.

Thank you for handling the patch.

I looked at your code for U& and saw that you allow a standalone
second half of a surrogate pair there, although you error
out on a standalone first half. Was that deliberate?
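
To make the question concrete, here is a minimal sketch of strict pairing, where both kinds of lone half are rejected. It is an illustration only, not the committed U& code; the function name combine_escapes and its list-of-integers input are hypothetical.

```python
def combine_escapes(units):
    """Combine 16-bit escape values into code points.

    Hypothetical helper: raises ValueError on any surrogate half
    that is not part of a well-formed (first, second) pair.
    """
    out = []
    pending_first = None  # first half waiting for its second half
    for u in units:
        if 0xD800 <= u <= 0xDBFF:          # first (high) half
            if pending_first is not None:
                raise ValueError("first half not followed by second half")
            pending_first = u
        elif 0xDC00 <= u <= 0xDFFF:        # second (low) half
            if pending_first is None:
                raise ValueError("standalone second half")
            out.append(0x10000 + ((pending_first - 0xD800) << 10) + (u - 0xDC00))
            pending_first = None
        else:                              # ordinary BMP value
            if pending_first is not None:
                raise ValueError("first half not followed by second half")
            out.append(u)
    if pending_first is not None:
        raise ValueError("standalone first half at end of input")
    return out
```

With this rule a second half is only accepted immediately after a first half, so the two error cases are treated symmetrically.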

Standalone surrogate halves cause headaches for anything that wants to
handle data in UTF-16. The area 0xD800-0xDFFF is explicitly reserved
for UTF-16 encoding and does not contain any valid Unicode codepoints.

Perhaps pg_verifymbstr() should be made to check for such values,
because even if we fix the escaping code, such data can still be
inserted via plain UTF-8 or \x escapes?
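
As a rough sketch of what such a check amounts to at the byte level: in UTF-8, the values 0xD800-0xDFFF encode as a 0xED lead byte followed by a byte in 0xA0-0xBF. The function below is hypothetical, is not pg_verifymbstr(), and assumes the input is otherwise structurally valid UTF-8.

```python
def find_utf8_surrogate(data: bytes) -> int:
    """Return the byte offset of the first UTF-8-encoded surrogate, or -1.

    Assumes otherwise well-formed UTF-8, where 0xED can only be a lead
    byte. 0xED 0x80..0x9F covers U+D000-U+D7FF (fine); 0xED 0xA0..0xBF
    covers U+D800-U+DFFF (surrogates).
    """
    for i in range(len(data) - 1):
        if data[i] == 0xED and 0xA0 <= data[i + 1] <= 0xBF:
            return i
    return -1
```

For example, find_utf8_surrogate(b"a\xed\xa0\x80") flags the encoded 0xD800 at offset 1, while the encoding of U+D7FF (b"\xed\x9f\xbf") passes.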

--
marko

In response to

Re: [rfc] unicode escapes for extended strings at 2009-09-22 23:53:07 from Peter Eisentraut

Responses

Re: [rfc] unicode escapes for extended strings at 2009-09-24 18:42:32 from Peter Eisentraut
