Quick Links

Re: invalid UTF-8 via pl/perl

From:	Andrew Dunstan <andrew(at)dunslane(dot)net>
To:	Hannu Krosing <hannu(at)2ndquadrant(dot)com>
Cc:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: invalid UTF-8 via pl/perl
Date:	2010-01-03 19:54:53
Message-ID:	4B40F60D.5050200@dunslane.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

I wrote:
>
> I think the attached patch plugs the direct SPI holes as well.

There are two issues with this patch. First, how far if at all should it
be backpatched? All the way, or 8.3, where we tightened the encoding
rules, or not at all?

Second, It produces errors like this:

andrew=# select 'a' || invalid_utf_seq() || 'b';
ERROR: invalid byte sequence for encoding "UTF8": 0xd0
HINT: This error can also happen if the byte sequence does not
match the encoding expected by the server, which is controlled by
"client_encoding".
CONTEXT: PL/Perl function "invalid_utf_seq"
andrew=#

That hint seems rather misleading. I'm not sure what we can do about it
though. If we set the noError param on pg_verifymbstr() we would miss
the error message that actually identified the bad data, so that doesn't
seem like a good plan.

cheers

andrew

In response to

Re: invalid UTF-8 via pl/perl at 2010-01-03 14:28:29 from Andrew Dunstan

Responses

Re: invalid UTF-8 via pl/perl at 2010-01-03 20:09:39 from David E. Wheeler
Re: invalid UTF-8 via pl/perl at 2010-01-03 20:15:11 from Tom Lane
Re: invalid UTF-8 via pl/perl at 2010-01-03 20:17:11 from Tom Lane

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	David E. Wheeler	2010-01-03 20:07:48	Re: PATCH: Add hstore_to_json()
Previous Message	Andrew Dunstan	2010-01-03 19:40:02	Re: PATCH: Add hstore_to_json()