Re: Bug in UTF8-Validation Code?

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Mark Dilger <pgsql(at)markdilger(dot)com>
Cc:	pgsql-hackers(at)postgresql(dot)org, andrew(at)supernews(dot)com
Subject:	Re: Bug in UTF8-Validation Code?
Date:	2007-04-02 22:37:11
Message-ID:	29864.1175553431@sss.pgh.pa.us
Lists:	pgsql-hackers

Mark Dilger <pgsql(at)markdilger(dot)com> writes:
>> pgsql=# select chr(14989485);
>> chr
>> -----
>>
>> (1 row)

Is there a principled rationale for this particular behavior as
opposed to any other?

In particular, in UTF8 land I'd have expected the argument of chr()
to be interpreted as a Unicode code point, not as actual UTF8 bytes
with a randomly-chosen endianness.
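
To make the two readings concrete, here is a minimal sketch in Python (an illustration only, not PostgreSQL code; the helper name `packed_int_to_bytes` is hypothetical). It shows that 14989485 is 0xE4B8AD, which is a valid UTF8 byte sequence only when the integer is unpacked most-significant byte first, and that the corresponding Unicode code point is a much smaller number.

```python
def packed_int_to_bytes(value: int, byteorder: str = "big") -> bytes:
    """Unpack an integer into the minimal number of bytes (hypothetical helper)."""
    length = max(1, (value.bit_length() + 7) // 8)
    return value.to_bytes(length, byteorder)


packed = 14989485  # the argument from the quoted example, 0xE4B8AD

# Reading 1: the integer is the UTF8 byte sequence itself, big-endian.
as_bytes = packed_int_to_bytes(packed, "big")
char_from_bytes = as_bytes.decode("utf-8")

# Reading 2: the integer is a Unicode code point.
code_point = ord(char_from_bytes)          # code point of that same character
char_from_code_point = chr(code_point)

# The same three bytes in the opposite order are not valid UTF8 at all,
# because the sequence would start with a continuation byte.
try:
    packed_int_to_bytes(packed, "little").decode("utf-8")
    little_endian_valid = True
except UnicodeDecodeError:
    little_endian_valid = False

# 14989485 is larger than the highest Unicode code point (0x10FFFF),
# so under the code-point reading it does not name any character.
in_code_point_range = packed <= 0x10FFFF

print(hex(packed), as_bytes, hex(code_point), code_point)
```

Under the code-point reading, the same character would be requested with the value of `code_point` (0x4E2D) rather than with the packed byte value.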

Not sure what to do in other multibyte encodings.

regards, tom lane

In response to

Re: Bug in UTF8-Validation Code? at 2007-04-02 20:50:36 from Mark Dilger

Responses

Re: Bug in UTF8-Validation Code? at 2007-04-02 22:02:21 from Mark Dilger
