UTF16 surrogate pairs in UTF8 encoding

From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: pgsql-hackers(at)postgreSQL(dot)org
Subject: UTF16 surrogate pairs in UTF8 encoding
Date: 2010-08-22 18:29:20
Message-ID: 19456.1282501760@sss.pgh.pa.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

I just noticed that we are now advertising the ability to insert UTF16
surrogate pairs in strings and identifiers (see section 4.1.2.2 in
current docs, in particular). Is this really wise? I thought that
surrogate pairs were specifically prohibited in UTF8 strings, because
of the security hazards implicit in having more than one way to
represent the same code point.

regards, tom lane

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Stephen Frost 2010-08-22 19:08:08 Re: security label support, part.2
Previous Message Erik Rijkers 2010-08-22 17:53:11 Re: pg_archivecleanup debug message consistency