BUG #3819: UTF8 can't handle \000

From: "Franklin Schmidt" <fschmidt(at)gmail(dot)com>
To: pgsql-bugs(at)postgresql(dot)org
Subject: BUG #3819: UTF8 can't handle \000
Date: 2007-12-17 07:34:39
Message-ID: 200712170734.lBH7YdG9034458@wwwmaster.postgresql.org
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-bugs


The following bug has been logged online:

Bug reference: 3819
Logged by: Franklin Schmidt
Email address: fschmidt(at)gmail(dot)com
PostgreSQL version: 8.2
Operating system: XP & Linux
Description: UTF8 can't handle \000
Details:

Trying to store \000 in a text field with UTF8 encoding causes an error. I
assume this is because Postgres is written in C, but it's still wrong. A
solution was suggested here:

http://www.nabble.com/invalid-byte-sequence-for-encoding-%22UTF8%22%3A-0x00-
tp9058998p9096326.html

"I can think of some ways the server could support it without extensive
changes .. e.g. use a modified UTF8 representation which stores \u0000 as
0xc0 0x80 internally"

Responses

Browse pgsql-bugs by date

  From Date Subject
Next Message Sergey 2007-12-17 08:50:31 BUG #3820: auto-installer-failed
Previous Message Tom Lane 2007-12-16 17:22:20 Re: BUG #3808: Connections stays open in stateCLOSE_WAIT