Re: BUG #3819: UTF8 can't handle \000

From: Bruce Momjian <bruce(at)momjian(dot)us>
To: Franklin Schmidt <fschmidt(at)gmail(dot)com>
Cc: pgsql-bugs(at)postgresql(dot)org
Subject: Re: BUG #3819: UTF8 can't handle \000
Date: 2007-12-17 09:28:57
Message-ID: 200712170928.lBH9SvG23916@momjian.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-bugs

Franklin Schmidt wrote:
> On Dec 17, 2007 12:54 AM, Bruce Momjian <bruce(at)momjian(dot)us> wrote:
> >
> > Uh, as far as I know 0x00 is not a valid UTF8 byte value.
>
>
> I think it is a valid value. RFC 3629 says:
>
> "Character numbers from U+0000 to U+007F (US-ASCII repertoire)
> correspond to octets 00 to 7F (7 bit US-ASCII values)."
>
> http://www.faqs.org/rfcs/rfc3629.html

Well, I realize 0x00 is a valid ASCII value and therefore a valid UTF8
value but we have never had anyone complain they can't store the 0x00
character because it doesn't mean anything in ASCII. They use bytea to
store binary data like 0x00.

--
Bruce Momjian <bruce(at)momjian(dot)us> http://momjian.us
EnterpriseDB http://postgres.enterprisedb.com

+ If your life is a hard drive, Christ can be your backup. +

In response to

Responses

Browse pgsql-bugs by date

  From Date Subject
Next Message Franklin Schmidt 2007-12-17 09:50:15 Re: BUG #3819: UTF8 can't handle \000
Previous Message Franklin Schmidt 2007-12-17 09:23:14 Re: BUG #3819: UTF8 can't handle \000