Quick Links

Re: UTF8 or Unicode

From:	Abhijit Menon-Sen <ams(at)oryx(dot)com>
To:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
Cc:	PostgreSQL-development <pgsql-hackers(at)postgreSQL(dot)org>
Subject:	Re: UTF8 or Unicode
Date:	2005-02-15 02:27:32
Message-ID:	20050215022732.GB24807@penne.toroid.org
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

At 2005-02-14 21:14:54 -0500, pgman(at)candle(dot)pha(dot)pa(dot)us wrote:
>
> Should our multi-byte encoding be referred to as UTF8 or Unicode?

The *encoding* should certainly be referred to as UTF-8. Unicode is a
character set, not an encoding; Unicode characters may be encoded with
UTF-8, among other things.

(One might think of a charset as being a set of integers representing
characters, and an encoding as specifying how those integers may be
converted to bytes.)

> I know UTF8 is a type of unicode but do we need to rename anything
> from Unicode to UTF8?

I don't know. I'll go through the documentation to see if I can find
anything that needs changing.

-- ams

In response to

UTF8 or Unicode at 2005-02-15 02:14:54 from Bruce Momjian

Responses

Re: UTF8 or Unicode at 2005-02-15 03:05:08 from Bruce Momjian
Re: UTF8 or Unicode at 2005-02-15 03:13:32 from Agent M

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Joshua D. Drake	2005-02-15 02:56:49	Re: 8.0.X and the ARC patent
Previous Message	pgsql	2005-02-15 02:21:01	Re: 8.0.X and the ARC patent