Re: UTF8 or Unicode

From: Abhijit Menon-Sen <ams(at)oryx(dot)com>
To: Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
Cc: PostgreSQL-development <pgsql-hackers(at)postgreSQL(dot)org>
Subject: Re: UTF8 or Unicode
Date: 2005-02-15 02:27:32
Message-ID: 20050215022732.GB24807@penne.toroid.org
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

At 2005-02-14 21:14:54 -0500, pgman(at)candle(dot)pha(dot)pa(dot)us wrote:
>
> Should our multi-byte encoding be referred to as UTF8 or Unicode?

The *encoding* should certainly be referred to as UTF-8. Unicode is a
character set, not an encoding; Unicode characters may be encoded with
UTF-8, among other things.

(One might think of a charset as being a set of integers representing
characters, and an encoding as specifying how those integers may be
converted to bytes.)

> I know UTF8 is a type of unicode but do we need to rename anything
> from Unicode to UTF8?

I don't know. I'll go through the documentation to see if I can find
anything that needs changing.

-- ams

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Joshua D. Drake 2005-02-15 02:56:49 Re: 8.0.X and the ARC patent
Previous Message pgsql 2005-02-15 02:21:01 Re: 8.0.X and the ARC patent