Re: UTF8 or Unicode

From: Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
To: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc: Dave Page <dpage(at)vale-housing(dot)co(dot)uk>, Oliver Jowett <oliver(at)opencloud(dot)com>, Karel Zak <zakkr(at)zf(dot)jcu(dot)cz>, List pgsql-hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: UTF8 or Unicode
Date: 2005-02-22 03:08:25
Message-ID: 200502220308.j1M38PV03238@candle.pha.pa.us
Lists: pgsql-hackers pgsql-patches

Tom Lane wrote:
> Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us> writes:
> > I think we just need to _favor_ UTF8.
>
> I agree.
>
> > The question is where are we
> > favoring Unicode rather than UTF8?
>
> It's the canonical name of the encoding, both in the code and the docs.
>
> regression=# create database e encoding 'utf-8';
> CREATE DATABASE
> regression=# \l
>         List of databases
>     Name    |  Owner   | Encoding
> ------------+----------+-----------
>  e          | postgres | UNICODE
>  regression | postgres | SQL_ASCII
>  template0  | postgres | SQL_ASCII
>  template1  | postgres | SQL_ASCII
> (5 rows)
>
> As soon as we decide whether the canonical name is "UTF8" or "UTF-8"
> ;-) we can fix it.

I checked and it looks like "UTF-8" is the correct usage:

http://www.unicode.org/glossary/

--
Bruce Momjian | http://candle.pha.pa.us
pgman(at)candle(dot)pha(dot)pa(dot)us | (610) 359-1001
+ If your life is a hard drive, | 13 Roberts Road
+ Christ can be your backup. | Newtown Square, Pennsylvania 19073
