From:
Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:
Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
Cc:
"Markus Bertheau ?" <twanger(at)bluetwanger(dot)de>,Peter Eisentraut <peter_e(at)gmx(dot)net>, Tatsuo Ishii <t-ishii(at)sra(dot)co(dot)jp>,dpage(at)vale-housing(dot)co(dot)uk, oliver(at)opencloud(dot)com, zakkr(at)zf(dot)jcu(dot)cz,PostgreSQL-patches <pgsql-patches(at)postgresql(dot)org>
Subject:
Re: [HACKERS] UTF8 or Unicode
Date:
2005-03-02 17:54:20
Message-ID:
11919.1109786060@sss.pgh.pa.us (view raw or flat )
Thread:
2005-02-18 12:49:36 from "Dave Page" <dpage(at)vale-housing(dot)co(dot)uk>
2005-02-19 03:35:07 from Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
2005-02-19 03:58:34 from Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
2005-02-22 03:08:25 from Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
2005-02-23 02:44:29 from Tatsuo Ishii <t-ishii(at)sra(dot)co(dot)jp>
2005-02-25 04:51:16 from Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
2005-02-25 07:02:45 from Peter Eisentraut <peter_e(at)gmx(dot)net>
2005-02-25 08:59:25 from Karel Zak <zakkr(at)zf(dot)jcu(dot)cz>
2005-02-25 10:01:44 from Peter Eisentraut <peter_e(at)gmx(dot)net>
2005-02-25 15:26:05 from Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
2005-02-25 15:33:13 from Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
2005-02-25 16:32:21 from Peter Eisentraut <peter_e(at)gmx(dot)net>
2005-02-25 18:36:57 from Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
2005-02-26 20:50:23 from Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
2005-02-26 23:08:52 from lsunley(at)mb(dot)sympatico(dot)ca
2005-02-26 23:49:36 from lsunley(at)mb(dot)sympatico(dot)ca
2005-02-27 00:55:40 from Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
2005-02-26 23:56:59 from lsunley(at)mb(dot)sympatico(dot)ca
2005-02-27 01:07:22 from Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
2005-02-27 04:09:43 from Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
2005-02-27 04:50:12 from lsunley(at)mb(dot)sympatico(dot)ca
2005-02-27 04:53:48 from lsunley(at)mb(dot)sympatico(dot)ca
2005-03-02 09:48:43 from Markus Bertheau ☭ <twanger(at)bluetwanger(dot)de>
2005-03-02 13:31:11 from Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
2005-03-02 17:54:20 from Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
2005-03-03 05:20:35 from Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
Lists:
pgsql-hackers pgsql-patches
Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us> writes:
>> The correct encoding name is "UTF-8".
> True, but Peter says the ANSI standard calls it UTF8 so that's what I
> used.
What SQL99 actually says is
- UTF8 specifies the name of a character repertoire that consists
of every character represented by The Unicode Standard Version
2.0 and by ISO/IEC 10646 UTF-8, where each character is encoded
using the UTF-8 encoding, occupying from 1 (one) through 6
octets.
That is, "UTF8" is an identifier chosen to refer to an encoding which
they know perfectly well is really called UTF-8. We should probably
follow the same convention of using UTF8 in code identifiers and UTF-8
in documentation. In particular, UTF_8 with an underscore is sanctioned
by nobody and should be avoided.
regards, tom lane
In response to
Responses
pgsql-hackers by date
Next :From: Tom LaneDate: 2005-03-02 18:16:10
Subject : Re: Vacuum time degrading
Previous :From : Andreas PflugDate : 2005-03-02 17:35:45
Subject : Re: logging as inserts
pgsql-patches by date
Next :From: Bruce MomjianDate: 2005-03-02 18:16:27
Subject : Re: [pgsql-hackers-win32] [HACKERS] snprintf causes regression
Previous :From : Stefan HansDate : 2005-03-02 17:35:01
Subject : typos in the docu