From: | Karel Zak <zakkr(at)zf(dot)jcu(dot)cz> |
---|---|
To: | Peter Eisentraut <peter_e(at)gmx(dot)net> |
Cc: | pgsql-patches <pgsql-patches(at)postgreSQL(dot)org> |
Subject: | Re: encoding names v2. |
Date: | 2001-08-23 07:51:14 |
Message-ID: | 20010823095114.B7892@zf.jcu.cz |
Lists: | pgsql-patches |
On Wed, Aug 22, 2001 at 09:38:03PM +0200, Peter Eisentraut wrote:
> Okay, here is some bad news: I just looked into the SQL99 standard for
> the names of predefined character set names, and here is the list:
>
> SQL_CHARACTER
> GRAPHIC_IRV or ASCII_GRAPHIC
> LATIN1 <==== !!!
> ISO8BIT or ASCII_FULL
> UTF16
> UTF8
> UCS2
> SQL_TEXT
> SQL_IDENTIFIER
>
> So perhaps we should keep the LATIN1 thing after all? I don't like it,
> but the rules...
>
> Comments?
Oh man... what do you want to hear? :-(
There is ***no problem*** adding an arbitrary alias (for example, LATIN1 is
still a correct name for our code), but the question is which names we
select as primary and show as output to users. I'm really unsure whether we
must blindly support SQL99 if this standard *ignores* other standards and
conventions in some of its rules. We can support SQL99's ignorant names, for
example in pg_char_to_encoding(), but we needn't show these names to users
(for example in psql's \l command).
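
To make the alias idea concrete, here is a minimal sketch of how it could
look. It is not the code from the patch: the enum values, the tables, and the
functions name_to_encoding() and encoding_to_name() are all hypothetical, and
the primary names are picked only for illustration. Lookup accepts any alias
(so the SQL99 names work as input), while output always uses one primary name
per encoding.

```c
#include <ctype.h>
#include <stddef.h>

/* Hypothetical encoding ids, for illustration only. */
typedef enum {
    ENC_UNKNOWN = -1,
    ENC_LATIN1 = 0,
    ENC_UTF8,
    ENC_KOI8R,
    ENC_COUNT
} enc_id;

/* One primary name per encoding: the only name ever shown to users. */
static const char *const enc_primary_name[ENC_COUNT] = {
    "ISO-8859-1", "UTF-8", "KOI8-R"
};

/* Every accepted input name, including SQL99-style aliases. */
typedef struct {
    const char *alias;
    enc_id      id;
} enc_alias;

static const enc_alias enc_aliases[] = {
    {"ISO-8859-1", ENC_LATIN1},
    {"LATIN1",     ENC_LATIN1},  /* SQL99 name, accepted as input */
    {"UTF-8",      ENC_UTF8},
    {"UTF8",       ENC_UTF8},    /* SQL99 name, accepted as input */
    {"KOI8-R",     ENC_KOI8R},
    {"KOI8R",      ENC_KOI8R},
};

/* Case-insensitive comparison of two NUL-terminated names. */
static int names_equal(const char *a, const char *b)
{
    while (*a && *b) {
        if (tolower((unsigned char) *a) != tolower((unsigned char) *b))
            return 0;
        a++;
        b++;
    }
    return *a == *b;  /* equal only if both strings ended together */
}

/* Map any known name or alias to its id, or ENC_UNKNOWN. */
enc_id name_to_encoding(const char *name)
{
    size_t i;

    for (i = 0; i < sizeof(enc_aliases) / sizeof(enc_aliases[0]); i++) {
        if (names_equal(name, enc_aliases[i].alias))
            return enc_aliases[i].id;
    }
    return ENC_UNKNOWN;
}

/* Map an id to its primary display name, or NULL if the id is invalid. */
const char *encoding_to_name(enc_id id)
{
    if (id < 0 || id >= ENC_COUNT)
        return NULL;
    return enc_primary_name[id];
}
```

With these tables, a user can type "latin1", but anything that prints the
encoding goes through encoding_to_name() and so shows "ISO-8859-1". Changing
which name is primary is then a one-line edit to the display table.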
> > - getdatabaseencoding() is compatible with old versions, but
> > in the code is commented as deprecated.
> >
> > - getdbencoding() is new function that return correct encoding names
>
> See my other message about this. I don't think this is a good choice of
> names.
OK.
> This is okay, look at the list above for precedent.
>
> > - the ./configure.in:
> > * use new encoding names too for --enable-multibyte
> > * define MULTIBYTE that handle default encoding id
>
> Where is this needed?
In "mb/mbutils.c" the default database encoding was/is set by encoding id
(maybe it's never used, because the standard backend initializes the encoding
during startup, but the old code used it and I kept it).
>
> > * define MULTIBYTE_NAME that handle default encoding name (needful
> > for initdb)
>
> Can you rename this to something like DEFAULT_CHARACTER_SET? There is
> really nothing "multibyte" here.
Good point.
> > src/utils/mb/Unicode/KOI8_to_utf8.map --> src/utils/mb/Unicode/KOI8R_to_utf8.map
> > src/utils/mb/Unicode/WIN_to_utf8.map --> src/utils/mb/Unicode/WIN1251_to_utf8.map
> > src/utils/mb/Unicode/utf8_to_KOI8.map --> src/utils/mb/Unicode/utf8_to_KOI8R.map
> > src/utils/mb/Unicode/utf8_to_WIN.map --> src/utils/mb/Unicode/utf8_to_WIN1251.map
>
> Can you introduce some uniform capitalization (e.g., all lower case)?
OK.
> Don't worry, we'll get there. ;-)
I'm still happy :-)
Karel
--
Karel Zak <zakkr(at)zf(dot)jcu(dot)cz>
http://home.zf.jcu.cz/~zakkr/
C, PostgreSQL, PHP, WWW, http://docs.linux.cz, http://mape.jcu.cz