Re: UTF-8 docs?

From: Tatsuo Ishii <ishii(at)sraoss(dot)co(dot)jp>
To: vitus(at)wagner(dot)pp(dot)ru
Cc: pgsql-hackers(at)postgresql(dot)org
Subject: Re: UTF-8 docs?
Date: 2016-08-22 08:28:31
Message-ID: 20160822.172831.1086189593119146685.t-ishii@sraoss.co.jp
Lists: pgsql-hackers

> On Mon, 22 Aug 2016 14:16:45 +0900 (JST)
> Tatsuo Ishii <ishii(at)sraoss(dot)co(dot)jp> wrote:
>
>> Just out of curiosity, I wonder why we can't make the encoding of
>> the SGML docs UTF-8, rather than the current ISO-8859-1.
>
>
> What is the reason to "make the encoding of sgml docs" be anything in
> particular? What actual change should be made, and what problems would
> it solve?

The problem is that the PostgreSQL docs are fixed to ISO-8859-1, and if I
want to use another encoding, I need to change the build system, which
is annoying. Ideally, someone who wants to use an encoding other than
ISO-8859-1 would just change the contents of the SGML files. Just
changing ISO-8859-1 to UTF-8 would solve most problems.

(Allowing an arbitrary encoding to be specified would probably be
better, but it needs some work.)
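
For illustration only, the re-encoding of file contents described above
could be sketched roughly like this. The function name and the default
encodings are my own assumptions, not anything in the PostgreSQL tree,
and the sketch does not touch whatever encoding the build system itself
declares.

```python
from pathlib import Path


def reencode_file(path, src_encoding="iso-8859-1", dst_encoding="utf-8"):
    """Rewrite one file in place, converting src_encoding to dst_encoding.

    Hypothetical helper for illustration. ISO-8859-1 maps every byte to a
    character, so decoding never fails; running this twice on the same
    file would therefore silently double-encode non-ASCII characters.
    """
    file_path = Path(path)
    # Read raw bytes so no implicit locale-dependent decoding happens.
    text = file_path.read_bytes().decode(src_encoding)
    # Write the same characters back in the target encoding.
    file_path.write_bytes(text.encode(dst_encoding))
```

Pure ASCII files come out byte-for-byte unchanged, so only files that
actually contain non-ASCII characters would differ after such a pass.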

> There are various translations of the PostgreSQL docs, and they use various
> encodings. Translated versions of docs on http://postgresql.org/docs
> are just links to external sites where translations are maintained.
> English documentation uses ISO-8859-1 (actually ASCII),
> Russian uses UTF-8 (you can download our source tarball from
> http://repo.postgrespro.ru/pgpro-9.5/src and see postgres source
> distribution with UTF-8 sgmls inside).
>
> Japanese documentation in HTML form is served from
> http://www.postgresql.jp/document/9.5/html/
> in utf-8 too.
>
> I.e. everybody who needs UTF-8 to represent a translation of the
> documentation already uses it.
>
> What exactly do you propose to be done?

See above.

> Really, the change we need is a conversion from SGML to XML format.
> It would solve some real problems, such as the ability to include
> diagrams in the docs, and would also let everyone explicitly specify
> the encoding in the XML declaration (and probably cause a switch to
> UTF-8 as a side effect, because most XML-based tools use UTF-8 as the
> default).

That's another story.

Best regards,
--
Tatsuo Ishii
SRA OSS, Inc. Japan
English: http://www.sraoss.co.jp/index_en.php
Japanese: http://www.sraoss.co.jp
