Re: UTF8 for docs

From: Bruce Momjian <bruce(at)momjian(dot)us>
To: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc: PostgreSQL-documentation <pgsql-docs(at)postgresql(dot)org>
Subject: Re: UTF8 for docs
Date: 2018-05-01 14:18:14
Message-ID: 20180501141814.GB6470@momjian.us
Lists: pgsql-docs

On Tue, May 1, 2018 at 10:14:48AM -0400, Tom Lane wrote:
> Bruce Momjian <bruce(at)momjian(dot)us> writes:
> > Our release.sgml contains these lines, that I wrote:
> > we cannot use UTF8 because SGML Docbook does not support it
> > do not use numeric _UTF_ numeric character escapes (&#nnn;),
> > we can only use Latin1
>
> > Should this be changed now that we are using XML for head? It cannot be
> > changed for back branch releases since those are still SGML, so I
> > suggest we keep this restriction. I have updated the doc comments.
>
> I might be wrong, but I was under the impression that restricting the
> character set was still a good idea because of downstream restrictions
> on rendering of the docs. For instance, pretty much every web browser
> can render Latin1 characters, but I wouldn't bet on Klingon working.

Oh, uh, I wasn't sure whether those SGML specifications were passed
unchanged into the output.

> Maybe we could go a little further than the standard named-entity
> characters, but it'd take some research to figure out what is safe.

Yeah. I have added this as a doc comment so we don't forget.
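
For what it's worth, the restriction quoted above (Latin1 only, no numeric
character escapes of the form &#nnn;) is easy to check mechanically. Below is
a minimal sketch in Python, not anything that exists in our tree: the function
name check_line and the choice to also flag hexadecimal escapes (&#xNN;) are
my own assumptions for illustration.

```python
import re

# Matches numeric character references such as &#233; or &#xE9;.
# Flagging the hex form too is an assumption; the doc comment only
# mentions the decimal &#nnn; form.
NUMERIC_ESCAPE = re.compile(r"&#(?:[0-9]+|[xX][0-9A-Fa-f]+);")


def check_line(line):
    """Return a list of problems found in one line of doc source.

    Hypothetical helper: reports characters outside Latin1
    (code points above U+00FF) and numeric character escapes.
    """
    problems = []
    for ch in line:
        if ord(ch) > 0xFF:
            problems.append(f"non-Latin1 character U+{ord(ch):04X}")
    for match in NUMERIC_ESCAPE.finditer(line):
        problems.append(f"numeric escape {match.group(0)}")
    return problems
```

Named entities such as &eacute; pass this check, while something like
&#233; or a Euro sign would be reported. Which characters beyond Latin1
are actually safe downstream is a separate question that this sketch does
not answer.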

--
Bruce Momjian <bruce(at)momjian(dot)us> http://momjian.us
EnterpriseDB http://enterprisedb.com

+ As you are, so once was I. As I am, so you will be. +
+ Ancient Roman grave inscription +
