sgml cleanup: unescaped '>' characters

From: Josh Kupershmidt <schmiddy(at)gmail(dot)com>
To: pgsql-docs <pgsql-docs(at)postgresql(dot)org>
Subject: sgml cleanup: unescaped '>' characters
Date: 2011-08-25 03:28:45
Message-ID: CAK3UJRENGTA97ZEhguS6CzjqQgqU-D8fpFdh54tHT-1xfx2LpQ@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-docs

I found myself rewriting the ./src/tools/find_gt_lt script in Perl
this evening, since the existing script was quite broken (the main
problem is it's not capable of understanding CDATA or sgml comment
sections, and hence produces a bunch of noise).

The rewritten version picked up a few stylistic inconsistencies in the
SGML, such as:
* breaking the trailing '>' of an SGML marker across lines. AFAIK
this is legal, but is a bit inconsistent and just confuses simplistic
tools like find_gt_lt
* using single quotes instead of double quotes to surround a node
attribute, as in <orderedlist numeration='loweralpha'>

as well as seemingly-invalid SGML, such as using '>' unescaped inside
normal SGML entries.

I've attached a patch to fix these problems. I can send in the new
version of find_gt_lt if these changes prove useful.

Josh

Attachment Content-Type Size
sgml_unescaped_gt.patch text/x-patch 16.8 KB

Responses

Browse pgsql-docs by date

  From Date Subject
Next Message Bruce Momjian 2011-08-25 03:40:45 Re: [GENERAL] documentation suggestion
Previous Message Peter Eisentraut 2011-08-24 18:59:13 Re: [GENERAL] documentation suggestion