Re: PostgreSQL 8.3 XML parser seems not to recognize the DOCTYPE element in XML files

From: Bruce Momjian <bruce(at)momjian(dot)us>
To: Lawrence Oluyede <l(dot)oluyede(at)gmail(dot)com>
Cc: pgsql-general(at)postgresql(dot)org
Subject: Re: PostgreSQL 8.3 XML parser seems not to recognize the DOCTYPE element in XML files
Date: 2008-04-15 14:24:35
Message-ID: 200804151424.m3FEOZ106087@momjian.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general pgsql-hackers


Added to TODO:

* Allow XML to accept more liberal DOCTYPE specifications

http://archives.postgresql.org/pgsql-general/2008-02/msg00347.php

---------------------------------------------------------------------------

Lawrence Oluyede wrote:
> As specified in the W3C Recommendation for XML the DOCTYPE element is
> perfectly valid in a document.
> I have a bunch of XML files generated by the boost library which
> contains a doctype like this:
>
> <!DOCTYPE boost_serialization>
>
> which lies within the bound of the recommendation
> (http://www.w3.org/TR/xml/#sec-prolog-dtd):
>
> "Note that it is possible to construct a well-formed document
> containing a doctypedecl that neither points to an external subset nor
> contains an internal subset."
>
> PostgreSQL 8.3 instead doesn't allow the insertion of XML with doctype
> in its new native data type returning this error message:
>
> """
> ERROR: invalid XML content
> DETAIL: Entity: line 2: parser error : StartTag: invalid element name
> <!DOCTYPE foo>
> ^
>
> ********** Error **********
>
> ERROR: invalid XML content
> SQL state: 2200N
> Detail: Entity: line 2: parser error : StartTag: invalid element name
> <!DOCTYPE foo>
> """
>
> This kind of behavior surprises me because pgsql has been compiled
> with the following flags on the development machine:
> ./configure --with-python --with-openssl --with-pam --with-libxml
> --with-libxslt --enable-thread-safety --enable-debug
>
> During the configuration stage it creates a Makefile binding the
> system version of the libxml2 library which is 2.6.30, the same
> library I use through Python (which parses correctly the XML file with
> the doctype).
>
> Any hints?
>
> ---------------------------(end of broadcast)---------------------------
> TIP 9: In versions below 8.0, the planner will ignore your desire to
> choose an index scan if your joining column's datatypes do not
> match

--
Bruce Momjian <bruce(at)momjian(dot)us> http://momjian.us
EnterpriseDB http://enterprisedb.com

+ If your life is a hard drive, Christ can be your backup. +

In response to

Responses

Browse pgsql-general by date

  From Date Subject
Next Message Sam Mason 2008-04-15 14:31:46 Re: Storage sizes for dates/times (documentation bug?)
Previous Message Karsten Hilbert 2008-04-15 14:23:42 Re: Storage sizes for dates/times (documentation bug?)

Browse pgsql-hackers by date

  From Date Subject
Next Message Tom Lane 2008-04-15 14:34:15 Re: pgwin32_safestat weirdness
Previous Message Bruce Momjian 2008-04-15 14:22:02 Re: Problem with site doc search