Re: XMLEXISTS on legacy XML with malformed xmlns

From: Pavel Stehule <pavel(dot)stehule(at)gmail(dot)com>
To: Edson Richter <edsonrichter(at)hotmail(dot)com>
Cc: "pgsql-general(at)postgresql(dot)org" <pgsql-general(at)postgresql(dot)org>
Subject: Re: XMLEXISTS on legacy XML with malformed xmlns
Date: 2016-02-12 04:49:29
Message-ID: CAFj8pRAf1Q-SnHZ5LUsXJ2dDDcEv0k0pK8zQAFXYgd89cEeuiA@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

2016-02-12 1:53 GMT+01:00 Edson Richter <edsonrichter(at)hotmail(dot)com>:

> Hi!
>
> I've some (about 1M records) containing legacy XML I would like to parse
> and apply XMLEXISTS.
>
> This is the query:
>
> select * from xmllog
> where xpath_exists(('//MyDocument[(at)DocNum = ''000411828'']'::text),
> xmlparse(document cdataout));
>
> This is the error:
>
> ERRO: could not parse XML document
> SQL state: 2200M
> Detail: line 2: xmlns:leg: 'LEGACYAPP - SEND MSG EVENTS ABOUT' is not a
> valid URI
> <leg:sendmsgeventsabout xmlns:leg="LEGACYAPP - SEND MSG EVENTS ABOUT">
> ^
>
> This is the sample XML with malformed xmlns (I've shortenet the data, but
> the important thing here is the malformed xmlns):
>
> "<?xml version="1.0" encoding="utf-8" ?>
> <leg:sendmsgeventsabout xmlns:leg="LEGACYAPP - SEND MSG EVENTS ABOUT">
> <carrier xmlns="" controlnum="04503660000146">
> <MyDocument DocNum="000511852">
> <other_info>0</other_info>
> <complement info (...)"
>
>
> I can easly read this XML in Notepad++, and also in Java - but PostgreSQL
> always throw error.
>
> Can you plase tell me how can make PostgreSQL ignore this malformed xmlns
> and proceed processing the XML?
>

PostgreSQL uses libxml2, but the usage isn't too configurable. So my advice
is using defensive strategy and clean/fix wrong namespace with string tools
- replace function.

Regards

Pavel

>
> Thanks,
>
> --
> Atenciosamente,
>
> Edson Carlos Ericksson Richter
>
>
>
> --
> Sent via pgsql-general mailing list (pgsql-general(at)postgresql(dot)org)
> To make changes to your subscription:
> http://www.postgresql.org/mailpref/pgsql-general
>

In response to

Responses

Browse pgsql-general by date

  From Date Subject
Next Message Albe Laurenz 2016-02-12 09:02:07 Re: PosgreSQL Security Architecture
Previous Message Tom Lane 2016-02-12 04:48:53 Re: 9.4 -> 9.5 dump size reduction