Re: BUG #18274: Error 'invalid XML content'

From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: Michael Paquier <michael(at)paquier(dot)xyz>
Cc: Dmitry Koval <d(dot)koval(at)postgrespro(dot)ru>, pgsql-bugs(at)lists(dot)postgresql(dot)org
Subject: Re: BUG #18274: Error 'invalid XML content'
Date: 2024-01-26 04:14:55
Message-ID: 696008.1706242495@sss.pgh.pa.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-bugs

Michael Paquier <michael(at)paquier(dot)xyz> writes:
> On Thu, Jan 25, 2024 at 03:12:07PM +0300, Dmitry Koval wrote:
>> I agree that reverting a patch is a good idea if there are concerns about
>> server resources (XML is used by few users and there are even fewer users
>> who need to parse elements larger than 10Mb).
>> For such users it is better to create custom PostgreSQL build.

> And done with f2743a7d70e7.

Related to this: I just read some interesting things in libxml2 2.12's
release notes:

Most of the known issues leading to quadratic behavior in the XML parser
were fixed. Internal hash tables were rewritten to reduce memory
consumption.

A new API function xmlCtxtSetMaxAmplification was added to allow parsing
of files that would otherwise trigger the billion laughs protection.

Could it be that if we see this new function is available and use it,
we could allow more than we have done historically? I don't have a
whole lot of faith here, but perhaps this is worth investigation.

(BTW, 2.12 has created some annoying API breaks, which seems to be why
caiman is failing. We have some work to do there in any case.)

regards, tom lane

In response to

Browse pgsql-bugs by date

  From Date Subject
Next Message Tender Wang 2024-01-26 04:54:07 Re: BUG #18310: Some SQL commands fail to process duplicate objects with error: tuple already updated by self
Previous Message Michael Paquier 2024-01-26 03:41:16 Re: BUG #18310: Some SQL commands fail to process duplicate objects with error: tuple already updated by self