Better locale-specific-character-class handling for regexps

From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: pgsql-hackers(at)postgreSQL(dot)org
Cc: Bruno Wolff III <bruno(at)wolff(dot)to>
Subject: Better locale-specific-character-class handling for regexps
Date: 2016-08-23 00:54:58
Message-ID: 15563.1471913698@sss.pgh.pa.us
Views: Raw Message | Whole Thread | Download mbox
Thread:
Lists: pgsql-hackers

I got tired of hearing complaints about the issue described in
this thread:
https://www.postgresql.org/message-id/flat/24241.1329347196%40sss.pgh.pa.us

Here's a proposed fix. I've not done extensive performance testing,
but it seems to be as fast or faster than the old code in cases where
there are not too many "large" characters in the input. And, more
to the point, it gets the right answer for such large characters.

I'll add this to the upcoming commitfest.

regards, tom lane

Attachment Content-Type Size
better-regex-colormaps-1.patch text/x-diff 69.7 KB

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Craig Ringer 2016-08-23 00:55:00 Re: [PATCH] Transaction traceability - txid_status(bigint)
Previous Message Peter Geoghegan 2016-08-23 00:36:31 Re: [BUGS] Re: Missing rows with index scan when collation is not "C" (PostgreSQL 9.5)