Re: Built-in CTYPE provider

From: Jeff Davis <pgsql(at)j-davis(dot)com>
To: Peter Eisentraut <peter(at)eisentraut(dot)org>, Daniel Verite <daniel(at)manitou-mail(dot)org>
Cc: Robert Haas <robertmhaas(at)gmail(dot)com>, Jeremy Schneider <schneider(at)ardentperf(dot)com>, pgsql-hackers(at)postgresql(dot)org
Subject: Re: Built-in CTYPE provider
Date: 2024-03-08 01:00:21
Message-ID: 6bdb98e68b2b05aa71f7f934e227738eac84ecee.camel@j-davis.com
Lists: pgsql-hackers

On Sat, 2024-03-02 at 15:02 -0800, Jeff Davis wrote:
> Attached v20.

And here's v22 (I didn't post v21).

I committed Unicode property tables and functions, and the simple case
mapping. I separated out the full case mapping changes (based on
SpecialCasing.txt) into patch 0006.

Not a lot of technical changes, but I cleaned up the remaining patches
and put them into a nicer order with nicer commit messages.

0001: Catalog renaming: colliculocale to colllocale and daticulocale to
datlocale.

0002: Basic builtin collation provider that only supports "C".

0003: C.UTF-8 locale for builtin collation provider and collation
pg_c_utf8.

0004: Inline some UTF-8 functions to improve performance.

0005: Add a unicode_strtitle() function and move the implementation for
the builtin provider out of formatting.c.

0006: Add full case mapping support.

0007: Add PG_UNICODE_FAST locale for builtin collation provider and
collation pg_unicode_fast. This behaves like the standard says
UCS_BASIC should behave -- sort by code point order but use Unicode
character semantics with full case mapping.
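
As a rough illustration of the two properties described for 0007 (code point ordering and full case mapping), here is a small sketch in Python. It is only an analogy that uses Python's own string semantics, not PostgreSQL or the patch code; the sample words are made up for the example.

```python
# Illustrative analogy only: Python strings, not PostgreSQL.
# "\u00c9" is E-acute (precomposed) and "\u00df" is sharp s.
words = ["zebra", "Zoo", "apple", "\u00c9mile", "stra\u00dfe"]

# Python compares str values code point by code point, so uppercase
# ASCII sorts before lowercase ASCII, and both sort before U+00C9.
by_code_point = sorted(words)

# UTF-8 preserves code point order, so comparing the encoded bytes
# yields the same ordering as comparing code points.
by_utf8_bytes = sorted(words, key=lambda s: s.encode("utf-8"))

# str.upper() applies Unicode full case mapping, where one character
# may map to several: sharp s uppercases to "SS".
upper_full = "stra\u00dfe".upper()

print(by_code_point)
print(by_code_point == by_utf8_bytes)
print(upper_full, len("stra\u00dfe"), len(upper_full))
```

The point of the last line is that full case mapping can change the length of a string, which simple (one-to-one) case mapping never does.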

0004 and beyond could use some review. 0004 and 0005 are pretty simple
and non-controversial. 0006 and 0007 are a bit more interesting and
could use some discussion if we want to go ahead with full case mapping
in 17.

Regards,
Jeff Davis

Attachment                                                       Content-Type  Size
v22-0001-Catalog-changes-preparing-for-builtin-collation-.patch  text/x-patch  48.8 KB
v22-0002-Introduce-collation-provider-builtin.patch              text/x-patch  59.7 KB
v22-0003-Add-C.UTF-8-locale-to-the-new-builtin-collation-.patch  text/x-patch  27.4 KB
v22-0004-Inline-basic-UTF-8-functions.patch                      text/x-patch  6.2 KB
v22-0005-Add-unicode_strtitle-for-Unicode-Default-Case-Co.patch  text/x-patch  9.3 KB
v22-0006-Support-Unicode-full-case-mapping-and-conversion.patch  text/x-patch  555.7 KB
v22-0007-Add-PG_UNICODE_FAST-locale-to-the-builtin-collat.patch  text/x-patch  16.9 KB
