Re: GB18030-2022 Support in PostgreSQL

From: Chao Li <li(dot)evan(dot)chao(at)gmail(dot)com>
To: John Naylor <johncnaylorls(at)gmail(dot)com>
Cc: Peter Eisentraut <peter(at)eisentraut(dot)org>, pgsql-hackers(at)lists(dot)postgresql(dot)org, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, Andrew Dunstan <andrew(at)dunslane(dot)net>
Subject: Re: GB18030-2022 Support in PostgreSQL
Date: 2025-08-13 08:08:45
Message-ID: 96251bdc-4671-49ac-b281-fae8f3d477d4@gmail.com
Views: Whole Thread | Raw Message | Download mbox | Resend email
Thread:
Lists: pgsql-hackers


On 2025/8/13 15:20, Chao Li wrote:
>
>
> Sounds good. Let me recreate the patch.
>
>
Attached is the new patch. It downloads the UCM file in make:

```
Unicode % make gb18030_to_utf8.map
wget -O gb-18030-2000.ucm --no-use-server-timestamps
https://raw.githubusercontent.com/unicode-org/icu-data/d9d3a6ed27bb98a7106763e940258f0be8cd995b/charset/data/ucm/gb-18030-2000.ucm
--2025-08-13 15:54:53--
https://raw.githubusercontent.com/unicode-org/icu-data/d9d3a6ed27bb98a7106763e940258f0be8cd995b/charset/data/ucm/gb-18030-2000.ucm
HTTP request sent, awaiting response... 200 OK
Length: 672885 (657K) [text/plain]
Saving to: ‘gb-18030-2000.ucm’

gb-18030-2000.ucm  100%[=====================================>] 657.11K
 2.78MB/s    in 0.2s

2025-08-13 15:54:54 (2.78 MB/s) - ‘gb-18030-2000.ucm’ saved [672885/672885]

'/usr/bin/perl' -I . UCS_to_GB18030.pl
- Writing UTF8=>GB18030 conversion table: utf8_to_gb18030.map
- Writing GB18030=>UTF8 conversion table: gb18030_to_utf8.map
Unicode % git diff
Unicode %
```

After regenerating the map files, there is no change found in the map files.

Best regards,

Chao Li (Evan)
--------------------
HighGo Software Co., Ltd.
https://www.highgo.com/

Attachment Content-Type Size
v1-0001-GB18030-Switch-to-using-gb-18030-2000.ucm.patch text/plain 5.1 KB

In response to

Browse pgsql-hackers by date

  From Date Subject
Next Message kasaharatt 2025-08-13 08:44:06 Re: Add log_autovacuum_{vacuum|analyze}_min_duration
Previous Message Nazir Bilal Yavuz 2025-08-13 08:05:23 Re: meson: Add _static and _shared suffixes to the library names