BUG #18216: Unaccent function is unable to remove accents (diacritic signs) from Japanese character 'ド'

From: PG Bug reporting form <noreply(at)postgresql(dot)org>
To: pgsql-bugs(at)lists(dot)postgresql(dot)org
Cc: shailesh(dot)totale(at)sailpoint(dot)com
Subject: BUG #18216: Unaccent function is unable to remove accents (diacritic signs) from Japanese character 'ド'
Date: 2023-11-28 07:15:41
Message-ID: 18216-1e37bb8229aeab67@postgresql.org
Lists: pgsql-bugs

The following bug has been logged on the website:

Bug reference: 18216
Logged by: Shailesh Totale
Email address: shailesh(dot)totale(at)sailpoint(dot)com
PostgreSQL version: 13.8
Operating system: Linux
Description:

Hello team,

PostgreSQL's unaccent module does not use Unicode normalisation, only a
simple search-and-replace dictionary. That dictionary, unaccent.rules
(https://github.com/postgres/postgres/blob/master/contrib/unaccent/unaccent.rules),
does not contain Japanese characters such as 'ド', so unaccent is unable to
remove their diacritic signs. Can someone please advise when we can expect
these Japanese characters to be added?
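
To illustrate the difference between the two approaches, here is a minimal
Python sketch. It is not PostgreSQL's implementation: SAMPLE_RULES is a
hypothetical stand-in for a few unaccent.rules entries, and the function
names are made up for this example. It shows that a dictionary lookup leaves
'ド' (U+30C9) untouched when no rule exists, while Unicode decomposition
yields the base character 'ト' (U+30C8).

```python
import unicodedata

# Hypothetical stand-in for a few unaccent.rules entries (source -> target).
# Assumption for illustration only; the real file is much larger.
SAMPLE_RULES = {"\u00c0": "A", "\u00e9": "e"}  # À -> A, é -> e


def unaccent_with_rules(text, rules):
    """Dictionary-style replacement: characters without a rule pass through."""
    return "".join(rules.get(ch, ch) for ch in text)


def strip_marks(text):
    """Normalisation-style removal: decompose (NFD), drop combining marks,
    then recompose (NFC) so unrelated characters stay in composed form."""
    decomposed = unicodedata.normalize("NFD", text)
    kept = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return unicodedata.normalize("NFC", kept)


def rule_line(ch):
    """Build one 'source<whitespace>target' line in the style of unaccent.rules."""
    return f"{ch}\t{strip_marks(ch)}"


if __name__ == "__main__":
    do = "\u30c9"  # KATAKANA LETTER DO
    print(unaccent_with_rules(do, SAMPLE_RULES))  # unchanged: no rule for it
    print(strip_marks(do))                        # base character U+30C8
    print(rule_line(do))                          # candidate rule line
```

A line produced by rule_line could serve as a starting point for a custom
rules file, assuming the same two-column layout that unaccent.rules uses.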

I also checked the latest versions of PostgreSQL, and they still do not
support these Japanese characters.

https://pgpedia.info/u/unaccent.html

Thanks,
Shailesh
