Re: BUG #13440: unaccent does not remove all diacritics

From: Teodor Sigaev <teodor(at)sigaev(dot)ru>
To: Léonard Benedetti <benedetti(at)mlpo(dot)fr>, PostgreSQL Bugs <pgsql-bugs(at)postgresql(dot)org>
Subject: Re: BUG #13440: unaccent does not remove all diacritics
Date: 2016-03-10 13:46:21
Message-ID: 56E17AAD.6080605@sigaev.ru
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-bugs

>
> On the other hand, UTF-8 encoding for source code is *a feature of
> Python 3* (to quote the documentation: “The default encoding for Python
> source code is UTF-8”) so there is no possible ambiguity, and it will
> not be a problem. That said, some non-ASCII characters may be removed
> without prejudice from the source code of the script (I think in
> particular to "“" and "”"). Nevertheless, for some comments, it would be
> unfortunate (e.g. “# RegEx to parse rules (e.g. “Đ → D ; […]”)” or “# ℃
> °C”).
Ok, I didn't know that.

> Thus, I propose to adapt the code to Python 3 (the encoding of the
> script does not seem to be a problem for the above reasons). I try to do
> it shortly.
We are waiting...

--
Teodor Sigaev E-mail: teodor(at)sigaev(dot)ru
WWW: http://www.sigaev.ru/

In response to

Responses

Browse pgsql-bugs by date

  From Date Subject
Next Message Léonard Benedetti 2016-03-10 14:35:00 Re: BUG #13440: unaccent does not remove all diacritics
Previous Message Nick Cleaton 2016-03-10 13:13:37 Re: streaming replication master can fail to shut down