Re: BUG #13440: unaccent does not remove all diacritics

From: Alvaro Herrera <alvherre(at)2ndquadrant(dot)com>
To: Michael Gradek <mike(at)busbud(dot)com>
Cc: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, pgsql-bugs(at)postgresql(dot)org
Subject: Re: BUG #13440: unaccent does not remove all diacritics
Date: 2015-06-15 04:50:56
Message-ID: 20150615045056.GM133018@postgresql.org
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-bugs

Michael Gradek wrote:

> grand-central=# select 'ț' as input, unaccent('ț') as observed, 't' as
> expected;
>
> input | observed | expected
>
> -------+----------+----------
>
> ț | ț | t

> > Hm, I do see
> >
> > ţ t

My terminal shows these characters to be different. One is
http://graphemica.com/%C8%9B
latin small letter t with comma below (U+021B)

The other is
http://graphemica.com/%C5%A3
latin small letter t with cedilla (U+0163)

--
Álvaro Herrera http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services

In response to

Responses

Browse pgsql-bugs by date

  From Date Subject
Next Message Michael Paquier 2015-06-15 07:35:47 Re: Lack of Sanity Checking in file 'misc.c' for PostgreSQL 9.4.x
Previous Message Thomas Munro 2015-06-15 04:47:01 Re: BUG #13440: unaccent does not remove all diacritics