Re: BUG #13440: unaccent does not remove all diacritics

From: Thomas Munro <thomas(dot)munro(at)enterprisedb(dot)com>
To: Andres Freund <andres(at)anarazel(dot)de>
Cc: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, Alvaro Herrera <alvherre(at)2ndquadrant(dot)com>, Michael Gradek <mike(at)busbud(dot)com>, PostgreSQL Bugs <pgsql-bugs(at)postgresql(dot)org>
Subject: Re: BUG #13440: unaccent does not remove all diacritics
Date: 2015-09-03 03:16:38
Message-ID: CAEepm=3SRnDYAFBJ1YWWzgTxSFh30WLsqVahg5Sz7sY_vP6TPw@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-bugs

On Thu, Sep 3, 2015 at 12:06 PM, Andres Freund <andres(at)anarazel(dot)de> wrote:

> On 2015-09-02 19:59:37 -0400, Tom Lane wrote:
> > No, not after someone pointed out that it could have strange side-effects
> > on full text search configurations that used unaccent. You'd stop being
> > able to find documents whenever your search term is stripped of accents
> > more thoroughly than before. That might be all right in a new major
> > release (if it documents that you might have to rebuild your FTS indexes
> > and derived tsvector columns). It's not all right in a minor release.
>
> Yes, it was me that pointed that out and argued against it ;)
>
> http://archives.postgresql.org/message-id/20150618202135.GB29350%40alap3.anarazel.de
> and following.
>
> Thomas, will you repost context & a patch implementing this (instead of
> just files)?
>

Attached. I gave the script a better name and some comments.

--
Thomas Munro
http://www.enterprisedb.com

Attachment Content-Type Size
unaccent-rules-with-all-diacritics-and-ligatures.patch application/octet-stream 8.8 KB

In response to

Browse pgsql-bugs by date

  From Date Subject
Next Message Gerdan Rezende dos Santos 2015-09-03 03:20:18 Re: error on online backup using pg_basebackup tool
Previous Message Michael Paquier 2015-09-03 02:52:18 Re: error on online backup using pg_basebackup tool