Re: BUG #14278: Problem searching spanish words with accent mark outside the stem

From: Alvaro Herrera <alvherre(at)2ndquadrant(dot)com>
To: paco(at)hernandezgomez(dot)com
Cc: pgsql-bugs(at)postgresql(dot)org
Subject: Re: BUG #14278: Problem searching spanish words with accent mark outside the stem
Date: 2016-08-08 04:58:47
Message-ID: 20160808045847.GA878137@alvherre.pgsql
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-bugs

paco(at)hernandezgomez(dot)com wrote:

> Search without accent mark is not working correctly when the accent mark is
> outside the stem of the word.

I think it'd be better to apply unaccent() to both the stored text
before ts_vectorization and to the query terms. That would reliably
remove all diacritics (eñes too, though I suppose nobody would search
for their ñandúes by writing nandú, so it's not as severe).

--
Álvaro Herrera http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services

In response to

Responses

Browse pgsql-bugs by date

  From Date Subject
Next Message Jaime Casanova 2016-08-08 05:27:04 Re: BUG #14278: Problem searching spanish words with accent mark outside the stem
Previous Message Peter Geoghegan 2016-08-08 04:58:14 Re: BUG #14150: Attempted to delete invisible tuple