Quick Links

Re: PATCH: Allow empty targets in unaccent dictionary

From:	Abhijit Menon-Sen <ams(at)2ndQuadrant(dot)com>
To:	pgsql-hackers(at)postgresql(dot)org
Cc:	Mohammad Alhashash <alhashash(at)alhashash(dot)net>
Subject:	Re: PATCH: Allow empty targets in unaccent dictionary
Date:	2014-06-29 11:43:28
Message-ID:	20140629114328.GA31670@toroid.org
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

Hi.

I've attached a patch to contrib/unaccent as outlined in my review the
other day. I'm familiar with multiple languages in which modifiers are
separate characters (but not Arabic), so I decided to try a quick test
because I was curious.

I added a line containing only U+0940 (DEVANAGARI VOWEL SIGN II) to
unaccent.rules, and tried the following (the argument to unaccent is
U+0915 U+0940, and the result is U+0915):

ams=# select unaccent('unaccent','की ');
unaccent
----------
क
(1 row)

So the patch works fine: it correctly removes the modifier.

To add a test, however, it would be necessary to add this modifier to
unaccent.rules. But if we're adding one modifier to unaccent.rules, we
really should add them all. I have nowhere near the motivation needed to
add all the Devanagari modifiers, let alone any of the other languages I
know, and even if I did, it still wouldn't address Mohammad's use case.

(As a separate matter, it's not clear to me if stripping these modifiers
using unaccent is something everyone will want to do.)

So, though I'm not fond of saying it, perhaps the right thing to do is
to forget my earlier objection (that the patch didn't have tests), and
just commit as-is. It's a pretty straightforward patch, and it works.

I'm setting this as ready for committer.

-- अभजत "unaccented in three languages" മനന-সন

Attachment	Content-Type	Size
unaccent.diff	text/x-diff	1.2 KB

In response to

Re: PATCH: Allow empty targets in unaccent dictionary at 2014-06-25 05:20:12 from Abhijit Menon-Sen

Responses

Re: PATCH: Allow empty targets in unaccent dictionary at 2014-06-30 19:19:17 from Tom Lane

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Mohammad Alhashash	2014-06-29 12:03:10	Re: PATCH: Allow empty targets in unaccent dictionary
Previous Message	MauMau	2014-06-29 11:35:04	Re: [Fwd: Re: proposal: new long psql parameter --on-error-stop]