Re: search on accents -> Why not include this function

From: Peter Eisentraut <peter_e(at)gmx(dot)net>
To: Jaume Teixi <teixi(at)6tems(dot)com>
Cc: <pgsql-admin(at)postgresql(dot)org>, Patrice Hédé <patrice(dot)hede(at)idf(dot)net>
Subject: Re: search on accents -> Why not include this function
Date: 2001-03-29 18:13:04
Message-ID: Pine.LNX.4.30.0103292007570.2091-100000@peter.localdomain
Lists: pgsql-admin

Jaume Teixi writes:

> > The reason there is no such implementation, and probably won't be any time
> > soon, is that this tool would either have to hard-code or ignore natural
> > language semantics, neither of which would make it practical. Not all
> > languages have the same accent ignoring or accent folding rules or
> > conventions.
>
> This function is really fast.
> Accent-insensitive search is a REAL need for almost all non-English languages.
> You have to call this function explicitly, like:
> select accents ('dali');
> accents
> ----------------------------------
> [dðÐ][aáÁàÀâÂäÄåÅãÃ]l[iíÍìÌîÎïÏ]
>
> So why not include it in the next release?

For the reason I cited above: it is too abstract an approach for many
languages and/or applications. For example, in Swedish a search for 'e'
should probably include 'é', since most users will not type that in
explicitly (it's not on the keyboard), but a search for 'a' should
normally not include 'å', since that is a completely separate letter (and
it is on the keyboard). Additionally, this particular implementation
seems to be specific to the ISO-8859-1 charset. I know a number of accented
letters that are much closer "siblings" to 'd' than 'ð' is.
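
To make the Swedish example concrete, here is a minimal Python sketch of
language-dependent accent folding. It is not part of PostgreSQL and not the
function proposed above; the names `fold` and `DISTINCT_LETTERS`, the language
codes, and the exception table are assumptions made up for illustration. It
strips combining marks via Unicode decomposition, except for letters that a
given language treats as separate letters.

```python
import unicodedata

# Hypothetical table: letters each language treats as distinct, so they
# must not be folded onto their base letter. "sv" holds å ä ö Å Ä Ö.
DISTINCT_LETTERS = {
    "sv": set("\u00e5\u00e4\u00f6\u00c5\u00c4\u00d6"),
    "generic": set(),
}


def fold(text, lang="generic"):
    """Strip accents from text, keeping letters that lang treats as distinct."""
    keep = DISTINCT_LETTERS.get(lang, set())
    out = []
    # Compose first so each accented letter is one code point we can look up.
    for ch in unicodedata.normalize("NFC", text):
        if ch in keep:
            out.append(ch)
            continue
        # Decompose into base letter plus combining marks, then drop the marks.
        decomposed = unicodedata.normalize("NFD", ch)
        out.append("".join(c for c in decomposed if not unicodedata.combining(c)))
    return "".join(out)


def matches(query, candidate, lang="generic"):
    """Accent-insensitive equality under the rules assumed for lang."""
    return fold(query, lang).lower() == fold(candidate, lang).lower()
```

With the assumed Swedish table, `matches("e", "é", "sv")` holds while
`matches("a", "å", "sv")` does not; with the generic rules both hold. Note
that 'ð' has no Unicode decomposition, so this sketch never folds it to 'd'.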

--
Peter Eisentraut peter_e(at)gmx(dot)net http://yi.org/peter-e/
