hunspell and tsearch2 ?

From: Dirk Lutzebäck <dirk(dot)lutzebaeck(at)thinkproject(dot)com>
To: pgsql-hackers(at)postgresql(dot)org
Subject: hunspell and tsearch2 ?
Date: 2012-08-27 12:31:15
Message-ID: 503B6893.4020103@thinkproject.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

Hi,

we have issues with compound words in tsearch2 using the german (ispell)
dictionary. This has been discussed before but there is no real solution
using the recommended german dictionary at
http://www.sai.msu.su/~megera/postgres/gist/tsearch/V2 (convert old
openoffice dict file to ispell suitable for tsearch):

# select ts_lexize('german_ispell', 'vollklimatisiert');
ts_lexize
--------------------
{vollklimatisiert}
(1 row)

This should return atleast

{vollklimatisiert, voll, klimatisiert}

The issue with compound words in ispell has been addressed in hunspell.
But this has not been integrated fully to tsearch2 (according to the
documentation).

Are there any plans to fully integrate hunspell into tsearch2? What is
needed to do this? What is the functional delta which is missing? Maybe
we can help...

Thanks for help

Dirk

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Heikki Linnakangas 2012-08-27 13:00:55 Re: Statistics and selectivity estimation for ranges
Previous Message Heikki Linnakangas 2012-08-27 12:28:19 Re: [WIP] Performance Improvement by reducing WAL for Update Operation