Re: new full text search configurations

From: Emre Hasegeli <emre(at)hasegeli(dot)com>
To: obartunov(at)gmail(dot)com
Cc: Pgsql Hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: new full text search configurations
Date: 2015-11-21 09:26:04
Message-ID: CAE2gYzzFT-jxYv2gRaQC-MVNvwpmYLr8NCFxYwjKOojPLP1vLg@mail.gmail.com
Lists: pgsql-hackers

> I checked new snowball site http://snowballstem.org/ and found several new
> stemmers appeared (as external contributions):
>
> Irish and Czech
> Object Pascal codegenerator for Snowball
> Two stemmers for Romanian
> Hungarian
> Turkish
> Armenian
> Basque (Euskera)
> Catalan
>
> Some of them we don't have in our list of default configurations. Since
> these are external, not official stemmers, it'd be nice if people look and
> test them. If they are fine, we can prepare new configurations for 9.6.

We already have configurations for the ones included in Snowball itself,
namely Romanian, Hungarian, and Turkish. I don't know why the others are
listed on the page as external contributions rather than included. It
might be a good idea to wait for someone to commit them upstream.

I have checked the changes to the algorithms [1]. They don't seem to
have been updated much after 2007, but a new one for the Tamil language
was added recently. It might be a good candidate for a new
configuration.

[1] https://github.com/snowballstem/snowball/commits/master/algorithms
