Re: tsearch: how to get a list of stopwords?

From: Oleg Bartunov <oleg(at)sai(dot)msu(dot)su>
To: Joerg Erdmenger <joe(at)woerd(dot)com>
Cc: pgsql-general(at)postgresql(dot)org
Subject: Re: tsearch: how to get a list of stopwords?
Date: 2003-08-28 13:01:34
Message-ID: Pine.GSO.4.56.0308281659320.22257@ra.sai.msu.su
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

On Thu, 28 Aug 2003, Joerg Erdmenger wrote:

> hi
>
> > > me again. How do I find the stopwords that tsearch uses in its standard
> > > configuration? I've looked at contrib/tsearch/dict/porter_english.dct and
> > > get a feeling it's somewhere in there but I can't decipher it. Any
> > > suggestions?
> >
> > You're right. They're encoded in engstoptree :)
> > I suggest you not bother with old tsearch and look to tsearch2 version
> > which is much improved both in performance and flexibility.
> > http://www.sai.msu.su/~megera/postgres/gist/tsearch/V2/
> >
> well, I would like but I've got to get it to work on a production server; I
> will try to get the admins to install it but I guess it will take some time -
> meanwhile - is there anyway to get to the list of stopwords so that I can
> build a filter for those as a temporary workaround?

tsearch2 could live with tsearch, so you may play with it.
I attached english.stop file from OpenFTS distribution. But I'm not 100% sure
it's the same as in portereng.c :)

>
> thanks
>
> Joerg
>

Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
Sternberg Astronomical Institute, Moscow University (Russia)
Internet: oleg(at)sai(dot)msu(dot)su, http://www.sai.msu.su/~megera/
phone: +007(095)939-16-83, +007(095)939-23-83

Attachment Content-Type Size
english.stop text/plain 588 bytes

In response to

Browse pgsql-general by date

  From Date Subject
Next Message Alex 2003-08-28 13:10:38 Re: Question Join/Subselect
Previous Message Bo Lorentsen 2003-08-28 12:52:57 Re: mysql's last_insert_id