From: | Oleg Bartunov <oleg(at)sai(dot)msu(dot)su> |
---|---|
To: | Joerg Erdmenger <joe(at)woerd(dot)com> |
Cc: | pgsql-general(at)postgresql(dot)org |
Subject: | Re: tsearch: how to get a list of stopwords? |
Date: | 2003-08-28 13:01:34 |
Message-ID: | Pine.GSO.4.56.0308281659320.22257@ra.sai.msu.su |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-general |
On Thu, 28 Aug 2003, Joerg Erdmenger wrote:
> hi
>
> > > me again. How do I find the stopwords that tsearch uses in its standard
> > > configuration? I've looked at contrib/tsearch/dict/porter_english.dct and
> > > get a feeling it's somewhere in there but I can't decipher it. Any
> > > suggestions?
> >
> > You're right. They're encoded in engstoptree :)
> > I suggest you not bother with old tsearch and look to tsearch2 version
> > which is much improved both in performance and flexibility.
> > http://www.sai.msu.su/~megera/postgres/gist/tsearch/V2/
> >
> well, I would like but I've got to get it to work on a production server; I
> will try to get the admins to install it but I guess it will take some time -
> meanwhile - is there anyway to get to the list of stopwords so that I can
> build a filter for those as a temporary workaround?
tsearch2 could live with tsearch, so you may play with it.
I attached english.stop file from OpenFTS distribution. But I'm not 100% sure
it's the same as in portereng.c :)
>
> thanks
>
> Joerg
>
Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
Sternberg Astronomical Institute, Moscow University (Russia)
Internet: oleg(at)sai(dot)msu(dot)su, http://www.sai.msu.su/~megera/
phone: +007(095)939-16-83, +007(095)939-23-83
Attachment | Content-Type | Size |
---|---|---|
english.stop | text/plain | 588 bytes |
From | Date | Subject | |
---|---|---|---|
Next Message | Alex | 2003-08-28 13:10:38 | Re: Question Join/Subselect |
Previous Message | Bo Lorentsen | 2003-08-28 12:52:57 | Re: mysql's last_insert_id |