Re: Postgresql.org search engine.

From: Oleg Bartunov <oleg(at)sai(dot)msu(dot)su>
To: Dave Page <dpage(at)vale-housing(dot)co(dot)uk>
Cc: josh(at)agliodbs(dot)com, pgsql-www(at)postgresql(dot)org
Subject: Re: Postgresql.org search engine.
Date: 2004-01-30 19:51:37
Message-ID: Pine.GSO.4.58.0401302249450.19778@ra.sai.msu.su
Lists: pgsql-www

On Fri, 30 Jan 2004, Dave Page wrote:

>
>
> > -----Original Message-----
> > From: Oleg Bartunov [mailto:oleg(at)sai(dot)msu(dot)su]
> > Sent: 30 January 2004 19:06
> > To: Dave Page
> > Cc: josh(at)agliodbs(dot)com; pgsql-www(at)postgresql(dot)org
> > Subject: Re: [pgsql-www] Postgresql.org search engine.
> >
> >
> > Guys, there is a big difference between a semi-static index
> > (aspseek) and incremental indexing of incoming documents
> > (tsearch2). Our approach is to develop a fully automatic,
> > searchable mailing list archive with instant indexing. So,
> > for example, I see my postings on this subject
> > already in the database and *searchable*! I don't expect
> > aspseek's search engine at postgresql.org to have my recent
> > postings in its index.
>
> No it doesn't, but it probably could do with a little clever scripting
> to expire the right index pages before each run.
>
> In addition, one of the mods made in the version we are using is the
> addition of an XML feed to the indexer - John (the guy responsible for
> the port) is keen for me to use this for far more efficient indexing of
> the archives, however I have yet to do this mainly because it requires
> hacking mhonarc about to output the XML data.
>
> > OpenFTS has full access to the metadata of documents, so we could
> > limit the search range by date, by list, or by author, so a smart
> > user could get reasonable search performance (relevance is very
> > good, because it is based on proximity). So, different searches
> > for different purposes!
>
> We don't have those fields, but the XML feed was originally written for
> indexing data from online catalogues and has added fields like price.
> I'd be surprised if others couldn't be added as well.

This is what you need to look at to optimize search: limit the search
range by date period. The default search should cover something like
the last year's documents.
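
A minimal sketch of that default, purely to illustrate the idea. It is
not how OpenFTS, tsearch2 or aspseek do it: the Message fields and the
search() function are hypothetical names, and plain substring matching
stands in for a real full-text index. The point is only that the date
and list filters come from document metadata, and that the date window
defaults to roughly the last year unless the user asks for more.

```python
from dataclasses import dataclass
from datetime import date, timedelta
from typing import Iterable, List, Optional


@dataclass
class Message:
    # Hypothetical metadata fields, not the real archive schema.
    list_name: str
    author: str
    posted: date
    body: str


def search(
    messages: Iterable[Message],
    term: str,
    today: date,
    since: Optional[date] = None,
    list_name: Optional[str] = None,
) -> List[Message]:
    """Return matching messages, newest first, limited by metadata."""
    # Default window: about one year back when no start date is given.
    if since is None:
        since = today - timedelta(days=365)
    needle = term.lower()
    hits = [
        m
        for m in messages
        if m.posted >= since
        and (list_name is None or m.list_name == list_name)
        and needle in m.body.lower()  # stand-in for a full-text match
    ]
    return sorted(hits, key=lambda m: m.posted, reverse=True)
```

Passing an explicit since date widens the range for users who really
want to search the whole archive.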

>
> Regards, Dave.
>

Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
Sternberg Astronomical Institute, Moscow University (Russia)
Internet: oleg(at)sai(dot)msu(dot)su, http://www.sai.msu.su/~megera/
phone: +007(095)939-16-83, +007(095)939-23-83
