Skip site navigation (1) Skip section navigation (2)

Re: [pgsql-advocacy] New PostgreSQL search resource

From: Oleg Bartunov <oleg(at)sai(dot)msu(dot)su>
To: "Marc G(dot) Fournier" <scrappy(at)postgresql(dot)org>
Cc: Robert Treat <xzilla(at)users(dot)sourceforge(dot)net>,pgsql-www(at)postgresql(dot)org, "Joshua D(dot) Drake" <jd(at)commandprompt(dot)com>
Subject: Re: [pgsql-advocacy] New PostgreSQL search resource
Date: 2004-01-17 04:57:36
Message-ID: Pine.GSO.4.58.0401170749130.3310@ra.sai.msu.su (view raw or flat)
Thread:
Lists: pgsql-advocacypgsql-generalpgsql-www
On Fri, 16 Jan 2004, Marc G. Fournier wrote:

> On Fri, 16 Jan 2004, Robert Treat wrote:
>
> > Ok, this is now the second site that has come to be in the last few days that
> > is using FTS and Tsearch for site search, and providing something that seems
> > a lot better than the search available on the main website... what is
> > stopping us from implementing this and dumping mnogosearch?  at least for the
> > main site if not for the archives?
>
> actually, Dave is working on an improved search ... but, Oleg just
> announced a crawler using tsearch that I'm going to take a look at
> implementing as well ...

I suggest to follow Roberts suggestion about indexing main site.
Archives is a bit another thing. I already wrote it needs to be
optimized for crawlers (headers).

Also, It would be much better to be able to index just content without
headers/footers etc. If you have access to files :), you may index
much faster without any crawler ! OpenFTS distribution contains
example scripts to index file collections. It's very very easy.

I quoting from "Survival Guide":

   APOD collection is consists of 1757 articles (about 7 Mb) and ideally suited
   for OpenFTS. Indexing tooks about 29 seconds on my IBM ThinkPad T21 notebook
   ( Linux, 2.4.17, 256 Mb RAM, 20 Gb IDE HD). Total number of lexems is
   131310, while the number of unique lexemes is only 8,806
  ( using Porter's stemmer ).


Official PostgreSQL documentation is about the same size.






>
>
> >
> > Robert Treat
> >
> > On Friday 16 January 2004 19:33, Joshua D. Drake wrote:
> > > Hello,
> > >
> > > Took an hour today and made the 7.3.4, 7.4.1 and Practical PostgreSQL
> > > documentation
> > > all searchable using OpenFTS and Tsearch2. You can take a look at:
> > >
> > > http://www.commandprompt.com/community/
> > >
> > > Sincerely,
> > >
> > > Joshua Drake
> >
> > --
> > Build A Brighter Lamp :: Linux Apache {middleware} PostgreSQL
> >
> > ---------------------------(end of broadcast)---------------------------
> > TIP 4: Don't 'kill -9' the postmaster
> >
>
> ----
> Marc G. Fournier           Hub.Org Networking Services (http://www.hub.org)
> Email: scrappy(at)hub(dot)org           Yahoo!: yscrappy              ICQ: 7615664
>
> ---------------------------(end of broadcast)---------------------------
> TIP 5: Have you checked our extensive FAQ?
>
>                http://www.postgresql.org/docs/faqs/FAQ.html
>

	Regards,
		Oleg
_____________________________________________________________
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
Sternberg Astronomical Institute, Moscow University (Russia)
Internet: oleg(at)sai(dot)msu(dot)su, http://www.sai.msu.su/~megera/
phone: +007(095)939-16-83, +007(095)939-23-83

In response to

pgsql-www by date

Next:From: Marc G. FournierDate: 2004-01-17 05:01:31
Subject: Re: Search-OpenFTS-Crawler released
Previous:From: Joshua D. DrakeDate: 2004-01-17 04:52:22
Subject: Re: [GENERAL] New PostgreSQL search resource

pgsql-advocacy by date

Next:From: scott.marloweDate: 2004-01-19 17:38:04
Subject: Re: New PostgreSQL search resource
Previous:From: Joshua D. DrakeDate: 2004-01-17 04:52:22
Subject: Re: [GENERAL] New PostgreSQL search resource

pgsql-general by date

Next:From: Tom LaneDate: 2004-01-17 05:05:20
Subject: Re: error creating sql function
Previous:From: Joshua D. DrakeDate: 2004-01-17 04:52:22
Subject: Re: [GENERAL] New PostgreSQL search resource

Privacy Policy | About PostgreSQL
Copyright © 1996-2014 The PostgreSQL Global Development Group