From: | Joshua Kramer <josh(at)bitbuckets(dot)com> |
---|---|
To: | pgsql-performance(at)postgresql(dot)org |
Subject: | TSearch2 vs. Apache Lucene |
Date: | 2005-12-06 16:47:44 |
Message-ID: | Pine.LNX.4.63.0512061143170.5133@localhost.localdomain |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-performance |
Greetings all,
I'm going to do a performance comparison with DocMgr and PG81/TSearch2 on
one end, and Apache Lucene on the other end.
In order to do this, I'm going to create a derivative of the
docmgr-autoimport script so that I can specify one file to import at a
time. I'll then create a Perl script which logs all details (such as
timing, etc.) as the test progresses.
As test data, I have approximately 9,000 text files from Project Gutenberg
ranging in size from a few hundred bytes to 4.5M.
I plan to test the speed of import of each file. Then, I plan to write a
web-robot in Perl that will test the speed and number of results returned.
Can anyone think of a validation of this test, or how I should configure
PG to maximise import and search speed? Can I maximise search speed and
import speed, or are those things mutually exclusive? (Note that this
will be run on limited hardware - 900MHz Athlon with 512M of ram)
Has anyone ever compared TSearch2 to Lucene, as far as performance is
concerned?
Thanks,
-Josh
From | Date | Subject | |
---|---|---|---|
Next Message | Tom Lane | 2005-12-06 16:57:19 | Re: Missed index opportunity for outer join? |
Previous Message | Pailloncy Jean-Gerard | 2005-12-06 16:19:34 | Re: 8.1 count(*) distinct: IndexScan/SeqScan |