| From: | Mark Rae <mrae(at)purplebat(dot)com> |
|---|---|
| To: | Sebastian Hennebrueder <usenet(at)laliluna(dot)de> |
| Cc: | postgres performance <pgsql-performance(at)postgresql(dot)org> |
| Subject: | Re: Looking for a large database for testing |
| Date: | 2005-08-16 08:39:10 |
| Message-ID: | 20050816083910.GA31460@purplebat.com |
| Views: | Whole Thread | Raw Message | Download mbox | Resend email |
| Thread: | |
| Lists: | pgsql-performance |
On Tue, Aug 16, 2005 at 09:29:32AM +0200, Sebastian Hennebrueder wrote:
> I would like to test the performance of my Java/PostgreSQL applications
> especially when making full text searches.
> For this I am looking for a database with 50 to 300 MB having text fields.
> e.g. A table with books with fields holding a comment, table of content
> or example chapters
> or what ever else.
You could try the OMIM database, which is currently 100M
It contains both journal references and large sections of
'plain' text. It also contains a large amount of technical
terms which will really test any kind of soundex matching
if you are using that.
http://www.ncbi.nlm.nih.gov/Omim/omimfaq.html#download
Unfortunately it only comes as a flat text file, but is
very easy to parse.
And if you start reading it, you'll probably learn quite
a lot of things you really didn't want to know!! :-D
-Mark
| From | Date | Subject | |
|---|---|---|---|
| Next Message | Oleg Bartunov | 2005-08-16 09:38:41 | Re: Looking for a large database for testing |
| Previous Message | Richard Huxton | 2005-08-16 08:23:58 | Re: database encoding with index search problem |