Re: Performance problems testing with Spamassassin 3.1.0

From: Andrew McMillan <andrew(at)catalyst(dot)net(dot)nz>
To: Matthew Schumacher <matt(dot)s(at)aptalaska(dot)net>
Cc: pgsql-performance(at)postgresql(dot)org
Subject: Re: Performance problems testing with Spamassassin 3.1.0
Date: 2005-07-29 05:50:11
Message-ID: 1122616211.5691.340.camel@lamb.mcmillan.net.nz
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-performance

On Thu, 2005-07-28 at 16:13 -0800, Matthew Schumacher wrote:
>
> Ok, I finally got some test data together so that others can test
> without installing SA.
>
> The schema and test dataset is over at
> http://www.aptalaska.net/~matt.s/bayes/bayesBenchmark.tar.gz
>
> I have a pretty fast machine with a tuned postgres and it takes it about
> 2 minutes 30 seconds to load the test data. Since the test data is the
> bayes information on 616 spam messages than comes out to be about 250ms
> per message. While that is doable, it does add quite a bit of overhead
> to the email system.

On my laptop this takes:

real 1m33.758s
user 0m4.285s
sys 0m1.181s

One interesting effect is the data in bayes_vars has a huge number of
updates and needs vacuum _frequently_. After the run a vacuum full
compacts it down from 461 pages to 1 page.

Regards,
Andrew.

-------------------------------------------------------------------------
Andrew @ Catalyst .Net .NZ Ltd, PO Box 11-053, Manners St, Wellington
WEB: http://catalyst.net.nz/ PHYS: Level 2, 150-154 Willis St
DDI: +64(4)803-2201 MOB: +64(272)DEBIAN OFFICE: +64(4)499-2267
I don't do it for the money.
-- Donald Trump, Art of the Deal

-------------------------------------------------------------------------

In response to

Responses

Browse pgsql-performance by date

  From Date Subject
Next Message Gavin Sherry 2005-07-29 05:58:24 Re: Performance problems testing with Spamassassin 3.1.0
Previous Message Chris Travers 2005-07-29 05:23:15 Re: Left joining against two empty tables makes a query