Re: [OT] Tom's/Marc's spam filters?

From: Joe Conway <mail(at)joeconway(dot)com>
To: "Marc G(dot) Fournier" <scrappy(at)postgresql(dot)org>
Cc: Michael Chaney <mdchaney(at)michaelchaney(dot)com>, pgsql-general(at)postgresql(dot)org
Subject: Re: [OT] Tom's/Marc's spam filters?
Date: 2004-04-21 20:07:55
Message-ID: 4086D49B.3060403@joeconway.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

Marc G. Fournier wrote:
> On Wed, 21 Apr 2004, Joe Conway wrote:
>> /usr/bin/sa-learn --mbox --spam /path/to/false-neg.mbox
>>
>>Now I just drop all false negatives into that mailbox, and clean them
>>out periodically. Hopefully that will make a significant improvement.
>
> This, for me, has made the big difference, since the false-negatives don't
> get autolearned :(

Actually, even much of what does (correctly) get marked as spam, ends up
with autolearn=no, because it seems SpamAssassin is somewhat
conservative with autolearning. I just sent this off list to Michael Chaney:
---------------------------------------------------------------------

I've noticed that the threshold for autolearn seems too high, i.e. a
high proportion of email correctly marked as spam, has autolearn=no.
Here's an example:

X-Spam-Status: Yes, hits=3.7 required=2.5
tests=BAYES_44,HTML_FONT_INVISIBLE, HTML_IMAGE_ONLY_04,
HTML_MESSAGE,MIME_HTML_NO_CHARSET,MIME_HTML_ONLY,
MIME_HTML_ONLY_MULTI autolearn=no version=2.63

Now in /etc/mail/spamassassin/local.cf I have this setting:

# Enable Bayes auto-learning
auto_learn 1
bayes_auto_learn_threshold_spam 6

From the SA docs, I get the impression that autolearn cannot be made
more aggressive.

So in order to counteract that, I just made an additional change -- I
put in a mail filter rule that automatically forwards any mail marked as
spam, but with autolearn=no, to false-neg.mbox. This should help too, I
think.

Joe

In response to

Browse pgsql-general by date

  From Date Subject
Next Message Philipp Buehler 2004-04-21 20:09:25 Re: 7.3.4 on Linux: UPDATE .. foo=foo+1 degrades massivly over time
Previous Message scott.marlowe 2004-04-21 19:59:36 Re: kill -2