Re: [OT] Tom's/Marc's spam filters?

From: Joe Conway <mail(at)joeconway(dot)com>
To: "Marc G(dot) Fournier" <scrappy(at)postgresql(dot)org>
Cc: Will Trillich <will(at)serensoft(dot)com>, pgsql-general(at)postgresql(dot)org
Subject: Re: [OT] Tom's/Marc's spam filters?
Date: 2004-04-24 05:50:51
Message-ID: 408A003B.9050400@joeconway.com
Lists: pgsql-general

Marc G. Fournier wrote:
> On Mon, 19 Apr 2004, Joe Conway wrote:
>>Marc G. Fournier wrote:
>>>Huh? I just use Spamassassin myself, with Razor/Pyzor/DCC and Bayes all
>>>enabled ...
>>
>>I use exactly the same setup. But recently I've noticed that the
>>spammers are getting smarter -- I think 20% of it is slipping by the
>>filters. I'm going to need something better.
>
> do you force learn those spam that get through the cracks? I get about 20
> or 30 messages that slip through the cracks, which I process through with
> sa-learn nightly ...

Sorry to drag this OT thread on even longer, but it seems to be a topic
many are interested in ;-)

I wanted to report back that after just 2 days of forced (supervised)
learning, the Bayesian filter is now nailing about 99% of all spam.
*Many, many thanks* for the suggestion.

But I wonder why the autolearn feature is so conservative? At this point
I'm getting lots of stuff like this:

X-Spam-Status: Yes, hits=5.8 required=2.5 tests=BAYES_99,HTML_FONT_BIG,
    HTML_MESSAGE autolearn=no version=2.63
X-Spam-Report:
    * 0.1 HTML_MESSAGE BODY: HTML included in message
    * 0.3 HTML_FONT_BIG BODY: HTML has a big font
    * 5.4 BAYES_99 BODY: Bayesian spam probability is 99 to 100%
    *     [score: 1.0000]

Notice that, even though I get a hit on BAYES_99, I still get
autolearn=no. Ah well, I guess I should be asking that question of the
SpamAssassin guys. Also notice that this sucker would have gotten
through with a score of only 0.4 (5.8 minus the 5.4 from BAYES_99),
well under my required 2.5, had it not been for the Bayesian filter.
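
For anyone who wants to check the same arithmetic on their own mail, here
is a minimal Python sketch. It only parses the X-Spam-Report lines quoted
above and sums the rule scores with and without the BAYES_* rules. The
names (REPORT, RULE_LINE, parse_report, score_without_bayes) are made up
for illustration and are not part of SpamAssassin.

```python
import re

# Report lines copied from the header quoted in this message.
REPORT = """\
* 0.1 HTML_MESSAGE BODY: HTML included in message
* 0.3 HTML_FONT_BIG BODY: HTML has a big font
* 5.4 BAYES_99 BODY: Bayesian spam probability is 99 to 100%
* [score: 1.0000]
"""

# A rule line looks like "* <points> <RULE_NAME> <description>".
# Continuation lines such as "* [score: 1.0000]" do not match.
RULE_LINE = re.compile(r"^\*\s+(-?\d+(?:\.\d+)?)\s+([A-Z0-9_]+)\b")


def parse_report(text):
    """Return a dict mapping rule name to the points it contributed."""
    rules = {}
    for line in text.splitlines():
        match = RULE_LINE.match(line.strip())
        if match:
            rules[match.group(2)] = float(match.group(1))
    return rules


def score_without_bayes(rules):
    """Sum the points of every rule except the BAYES_* ones."""
    total = sum(pts for name, pts in rules.items()
                if not name.startswith("BAYES_"))
    return round(total, 1)


if __name__ == "__main__":
    rules = parse_report(REPORT)
    print("total:", round(sum(rules.values()), 1))
    print("without Bayes:", score_without_bayes(rules))
```

As far as I understand it, the autolearn decision does not count the
Bayes rules themselves, so the non-Bayes figure is probably the one that
matters there, but that is my reading and not something I have confirmed
with the SpamAssassin folks.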

Again, thanks.

Joe
