You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@spamassassin.apache.org by bu...@bugzilla.spamassassin.org on 2004/02/18 03:03:30 UTC

[Bug 3055] Bayes: use hash instead of Message-Id?

http://bugzilla.spamassassin.org/show_bug.cgi?id=3055





------- Additional Comments From felicity@kluge.net  2004-02-17 18:03 -------
Subject: Re:  New: Bayes: use hash instead of Message-Id?

On Tue, Feb 17, 2004 at 04:18:30PM -0800, bugzilla-daemon@bugzilla.spamassassin.org wrote:
> What were the objections to using a hash of some selected headers (From, To,
> Subject) and the message body, again?  Strikes me this is a more resilient
> way to avoid spammers using 1 message ID for all their spam and evading
> bayes learning that way.

I think the issue was that as the mail gets passed through MTAs and such,
enough of the message could change such that the hash won't match anymore.

Doing a hash instead is trivial (if you don't mind ignoring the non-hash
entries)...  M::SA::Bayes::get_msgid() already does a hash if there is
no/no valid Message-Id header:

    # Use sha1(Date:, last received: and top N bytes of body)
    # where N is MIN(1024 bytes, 1/2 of body length)





------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.