Posted to dev@spamassassin.apache.org by bu...@bugzilla.spamassassin.org on 2010/02/01 19:46:02 UTC

[Bug 6317] Enhancement: include sender text in the message body so body and uri tests can scan it

https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6317

Adam Katz <an...@khopis.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |antispam@khopis.com

--- Comment #1 from Adam Katz <an...@khopis.com> 2010-02-01 10:46:00 UTC ---
This stems from a list conversation archived at
http://old.nabble.com/forum/ViewPost.jtp?post=27384882&framed=y and my tests
were also mentioned in another thread from last week at
http://old.nabble.com/forum/ViewPost.jtp?post=27328212&framed=y

I'm not sure I agree with the full concept, though, and I think my remarks in
those threads may have been misread.

Bayesian rules already examine the From and Subject fields in addition to the
body, and they rightly mark the collected words with the field name.  For
example, "from:adam" is a word plucked by Bayes when it sees "Adam Katz" in the
From header; the colon is a forbidden character in standard word parsing, so
the header word cannot be confused with a body word.  This is not necessarily
the exact mechanism SA uses to delimit, but it is close.
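
To illustrate the idea of field-tagged words, here is a minimal Python sketch.
It is not SA's actual tokenizer; the function name tokenize and the "field:"
prefix format are assumptions made only for this example.

```python
import re

def tokenize(text, field=None):
    """Split text into lowercase word tokens.

    If a field name is given, each token is prefixed with "field:".
    The colon never appears in a plain word token, so header tokens
    and body tokens cannot be confused with each other.
    (Hypothetical helper, not SpamAssassin's real Bayes tokenizer.)
    """
    words = re.findall(r"[a-z0-9]+", text.lower())
    prefix = f"{field.lower()}:" if field else ""
    return [prefix + w for w in words]

# Tokens from a From header are tagged with the field name...
header_tokens = tokenize("Adam Katz", field="From")
# ...while tokens from the body are left untagged.
body_tokens = tokenize("Adam Katz wrote this")
```

Here "adam" in the body and "from:adam" in the header are learned as two
distinct words, which is the property described above.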

The topic that spurred this request was related to spamvertised websites that
appear in the From header rather than the body and thus are immune to SA's uri
detection.  Martin has abstracted this idea to all body tests, which may not be
as wise.

Furthermore, URI detection for the From header may be a frivolous exercise, as
my tests at http://ruleqa.spamassassin.org/?rule=/FROM_W&srcpath=khop seem to
indicate that *any* URI in this location is itself a strong indicator of
spam.  Further parsing is therefore unnecessary.
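
As a rough illustration of that kind of check, the Python sketch below flags a
From header whose display name contains something URI-like.  It is not the
rule from my tests; the pattern URI_LIKE, the function name from_name_has_uri,
and the choice to look only at the display name are all assumptions made for
this example.

```python
import re
from email.utils import parseaddr

# Assumed pattern: an http(s) scheme or a leading "www." followed by a
# dotted host name.  Deliberately loose; a real rule may differ.
URI_LIKE = re.compile(
    r"(?:https?://|www\.)[a-z0-9-]+(?:\.[a-z0-9-]+)+",
    re.IGNORECASE,
)

def from_name_has_uri(from_header):
    """Return True if the display name of a From header looks like it
    contains a URI.  Only the display name is checked (an assumption),
    since the address part always contains a domain anyway."""
    name, _addr = parseaddr(from_header)
    return bool(URI_LIKE.search(name))

spammy = from_name_has_uri('"Visit www.example.com" <x@example.net>')
normal = from_name_has_uri('Adam Katz <adam@example.org>')
```

The check is a simple yes/no on the presence of a URI; nothing is extracted
or looked up, in line with the point that further parsing is unnecessary.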

Publishing this rule with SA before legitimate mail starts putting URIs in the
From header might deter adoption of that practice.
