Posted to commits@spamassassin.apache.org by Apache Wiki <wi...@apache.org> on 2006/04/06 20:32:01 UTC

[Spamassassin Wiki] Update of "HitFrequencies" by DanielQuinlan

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Spamassassin Wiki" for change notification.

The following page has been changed by DanielQuinlan:
http://wiki.apache.org/spamassassin/HitFrequencies

The comment on the change is:
spam%/overall%

------------------------------------------------------------------------------
  
  A ''good'' rule has an extreme S/O (as near as possible to 1.0 or 0.0) and a high percentage of hits in the correct category.  For example, RCVD_IN_OPM_HTTP is a very good rule in the example above, because it hits 5.2028% of all spam mails without hitting any ham at all (no false positives).
  
- S/O stands for "spam / overall", in other words, the proportion of the total hits that were spam messages.  As such, it is equivalent to Bayesian probability, or Positive Predictive Value in bioinformatics or medicine.
+ S/O stands for "spam% / overall%", in other words, the proportion of the total hits that were spam messages.  As such, it is equivalent to Bayesian probability, or Positive Predictive Value in bioinformatics or medicine.
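
The ratio described here can be sketched in a few lines of Python. This is only an illustration, not the code SpamAssassin itself uses: the function name `s_o_ratio` is hypothetical, and the sketch assumes that "overall%" means the spam hit percentage plus the ham hit percentage, which normalizes away differences in corpus size.

```python
def s_o_ratio(spam_pct: float, ham_pct: float) -> float:
    """Return the share of a rule's hits that were spam.

    Assumption (not stated in the page): "overall%" is taken to be
    spam_pct + ham_pct, where each argument is the percentage of
    that corpus (spam or ham) hit by the rule.
    """
    overall_pct = spam_pct + ham_pct
    if overall_pct == 0:
        # A rule that hit nothing has no meaningful ratio.
        raise ValueError("rule hit no messages")
    return spam_pct / overall_pct


# RCVD_IN_OPM_HTTP from the page: 5.2028% of spam hit, no ham hit.
print(s_o_ratio(5.2028, 0.0))
```

Under that assumption, a rule that hits only spam scores 1.0, a rule that hits only ham scores 0.0, and a rule that hits both corpora at the same rate scores 0.5.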
  
  == Measuring Rule Overlap ==