You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@spamassassin.apache.org by Apache Wiki <wi...@apache.org> on 2006/04/06 20:53:43 UTC

[Spamassassin Wiki] Trivial Update of "HitFrequencies" by DanielQuinlan

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Spamassassin Wiki" for change notification.

The following page has been changed by DanielQuinlan:
http://wiki.apache.org/spamassassin/HitFrequencies

The comment on the change is:
edit conflict

------------------------------------------------------------------------------
  
  A ''good'' rule has a very extreme S/O (near as possible to 1.0 or 0.0) and a high percentage of hits in the correct category.  In other words,  RCVD_IN_OPM_HTTP is a very good rule in the example above, because it hits 5.2028% of all spam mails without hitting any ham at all (no false positives).
  
- 
- ---- /!\ '''Edit conflict - other version:''' ----
- S/O stands for "spam% / overall%", in other words, the proportion of the total hits that were spam messages.  As such, it is equivalent to Bayesian probability, or Positive Predictive Value in bioinformatics or medicine.
- 
- ---- /!\ '''Edit conflict - your version:''' ----
  S/O stands for "spam / overall" for which the formula is "spam% / (ham% + spam%)", in other words, the proportion of the total hits that were spam messages.  As such, it is equivalent to Bayesian probability, or Positive Predictive Value in bioinformatics or medicine.
- 
- ---- /!\ '''End of edit conflict''' ----
  
  == Measuring Rule Overlap ==