You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@spamassassin.apache.org by Apache Wiki <wi...@apache.org> on 2006/04/06 20:53:43 UTC
[Spamassassin Wiki] Trivial Update of "HitFrequencies" by DanielQuinlan
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Spamassassin Wiki" for change notification.
The following page has been changed by DanielQuinlan:
http://wiki.apache.org/spamassassin/HitFrequencies
The comment on the change is:
edit conflict
------------------------------------------------------------------------------
A ''good'' rule has a very extreme S/O (near as possible to 1.0 or 0.0) and a high percentage of hits in the correct category. In other words, RCVD_IN_OPM_HTTP is a very good rule in the example above, because it hits 5.2028% of all spam mails without hitting any ham at all (no false positives).
-
- ---- /!\ '''Edit conflict - other version:''' ----
- S/O stands for "spam% / overall%", in other words, the proportion of the total hits that were spam messages. As such, it is equivalent to Bayesian probability, or Positive Predictive Value in bioinformatics or medicine.
-
- ---- /!\ '''Edit conflict - your version:''' ----
S/O stands for "spam / overall" for which the formula is "spam% / (ham% + spam%)", in other words, the proportion of the total hits that were spam messages. As such, it is equivalent to Bayesian probability, or Positive Predictive Value in bioinformatics or medicine.
-
- ---- /!\ '''End of edit conflict''' ----
== Measuring Rule Overlap ==