You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@spamassassin.apache.org by bu...@bugzilla.spamassassin.org on 2007/10/03 14:17:52 UTC
[Bug 4104] Several useful URI rules
http://issues.apache.org/SpamAssassin/show_bug.cgi?id=4104
------- Additional Comments From jm@jmason.org 2007-10-03 05:17 -------
Totally forgot about this! here are the results:
http://ruleqa.spamassassin.org/20070224-r511240-n/%2FURIBL#new
this is exactly the same code as URIBL_BLACK:
0.00000 44.7322 0.0801 0.998 0.95 0.00 T_URIBL_RHS_URIBL_BLACK
ditto for GREY:
0.00000 0.1759 0.0841 0.677 0.58 0.00 T_URIBL_RHS_URIBL_GREY
these all have too-poor hit-rates:
0.00000 0.9225 0.4043 0.695 0.64 0.00 T_URIBL_RHS_NOSTDMAIL
0.00000 1.0300 0.5404 0.656 0.62 0.00 T_URIBL_RHS_DSN
0.00000 1.1540 0.7365 0.610 0.60 0.00 T_URIBL_RHS_WHOIS
0.00000 1.9154 1.1888 0.617 0.59 0.00 T_URIBL_RHS_NOCOMPLAINTS
0.00000 0.4774 0.4523 0.513 0.56 0.00 T_URIBL_RHS_BOGUSMX
0.00000 2.1290 2.0774 0.506 0.55 0.00 T_URIBL_RHS_POST
0.00000 6.7159 6.1202 0.523 0.54 0.00 T_URIBL_RHS_TLD_WHOIS
0.00000 2.9068 3.2542 0.472 0.53 0.00 T_URIBL_RHS_ABUSE
Also we don't seem to have results for the SPEWS URIBL rules, but I'm afraid
we're unlikely to add them regardless, since it appears SPEWS is no longer
active (and was philosophically incompatible with our policies anyway).
However, these look promising:
0.00000 7.1859 0.0280 0.996 0.91 0.00 T_URIBL_RHS_AHBL
A 99.6% accuracy with 0.028% false positives. It overlaps with other
rules as follows:
overlap spam: 100% of WHOIS_GKGPROXY hits also hit T_URIBL_RHS_AHBL; 0% of
T_URIBL_RHS_AHBL hits also hit WHOIS_GKGPROXY
overlap spam: 100% of WHOIS_SECINFOSERV hits also hit T_URIBL_RHS_AHBL; 0% of
T_URIBL_RHS_AHBL hits also hit WHOIS_SECINFOSERV
overlap spam: 81% of WHOIS_UNLISTED hits also hit T_URIBL_RHS_AHBL; 0% of
T_URIBL_RHS_AHBL hits also hit WHOIS_UNLISTED
overlap spam: 80% of WHOIS_REGISTERFLY hits also hit T_URIBL_RHS_AHBL; 5% of
T_URIBL_RHS_AHBL hits also hit WHOIS_REGISTERFLY
overlap spam: 77% of WHOIS_SECUREWHOIS hits also hit T_URIBL_RHS_AHBL; 2% of
T_URIBL_RHS_AHBL hits also hit WHOIS_SECUREWHOIS
overlap spam: 60% of T_URIBL_XS_SURBL hits also hit T_URIBL_RHS_AHBL; 0% of
T_URIBL_RHS_AHBL hits also hit T_URIBL_XS_SURBL
overlap spam: 55% of WHOIS_PRIVPROT hits also hit T_URIBL_RHS_AHBL; 3% of
T_URIBL_RHS_AHBL hits also hit WHOIS_PRIVPROT
overlap spam: 50% of WHOIS_1AND1PR hits also hit T_URIBL_RHS_AHBL; 0% of
T_URIBL_RHS_AHBL hits also hit WHOIS_1AND1PR
overlap spam: 38% of SPF_HELO_FAIL hits also hit T_URIBL_RHS_AHBL; 1% of
T_URIBL_RHS_AHBL hits also hit SPF_HELO_FAIL
overlap spam: 36% of WHOIS_PRIVACYPOST hits also hit T_URIBL_RHS_AHBL; 0% of
T_URIBL_RHS_AHBL hits also hit WHOIS_PRIVACYPOST
overlap spam: 34% of DNS_FROM_AHBL_RHSBL hits also hit T_URIBL_RHS_AHBL; 3%
of T_URIBL_RHS_AHBL hits also hit DNS_FROM_AHBL_RHSBL
overlap spam: 26% of RCVD_IN_SBL hits also hit T_URIBL_RHS_AHBL; 5% of
T_URIBL_RHS_AHBL hits also hit RCVD_IN_SBL
overlap spam: 23% of URIBL_WS_SURBL hits also hit T_URIBL_RHS_AHBL; 97% of
T_URIBL_RHS_AHBL hits also hit URIBL_WS_SURBL
overlap spam: 21% of RCVD_IN_SORBS_MISC hits also hit T_URIBL_RHS_AHBL; 0% of
T_URIBL_RHS_AHBL hits also hit RCVD_IN_SORBS_MISC
overlap spam: 20% of RCVD_IN_WHOIS_INVALID hits also hit T_URIBL_RHS_AHBL; 0%
of T_URIBL_RHS_AHBL hits also hit RCVD_IN_WHOIS_INVALID
overlap ham: 100% of WHOIS_IDSHIELD hits also hit T_URIBL_RHS_AHBL; 0% of
T_URIBL_RHS_AHBL hits also hit WHOIS_IDSHIELD
overlap ham: 83% of URIBL_WS_SURBL hits also hit T_URIBL_RHS_AHBL; 10% of
T_URIBL_RHS_AHBL hits also hit URIBL_WS_SURBL
overlap ham: 81% of DNS_FROM_AHBL_RHSBL hits also hit T_URIBL_RHS_AHBL; 66%
of T_URIBL_RHS_AHBL hits also hit DNS_FROM_AHBL_RHSBL
overlap ham: 50% of URIBL_PH_SURBL hits also hit T_URIBL_RHS_AHBL; 0% of
T_URIBL_RHS_AHBL hits also hit URIBL_PH_SURBL
overlap ham: 42% of RAZOR2_CF_RANGE_E8_51_100 hits also hit
T_URIBL_RHS_AHBL; 10% of T_URIBL_RHS_AHBL hits also hit RAZOR2_CF_RANGE_E8_51_100
overlap ham: 42% of URIBL_JP_SURBL hits also hit T_URIBL_RHS_AHBL; 1% of
T_URIBL_RHS_AHBL hits also hit URIBL_JP_SURBL
Those are pretty good figures. I think this is a good candidate
for a new lookup. Added as bug 5667.
Also:
0.00000 0.1240 0.0000 1.000 0.58 0.00 T_URIBL_XS_SURBL
Definitely good.
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.