You are viewing a plain text version of this content. The canonical link for it is here.
Posted to sysadmins@spamassassin.apache.org by Merijn van den Kroonenberg <me...@web2all.nl> on 2017/11/15 12:27:51 UTC

comparing 72_scores.cf files

I wanted to compare the 72_scores.cf file against the one from yesterday
and the one from march. However the svn diff is not very readable and its
very hard to see which rules were added or removed. I also wanted to
compare the ranges.data file which also contains rules and score ranges.

I patched together a small script for comparing score files.

See attchement for the (perl) script.

It produces output like this (comparing march against last night):

./compare-rulefiles 72_scores_20170315.cf 72_scores-1815282.cf

Only in 1 (removed in 2)
AC_HTML_NONSENSE_TAGS
AC_SPAMMY_URI_PATTERNS1
AC_SPAMMY_URI_PATTERNS10
AC_SPAMMY_URI_PATTERNS11
AC_SPAMMY_URI_PATTERNS12
AC_SPAMMY_URI_PATTERNS2
AC_SPAMMY_URI_PATTERNS3
AC_SPAMMY_URI_PATTERNS4
AC_SPAMMY_URI_PATTERNS8
AC_SPAMMY_URI_PATTERNS9
ADVANCE_FEE_2_NEW_FORM
ADVANCE_FEE_4_NEW
ADVANCE_FEE_5_NEW
AXB_XMAILER_MIMEOLE_OL_1ECD5
AXB_XM_FORGED_OL2600
BODY_EMPTY
CANT_SEE_AD
CN_B2B_SPAMMER
COMMENT_GIBBERISH
ENCRYPTED_MESSAGE
FBI_MONEY
FBI_SPOOF
FORM_LOW_CONTRAST
FOUND_YOU
FREEMAIL_DOC_PDF_BCC
FROM_WORDY_SHORT
FSL_HELO_BARE_IP_2
GOOGLE_DOCS_PHISH
GOOGLE_DOCS_PHISH_MANY
GOOG_MALWARE_DNLD
HDRS_LCASE
HEXHASH_WORD
HK_SCAM_N15
HTML_OFF_PAGE
LIST_PRTL_PUMPDUMP
LIST_PRTL_SAME_USER
LOTTO_AGENT
LOTTO_DEPT
LUCRATIVE
MONEY_LOTTERY
MSGID_NOFQDN1
MSM_PRIO_REPTO
PHP_NOVER_MUA
PHP_ORIG_SCRIPT
PHP_SCRIPT_MUA
PP_TOO_MUCH_UNICODE02
PP_TOO_MUCH_UNICODE05
PUMPDUMP
PUMPDUMP_MULTI
RAND_HEADER_MANY
RP_MATCHES_RCVD
SHARE_50_50
SPOOFED_FREEM_REPTO_CHN
STOCK_LOW_CONTRAST
STOCK_TIP
SYSADMIN
TO_NO_BRKTS_FROM_MSSP
TW_GIBBERISH_MANY
UC_GIBBERISH_OBFU
URI_DATA
URI_GOOGLE_PROXY
URI_OPTOUT_3LD
URI_PHISH
XPRIO_SHORT_SUBJ

Only in 2 (added in 2)
ADVANCE_FEE_4_NEW_MONEY
ADVANCE_FEE_5_NEW_FRM_MNY
ADVANCE_FEE_5_NEW_MONEY
APOSTROPHE_TOCC
AXB_X_AOL_SEZ_S
DEAR_BENEFICIARY
FROM_MISSP_DYNIP
FROM_MISSP_EH_MATCH
FSL_HELO_FAKE
FSL_MIME_NO_TEXT
FUZZY_UNSUBSCRIBE
HDRS_MISSP
MANY_PILL_PRICE
MILLION_USD
MONEY_ATM_CARD
MONEY_FORM_SHORT
MONEY_FROM_41
SERGIO_SUBJECT_VIAGRA01
SHORTENED_URL_SRC
SINGLETS_LOW_CONTRAST
TO_NO_BRKTS_DYNIP

Changed
AC_BR_BONANZA
  0.001 0.001 0.001 0.001
  0.001 0.967 0.001 0.967
AC_DIV_BONANZA
  0.001 0.001 0.001 0.001
  0.909 0.001 0.909 0.001
ADVANCE_FEE_2_NEW_MONEY
  1.997 0.001 1.997 0.001
  0.001 0.039 0.001 0.039
ADVANCE_FEE_3_NEW
  3.496 0.001 3.496 0.001
  2.099 1.697 2.099 1.697
ADVANCE_FEE_3_NEW_MONEY
  2.796 0.001 2.796 0.001
  3.100 2.696 3.100 2.696
AXB_XMAILER_MIMEOLE_OL_024C2
  0.367 0.001 0.367 0.001
  0.001 2.822 0.001 2.822
BODY_URI_ONLY
  0.998 0.001 0.998 0.001
  3.500 3.595 3.500 3.595
BOGUS_MSM_HDRS
  0.909 0.001 0.909 0.001
  1.912 0.677 1.912 0.677
CK_HELO_DYNAMIC_SPLIT_IP
  1.350 0.001 1.350 0.001
  1.500 0.001 1.500 0.001
CK_HELO_GENERIC
  0.249 0.249 0.249 0.249
  2.195 1.350 2.195 1.350
DATE_IN_FUTURE_96_Q
  3.296 3.299 3.296 3.299
  2.999 2.696 2.999 2.696
FILL_THIS_FORM
  2.748 0.001 2.748 0.001
  0.200 1.402 0.200 1.402
FORM_FRAUD
  0.998 0.001 0.998 0.001
  3.099 1.841 3.099 1.841
FORM_FRAUD_3
  2.696 0.001 2.696 0.001
  2.899 0.383 2.899 0.383
FORM_FRAUD_5
  0.209 0.001 0.209 0.001
  1.531 1.802 1.531 1.802
FREEMAIL_FORGED_FROMDOMAIN
  0.001 0.199 0.001 0.199
  0.001 0.001 0.001 0.001
FROM_IN_TO_AND_SUBJ
  0.287 0.262 0.287 0.262
  1.001 0.001 1.001 0.001
FROM_MISSP_FREEMAIL
  3.595 0.001 3.595 0.001
  2.965 3.393 2.965 3.393
FROM_MISSP_MSFT
  0.001 0.001 0.001 0.001
  0.001 3.129 0.001 3.129
FROM_MISSP_REPLYTO
  0.001 0.001 0.001 0.001
  3.799 0.001 3.799 0.001
FROM_MISSP_SPF_FAIL
  0.001 1.000 0.001 1.000
  0.001 0.001 0.001 0.001
FROM_MISSP_TO_UNDISC
  1.438 0.001 1.438 0.001
  1.102 0.275 1.102 0.275
FROM_MISSP_USER
  0.001 0.001 0.001 0.001
  0.001 1.416 0.001 1.416
FROM_MISSP_XPRIO
  0.001 0.001 0.001 0.001
  0.001 3.595 0.001 3.595
FROM_WORDY
  2.497 0.001 2.497 0.001
  3.299 2.511 3.299 2.511
FSL_CTYPE_WIN1251
  0.001 0.001 0.001 0.001
  0.001 0.588 0.001 0.588
FSL_NEW_HELO_USER
  0.083 0.001 0.083 0.001
  0.001 0.755 0.001 0.755
HELO_MISC_IP
  0.248 0.250 0.248 0.250
  2.600 1.357 2.600 1.357
HK_RANDOM_FROM
  0.998 0.001 0.998 0.001
  1.501 2.664 1.501 2.664
HK_SCAM_N2
  3.249 0.001 3.249 0.001
  3.099 2.696 3.099 2.696
IMG_DIRECT_TO_MX
  2.397 2.400 2.397 2.400
  3.599 1.743 3.599 1.743
LONG_HEX_URI
  2.194 2.290 2.194 2.290
  0.399 0.884 0.399 0.884
LONG_IMG_URI
  0.553 0.100 0.553 0.100
  0.001 0.001 0.001 0.001
LOTS_OF_MONEY
  0.001 0.001 0.001 0.001
  0.001 0.006 0.001 0.006
MIMEOLE_DIRECT_TO_MX
  1.445 0.381 1.445 0.381
  2.399 0.737 2.399 0.737
MIME_NO_TEXT
  1.000 1.000 1.000 1.000
  3.505 2.941 3.505 2.941
MONEY_FRAUD_3
  2.896 0.001 2.896 0.001
  3.100 0.243 3.100 0.243
MONEY_FRAUD_5
  3.096 0.001 3.096 0.001
  3.400 2.896 3.400 2.896
MONEY_FRAUD_8
  2.548 0.001 2.548 0.001
  0.364 3.199 0.364 3.199
NSL_RCVD_FROM_USER
  0.548 0.001 0.548 0.001
  0.001 0.159 0.001 0.159
NSL_RCVD_HELO_USER
  1.273 0.001 1.273 0.001
  2.599 0.970 2.599 0.970
PP_MIME_FAKE_ASCII_TEXT
  0.429 0.001 0.429 0.001
  0.899 0.001 0.899 0.001
RCVD_IN_MSPIKE_H2
  0.001 -2.800 0.001 -2.800
  0.001 -1.271 0.001 -1.271
RCVD_IN_MSPIKE_L3
  0.001 0.001 0.001 0.001
  0.001 1.282 0.001 1.282
RCVD_IN_MSPIKE_L4
  0.001 0.001 0.001 0.001
  0.001 0.147 0.001 0.147
RCVD_IN_MSPIKE_L5
  0.001 0.001 0.001 0.001
  0.001 2.283 0.001 2.283
RCVD_IN_MSPIKE_ZBI
  0.001 0.001 0.001 0.001
  0.001 3.496 0.001 3.496
SPOOFED_FREEM_REPTO
  2.498 1.368 2.498 1.368
  3.899 3.396 3.899 3.396
STATIC_XPRIO_OLE
  1.997 0.001 1.997 0.001
  1.668 0.277 1.668 0.277
STYLE_GIBBERISH
  2.800 3.093 2.800 3.093
  3.499 2.554 3.499 2.554
THIS_AD
  0.596 2.200 0.596 2.200
  0.001 0.001 0.001 0.001
TO_EQ_FM_DIRECT_MX
  2.497 0.622 2.497 0.622
  2.799 2.997 2.799 2.997
TO_IN_SUBJ
  0.099 0.099 0.099 0.099
  2.099 0.001 2.099 0.001
TO_NO_BRKTS_HTML_IMG
  0.001 2.000 0.001 2.000
  0.001 0.799 0.001 0.799
TO_NO_BRKTS_HTML_ONLY
  1.997 0.001 1.997 0.001
  0.001 0.001 0.001 0.001
TO_NO_BRKTS_MSFT
  2.497 0.001 2.497 0.001
  0.602 3.595 0.602 3.595
TO_NO_BRKTS_NORDNS_HTML
  0.398 0.001 0.398 0.001
  0.001 0.339 0.001 0.339
TO_NO_BRKTS_PCNT
  2.497 0.001 2.497 0.001
  2.699 2.196 2.699 2.196
TVD_SPACE_ENCODED
  2.497 0.001 2.497 0.001
  3.599 3.096 3.599 3.096
TVD_SPACE_ENC_FM_MIME
  1.997 0.001 1.997 0.001
  3.499 2.996 3.499 2.996
TVD_SPACE_RATIO_MINFP
  2.497 0.001 2.497 0.001
  4.399 4.395 4.399 4.395
URI_ONLY_MSGID_MALF
  0.001 1.191 0.001 1.191
  2.800 2.880 2.800 2.880
URI_TRY_3LD
  0.195 0.001 0.195 0.001
  0.001 0.227 0.001 0.227
URI_TRY_USME
  0.001 0.001 0.001 0.001
  3.299 2.896 3.299 2.896
URI_WPADMIN
  3.396 3.014 3.396 3.014
  2.899 2.498 2.899 2.498
URI_WP_DIRINDEX
  1.000 1.000 1.000 1.000
  4.399 3.995 4.399 3.995
URI_WP_HACKED
  2.996 3.000 2.996 3.000
  4.099 3.146 4.099 3.146
URI_WP_HACKED_2
  1.187 1.764 1.187 1.764
  1.479 3.412 1.479 3.412
XPRIO
  2.248 2.249 2.248 2.249
  0.367 0.140 0.367 0.140


Re: comparing 72_scores.cf files

Posted by Merijn van den Kroonenberg <me...@web2all.nl>.
> First to answer Kevin, the script is mainly for debugging and reviewing.
>
> I think it should be possible to do something like
>
> svn diff -r {2017-11-15} --diff-cmd /path/to/compare-rulefiles
> http://svn.apache.org/repos/asf/spamassassin/trunk/rulesrc/scores/72_scores.cf
>
> to get the changes to 72_scores.cf from last night. (cannot test right
> now because my linux svn here has no http support)

Actually that didn't work because svn adds a couple diff specific options
and arguments.

With a small wrapper shell script I can strip away those and then it works:

svn diff -r {`date --iso-8601=date`} --diff-cmd
'/path/to/compare-rulefiles-diffwrapper'
http://svn.apache.org/repos/asf/spamassassin/trunk/rulesrc/scores/72_scores.cf

If you runn this command after the rule generation you will get the rule
diff compared to the previous version. So you can easily see whats
changing each day. Could add it to a status mail or something.

Same thing could be done for the active.list to see which rules were
added/removed during rule promotion. (unfortunately we would need to
change the wrapper script for that to add the -r 0 option which is needed
for active.list)

See attachment for wrapper shell script, to be put in the same dir as the
compare-rulefiles perl script.

Cheers,
Merijn

Re: comparing 72_scores.cf files

Posted by Merijn van den Kroonenberg <me...@web2all.nl>.
First to answer Kevin, the script is mainly for debugging and reviewing.

I think it should be possible to do something like

svn diff -r {2017-11-15} --diff-cmd /path/to/compare-rulefiles 
http://svn.apache.org/repos/asf/spamassassin/trunk/rulesrc/scores/72_scores.cf

to get the changes to 72_scores.cf from last night. (cannot test right 
now because my linux svn here has no http support)

On 2017-11-15 13:58, Dave Jones wrote:
> Nice script.  Very useful.  Now it would be good to extend it to check
> the rules still exist.  For example, I wrote a quick bash on-liner to
> find the rules no longer exist in the main SA rules so we shouldn't
> expect these 6 to be in the new 72_scores.cf:

I added an update of the script which has the --deletedonly and 
--addedonly options to output only added or removed rules.

i think you might want to compare against active.list?

> 
> AXB_XMAILER_MIMEOLE_OL_1ECD5
> AXB_XM_FORGED_OL2600
> FSL_HELO_BARE_IP_2
> HK_SCAM_N15
> MONEY_LOTTERY
> MSGID_NOFQDN1

./compare-rulefiles -d 72_scores_20170315.cf 72_scores-1815282.cf > 
deleted_rules.txt
./compare-rulefiles -r 0 -d deleted_rules.txt rules/active.list > 
deactivated_rules.txt
./compare-rulefiles -r 0 -a deactivated_rules.txt deleted_rules.txt > 
disappeared_rules.txt

where deactivated_rules.txt:
AXB_XMAILER_MIMEOLE_OL_1ECD5
AXB_XM_FORGED_OL2600
FREEMAIL_DOC_PDF_BCC
FSL_HELO_BARE_IP_2
HDRS_LCASE
HK_SCAM_N15
LOTTO_AGENT
LOTTO_DEPT
MONEY_LOTTERY
MSGID_NOFQDN1
RP_MATCHES_RCVD
SHARE_50_50

and disappeared_rules.txt:
AC_HTML_NONSENSE_TAGS
AC_SPAMMY_URI_PATTERNS1
AC_SPAMMY_URI_PATTERNS10
AC_SPAMMY_URI_PATTERNS11
AC_SPAMMY_URI_PATTERNS12
AC_SPAMMY_URI_PATTERNS2
AC_SPAMMY_URI_PATTERNS3
AC_SPAMMY_URI_PATTERNS4
AC_SPAMMY_URI_PATTERNS8
AC_SPAMMY_URI_PATTERNS9
ADVANCE_FEE_2_NEW_FORM
ADVANCE_FEE_4_NEW
ADVANCE_FEE_5_NEW
AXB_XMAILER_MIMEOLE_OL_1ECD5
AXB_XM_FORGED_OL2600
BODY_EMPTY
CANT_SEE_AD
CN_B2B_SPAMMER
COMMENT_GIBBERISH
ENCRYPTED_MESSAGE
FBI_MONEY
FBI_SPOOF
FORM_LOW_CONTRAST
FOUND_YOU
FREEMAIL_DOC_PDF_BCC
FROM_WORDY_SHORT
FSL_HELO_BARE_IP_2
GOOGLE_DOCS_PHISH
GOOGLE_DOCS_PHISH_MANY
GOOG_MALWARE_DNLD
HDRS_LCASE
HEXHASH_WORD
HK_SCAM_N15
HTML_OFF_PAGE
LIST_PRTL_PUMPDUMP
LIST_PRTL_SAME_USER
LOTTO_AGENT
LOTTO_DEPT
LUCRATIVE
MONEY_LOTTERY
MSGID_NOFQDN1
MSM_PRIO_REPTO
PHP_NOVER_MUA
PHP_ORIG_SCRIPT
PHP_SCRIPT_MUA
PP_TOO_MUCH_UNICODE02
PP_TOO_MUCH_UNICODE05
PUMPDUMP
PUMPDUMP_MULTI
RAND_HEADER_MANY
RP_MATCHES_RCVD
SHARE_50_50
SPOOFED_FREEM_REPTO_CHN
STOCK_LOW_CONTRAST
STOCK_TIP
SYSADMIN
TO_NO_BRKTS_FROM_MSSP
TW_GIBBERISH_MANY
UC_GIBBERISH_OBFU
URI_DATA
URI_GOOGLE_PROXY
URI_OPTOUT_3LD
URI_PHISH
XPRIO_SHORT_SUBJ

So these are basically all rules which were in march 72_scores.cf but 
not in our current one, minus all rules which are no longer in 
active.list.

> 
> Next it would be interesting to see if these rest of the "only in 1
> (removed in 2)" are in other current rule files and if they have an
> uncommented score.  Ones without a score or a commented score will get
> the default score of 1.0.  Otherwise, we may not be too far off once
> we can get the lock-scores commited today.

yeah really curious about that

> 
> Dave
> 
> On 11/15/2017 06:27 AM, Merijn van den Kroonenberg wrote:
>> I wanted to compare the 72_scores.cf file against the one from 
>> yesterday
>> and the one from march. However the svn diff is not very readable and 
>> its
>> very hard to see which rules were added or removed. I also wanted to
>> compare the ranges.data file which also contains rules and score 
>> ranges.
>> 
>> I patched together a small script for comparing score files.
>> 
>> See attchement for the (perl) script.
>> 
>> It produces output like this (comparing march against last night):
>> 
>> ./compare-rulefiles 72_scores_20170315.cf 72_scores-1815282.cf
>> 
>> Only in 1 (removed in 2)
>> AC_HTML_NONSENSE_TAGS
>> AC_SPAMMY_URI_PATTERNS1
>> AC_SPAMMY_URI_PATTERNS10
>> AC_SPAMMY_URI_PATTERNS11
>> AC_SPAMMY_URI_PATTERNS12
>> AC_SPAMMY_URI_PATTERNS2
>> AC_SPAMMY_URI_PATTERNS3
>> AC_SPAMMY_URI_PATTERNS4
>> AC_SPAMMY_URI_PATTERNS8
>> AC_SPAMMY_URI_PATTERNS9
>> ADVANCE_FEE_2_NEW_FORM
>> ADVANCE_FEE_4_NEW
>> ADVANCE_FEE_5_NEW
>> AXB_XMAILER_MIMEOLE_OL_1ECD5
>> AXB_XM_FORGED_OL2600
>> BODY_EMPTY
>> CANT_SEE_AD
>> CN_B2B_SPAMMER
>> COMMENT_GIBBERISH
>> ENCRYPTED_MESSAGE
>> FBI_MONEY
>> FBI_SPOOF
>> FORM_LOW_CONTRAST
>> FOUND_YOU
>> FREEMAIL_DOC_PDF_BCC
>> FROM_WORDY_SHORT
>> FSL_HELO_BARE_IP_2
>> GOOGLE_DOCS_PHISH
>> GOOGLE_DOCS_PHISH_MANY
>> GOOG_MALWARE_DNLD
>> HDRS_LCASE
>> HEXHASH_WORD
>> HK_SCAM_N15
>> HTML_OFF_PAGE
>> LIST_PRTL_PUMPDUMP
>> LIST_PRTL_SAME_USER
>> LOTTO_AGENT
>> LOTTO_DEPT
>> LUCRATIVE
>> MONEY_LOTTERY
>> MSGID_NOFQDN1
>> MSM_PRIO_REPTO
>> PHP_NOVER_MUA
>> PHP_ORIG_SCRIPT
>> PHP_SCRIPT_MUA
>> PP_TOO_MUCH_UNICODE02
>> PP_TOO_MUCH_UNICODE05
>> PUMPDUMP
>> PUMPDUMP_MULTI
>> RAND_HEADER_MANY
>> RP_MATCHES_RCVD
>> SHARE_50_50
>> SPOOFED_FREEM_REPTO_CHN
>> STOCK_LOW_CONTRAST
>> STOCK_TIP
>> SYSADMIN
>> TO_NO_BRKTS_FROM_MSSP
>> TW_GIBBERISH_MANY
>> UC_GIBBERISH_OBFU
>> URI_DATA
>> URI_GOOGLE_PROXY
>> URI_OPTOUT_3LD
>> URI_PHISH
>> XPRIO_SHORT_SUBJ
>> 
>> Only in 2 (added in 2)
>> ADVANCE_FEE_4_NEW_MONEY
>> ADVANCE_FEE_5_NEW_FRM_MNY
>> ADVANCE_FEE_5_NEW_MONEY
>> APOSTROPHE_TOCC
>> AXB_X_AOL_SEZ_S
>> DEAR_BENEFICIARY
>> FROM_MISSP_DYNIP
>> FROM_MISSP_EH_MATCH
>> FSL_HELO_FAKE
>> FSL_MIME_NO_TEXT
>> FUZZY_UNSUBSCRIBE
>> HDRS_MISSP
>> MANY_PILL_PRICE
>> MILLION_USD
>> MONEY_ATM_CARD
>> MONEY_FORM_SHORT
>> MONEY_FROM_41
>> SERGIO_SUBJECT_VIAGRA01
>> SHORTENED_URL_SRC
>> SINGLETS_LOW_CONTRAST
>> TO_NO_BRKTS_DYNIP
>> 
>> Changed
>> AC_BR_BONANZA
>>    0.001 0.001 0.001 0.001
>>    0.001 0.967 0.001 0.967
>> AC_DIV_BONANZA
>>    0.001 0.001 0.001 0.001
>>    0.909 0.001 0.909 0.001
>> ADVANCE_FEE_2_NEW_MONEY
>>    1.997 0.001 1.997 0.001
>>    0.001 0.039 0.001 0.039
>> ADVANCE_FEE_3_NEW
>>    3.496 0.001 3.496 0.001
>>    2.099 1.697 2.099 1.697
>> ADVANCE_FEE_3_NEW_MONEY
>>    2.796 0.001 2.796 0.001
>>    3.100 2.696 3.100 2.696
>> AXB_XMAILER_MIMEOLE_OL_024C2
>>    0.367 0.001 0.367 0.001
>>    0.001 2.822 0.001 2.822
>> BODY_URI_ONLY
>>    0.998 0.001 0.998 0.001
>>    3.500 3.595 3.500 3.595
>> BOGUS_MSM_HDRS
>>    0.909 0.001 0.909 0.001
>>    1.912 0.677 1.912 0.677
>> CK_HELO_DYNAMIC_SPLIT_IP
>>    1.350 0.001 1.350 0.001
>>    1.500 0.001 1.500 0.001
>> CK_HELO_GENERIC
>>    0.249 0.249 0.249 0.249
>>    2.195 1.350 2.195 1.350
>> DATE_IN_FUTURE_96_Q
>>    3.296 3.299 3.296 3.299
>>    2.999 2.696 2.999 2.696
>> FILL_THIS_FORM
>>    2.748 0.001 2.748 0.001
>>    0.200 1.402 0.200 1.402
>> FORM_FRAUD
>>    0.998 0.001 0.998 0.001
>>    3.099 1.841 3.099 1.841
>> FORM_FRAUD_3
>>    2.696 0.001 2.696 0.001
>>    2.899 0.383 2.899 0.383
>> FORM_FRAUD_5
>>    0.209 0.001 0.209 0.001
>>    1.531 1.802 1.531 1.802
>> FREEMAIL_FORGED_FROMDOMAIN
>>    0.001 0.199 0.001 0.199
>>    0.001 0.001 0.001 0.001
>> FROM_IN_TO_AND_SUBJ
>>    0.287 0.262 0.287 0.262
>>    1.001 0.001 1.001 0.001
>> FROM_MISSP_FREEMAIL
>>    3.595 0.001 3.595 0.001
>>    2.965 3.393 2.965 3.393
>> FROM_MISSP_MSFT
>>    0.001 0.001 0.001 0.001
>>    0.001 3.129 0.001 3.129
>> FROM_MISSP_REPLYTO
>>    0.001 0.001 0.001 0.001
>>    3.799 0.001 3.799 0.001
>> FROM_MISSP_SPF_FAIL
>>    0.001 1.000 0.001 1.000
>>    0.001 0.001 0.001 0.001
>> FROM_MISSP_TO_UNDISC
>>    1.438 0.001 1.438 0.001
>>    1.102 0.275 1.102 0.275
>> FROM_MISSP_USER
>>    0.001 0.001 0.001 0.001
>>    0.001 1.416 0.001 1.416
>> FROM_MISSP_XPRIO
>>    0.001 0.001 0.001 0.001
>>    0.001 3.595 0.001 3.595
>> FROM_WORDY
>>    2.497 0.001 2.497 0.001
>>    3.299 2.511 3.299 2.511
>> FSL_CTYPE_WIN1251
>>    0.001 0.001 0.001 0.001
>>    0.001 0.588 0.001 0.588
>> FSL_NEW_HELO_USER
>>    0.083 0.001 0.083 0.001
>>    0.001 0.755 0.001 0.755
>> HELO_MISC_IP
>>    0.248 0.250 0.248 0.250
>>    2.600 1.357 2.600 1.357
>> HK_RANDOM_FROM
>>    0.998 0.001 0.998 0.001
>>    1.501 2.664 1.501 2.664
>> HK_SCAM_N2
>>    3.249 0.001 3.249 0.001
>>    3.099 2.696 3.099 2.696
>> IMG_DIRECT_TO_MX
>>    2.397 2.400 2.397 2.400
>>    3.599 1.743 3.599 1.743
>> LONG_HEX_URI
>>    2.194 2.290 2.194 2.290
>>    0.399 0.884 0.399 0.884
>> LONG_IMG_URI
>>    0.553 0.100 0.553 0.100
>>    0.001 0.001 0.001 0.001
>> LOTS_OF_MONEY
>>    0.001 0.001 0.001 0.001
>>    0.001 0.006 0.001 0.006
>> MIMEOLE_DIRECT_TO_MX
>>    1.445 0.381 1.445 0.381
>>    2.399 0.737 2.399 0.737
>> MIME_NO_TEXT
>>    1.000 1.000 1.000 1.000
>>    3.505 2.941 3.505 2.941
>> MONEY_FRAUD_3
>>    2.896 0.001 2.896 0.001
>>    3.100 0.243 3.100 0.243
>> MONEY_FRAUD_5
>>    3.096 0.001 3.096 0.001
>>    3.400 2.896 3.400 2.896
>> MONEY_FRAUD_8
>>    2.548 0.001 2.548 0.001
>>    0.364 3.199 0.364 3.199
>> NSL_RCVD_FROM_USER
>>    0.548 0.001 0.548 0.001
>>    0.001 0.159 0.001 0.159
>> NSL_RCVD_HELO_USER
>>    1.273 0.001 1.273 0.001
>>    2.599 0.970 2.599 0.970
>> PP_MIME_FAKE_ASCII_TEXT
>>    0.429 0.001 0.429 0.001
>>    0.899 0.001 0.899 0.001
>> RCVD_IN_MSPIKE_H2
>>    0.001 -2.800 0.001 -2.800
>>    0.001 -1.271 0.001 -1.271
>> RCVD_IN_MSPIKE_L3
>>    0.001 0.001 0.001 0.001
>>    0.001 1.282 0.001 1.282
>> RCVD_IN_MSPIKE_L4
>>    0.001 0.001 0.001 0.001
>>    0.001 0.147 0.001 0.147
>> RCVD_IN_MSPIKE_L5
>>    0.001 0.001 0.001 0.001
>>    0.001 2.283 0.001 2.283
>> RCVD_IN_MSPIKE_ZBI
>>    0.001 0.001 0.001 0.001
>>    0.001 3.496 0.001 3.496
>> SPOOFED_FREEM_REPTO
>>    2.498 1.368 2.498 1.368
>>    3.899 3.396 3.899 3.396
>> STATIC_XPRIO_OLE
>>    1.997 0.001 1.997 0.001
>>    1.668 0.277 1.668 0.277
>> STYLE_GIBBERISH
>>    2.800 3.093 2.800 3.093
>>    3.499 2.554 3.499 2.554
>> THIS_AD
>>    0.596 2.200 0.596 2.200
>>    0.001 0.001 0.001 0.001
>> TO_EQ_FM_DIRECT_MX
>>    2.497 0.622 2.497 0.622
>>    2.799 2.997 2.799 2.997
>> TO_IN_SUBJ
>>    0.099 0.099 0.099 0.099
>>    2.099 0.001 2.099 0.001
>> TO_NO_BRKTS_HTML_IMG
>>    0.001 2.000 0.001 2.000
>>    0.001 0.799 0.001 0.799
>> TO_NO_BRKTS_HTML_ONLY
>>    1.997 0.001 1.997 0.001
>>    0.001 0.001 0.001 0.001
>> TO_NO_BRKTS_MSFT
>>    2.497 0.001 2.497 0.001
>>    0.602 3.595 0.602 3.595
>> TO_NO_BRKTS_NORDNS_HTML
>>    0.398 0.001 0.398 0.001
>>    0.001 0.339 0.001 0.339
>> TO_NO_BRKTS_PCNT
>>    2.497 0.001 2.497 0.001
>>    2.699 2.196 2.699 2.196
>> TVD_SPACE_ENCODED
>>    2.497 0.001 2.497 0.001
>>    3.599 3.096 3.599 3.096
>> TVD_SPACE_ENC_FM_MIME
>>    1.997 0.001 1.997 0.001
>>    3.499 2.996 3.499 2.996
>> TVD_SPACE_RATIO_MINFP
>>    2.497 0.001 2.497 0.001
>>    4.399 4.395 4.399 4.395
>> URI_ONLY_MSGID_MALF
>>    0.001 1.191 0.001 1.191
>>    2.800 2.880 2.800 2.880
>> URI_TRY_3LD
>>    0.195 0.001 0.195 0.001
>>    0.001 0.227 0.001 0.227
>> URI_TRY_USME
>>    0.001 0.001 0.001 0.001
>>    3.299 2.896 3.299 2.896
>> URI_WPADMIN
>>    3.396 3.014 3.396 3.014
>>    2.899 2.498 2.899 2.498
>> URI_WP_DIRINDEX
>>    1.000 1.000 1.000 1.000
>>    4.399 3.995 4.399 3.995
>> URI_WP_HACKED
>>    2.996 3.000 2.996 3.000
>>    4.099 3.146 4.099 3.146
>> URI_WP_HACKED_2
>>    1.187 1.764 1.187 1.764
>>    1.479 3.412 1.479 3.412
>> XPRIO
>>    2.248 2.249 2.248 2.249
>>    0.367 0.140 0.367 0.140
>> 

Re: comparing 72_scores.cf files

Posted by Dave Jones <da...@apache.org>.
Nice script.  Very useful.  Now it would be good to extend it to check 
the rules still exist.  For example, I wrote a quick bash on-liner to 
find the rules no longer exist in the main SA rules so we shouldn't 
expect these 6 to be in the new 72_scores.cf:

AXB_XMAILER_MIMEOLE_OL_1ECD5
AXB_XM_FORGED_OL2600
FSL_HELO_BARE_IP_2
HK_SCAM_N15
MONEY_LOTTERY
MSGID_NOFQDN1

Next it would be interesting to see if these rest of the "only in 1 
(removed in 2)" are in other current rule files and if they have an 
uncommented score.  Ones without a score or a commented score will get 
the default score of 1.0.  Otherwise, we may not be too far off once we 
can get the lock-scores commited today.

Dave

On 11/15/2017 06:27 AM, Merijn van den Kroonenberg wrote:
> I wanted to compare the 72_scores.cf file against the one from yesterday
> and the one from march. However the svn diff is not very readable and its
> very hard to see which rules were added or removed. I also wanted to
> compare the ranges.data file which also contains rules and score ranges.
>
> I patched together a small script for comparing score files.
>
> See attchement for the (perl) script.
>
> It produces output like this (comparing march against last night):
>
> ./compare-rulefiles 72_scores_20170315.cf 72_scores-1815282.cf
>
> Only in 1 (removed in 2)
> AC_HTML_NONSENSE_TAGS
> AC_SPAMMY_URI_PATTERNS1
> AC_SPAMMY_URI_PATTERNS10
> AC_SPAMMY_URI_PATTERNS11
> AC_SPAMMY_URI_PATTERNS12
> AC_SPAMMY_URI_PATTERNS2
> AC_SPAMMY_URI_PATTERNS3
> AC_SPAMMY_URI_PATTERNS4
> AC_SPAMMY_URI_PATTERNS8
> AC_SPAMMY_URI_PATTERNS9
> ADVANCE_FEE_2_NEW_FORM
> ADVANCE_FEE_4_NEW
> ADVANCE_FEE_5_NEW
> AXB_XMAILER_MIMEOLE_OL_1ECD5
> AXB_XM_FORGED_OL2600
> BODY_EMPTY
> CANT_SEE_AD
> CN_B2B_SPAMMER
> COMMENT_GIBBERISH
> ENCRYPTED_MESSAGE
> FBI_MONEY
> FBI_SPOOF
> FORM_LOW_CONTRAST
> FOUND_YOU
> FREEMAIL_DOC_PDF_BCC
> FROM_WORDY_SHORT
> FSL_HELO_BARE_IP_2
> GOOGLE_DOCS_PHISH
> GOOGLE_DOCS_PHISH_MANY
> GOOG_MALWARE_DNLD
> HDRS_LCASE
> HEXHASH_WORD
> HK_SCAM_N15
> HTML_OFF_PAGE
> LIST_PRTL_PUMPDUMP
> LIST_PRTL_SAME_USER
> LOTTO_AGENT
> LOTTO_DEPT
> LUCRATIVE
> MONEY_LOTTERY
> MSGID_NOFQDN1
> MSM_PRIO_REPTO
> PHP_NOVER_MUA
> PHP_ORIG_SCRIPT
> PHP_SCRIPT_MUA
> PP_TOO_MUCH_UNICODE02
> PP_TOO_MUCH_UNICODE05
> PUMPDUMP
> PUMPDUMP_MULTI
> RAND_HEADER_MANY
> RP_MATCHES_RCVD
> SHARE_50_50
> SPOOFED_FREEM_REPTO_CHN
> STOCK_LOW_CONTRAST
> STOCK_TIP
> SYSADMIN
> TO_NO_BRKTS_FROM_MSSP
> TW_GIBBERISH_MANY
> UC_GIBBERISH_OBFU
> URI_DATA
> URI_GOOGLE_PROXY
> URI_OPTOUT_3LD
> URI_PHISH
> XPRIO_SHORT_SUBJ
>
> Only in 2 (added in 2)
> ADVANCE_FEE_4_NEW_MONEY
> ADVANCE_FEE_5_NEW_FRM_MNY
> ADVANCE_FEE_5_NEW_MONEY
> APOSTROPHE_TOCC
> AXB_X_AOL_SEZ_S
> DEAR_BENEFICIARY
> FROM_MISSP_DYNIP
> FROM_MISSP_EH_MATCH
> FSL_HELO_FAKE
> FSL_MIME_NO_TEXT
> FUZZY_UNSUBSCRIBE
> HDRS_MISSP
> MANY_PILL_PRICE
> MILLION_USD
> MONEY_ATM_CARD
> MONEY_FORM_SHORT
> MONEY_FROM_41
> SERGIO_SUBJECT_VIAGRA01
> SHORTENED_URL_SRC
> SINGLETS_LOW_CONTRAST
> TO_NO_BRKTS_DYNIP
>
> Changed
> AC_BR_BONANZA
>    0.001 0.001 0.001 0.001
>    0.001 0.967 0.001 0.967
> AC_DIV_BONANZA
>    0.001 0.001 0.001 0.001
>    0.909 0.001 0.909 0.001
> ADVANCE_FEE_2_NEW_MONEY
>    1.997 0.001 1.997 0.001
>    0.001 0.039 0.001 0.039
> ADVANCE_FEE_3_NEW
>    3.496 0.001 3.496 0.001
>    2.099 1.697 2.099 1.697
> ADVANCE_FEE_3_NEW_MONEY
>    2.796 0.001 2.796 0.001
>    3.100 2.696 3.100 2.696
> AXB_XMAILER_MIMEOLE_OL_024C2
>    0.367 0.001 0.367 0.001
>    0.001 2.822 0.001 2.822
> BODY_URI_ONLY
>    0.998 0.001 0.998 0.001
>    3.500 3.595 3.500 3.595
> BOGUS_MSM_HDRS
>    0.909 0.001 0.909 0.001
>    1.912 0.677 1.912 0.677
> CK_HELO_DYNAMIC_SPLIT_IP
>    1.350 0.001 1.350 0.001
>    1.500 0.001 1.500 0.001
> CK_HELO_GENERIC
>    0.249 0.249 0.249 0.249
>    2.195 1.350 2.195 1.350
> DATE_IN_FUTURE_96_Q
>    3.296 3.299 3.296 3.299
>    2.999 2.696 2.999 2.696
> FILL_THIS_FORM
>    2.748 0.001 2.748 0.001
>    0.200 1.402 0.200 1.402
> FORM_FRAUD
>    0.998 0.001 0.998 0.001
>    3.099 1.841 3.099 1.841
> FORM_FRAUD_3
>    2.696 0.001 2.696 0.001
>    2.899 0.383 2.899 0.383
> FORM_FRAUD_5
>    0.209 0.001 0.209 0.001
>    1.531 1.802 1.531 1.802
> FREEMAIL_FORGED_FROMDOMAIN
>    0.001 0.199 0.001 0.199
>    0.001 0.001 0.001 0.001
> FROM_IN_TO_AND_SUBJ
>    0.287 0.262 0.287 0.262
>    1.001 0.001 1.001 0.001
> FROM_MISSP_FREEMAIL
>    3.595 0.001 3.595 0.001
>    2.965 3.393 2.965 3.393
> FROM_MISSP_MSFT
>    0.001 0.001 0.001 0.001
>    0.001 3.129 0.001 3.129
> FROM_MISSP_REPLYTO
>    0.001 0.001 0.001 0.001
>    3.799 0.001 3.799 0.001
> FROM_MISSP_SPF_FAIL
>    0.001 1.000 0.001 1.000
>    0.001 0.001 0.001 0.001
> FROM_MISSP_TO_UNDISC
>    1.438 0.001 1.438 0.001
>    1.102 0.275 1.102 0.275
> FROM_MISSP_USER
>    0.001 0.001 0.001 0.001
>    0.001 1.416 0.001 1.416
> FROM_MISSP_XPRIO
>    0.001 0.001 0.001 0.001
>    0.001 3.595 0.001 3.595
> FROM_WORDY
>    2.497 0.001 2.497 0.001
>    3.299 2.511 3.299 2.511
> FSL_CTYPE_WIN1251
>    0.001 0.001 0.001 0.001
>    0.001 0.588 0.001 0.588
> FSL_NEW_HELO_USER
>    0.083 0.001 0.083 0.001
>    0.001 0.755 0.001 0.755
> HELO_MISC_IP
>    0.248 0.250 0.248 0.250
>    2.600 1.357 2.600 1.357
> HK_RANDOM_FROM
>    0.998 0.001 0.998 0.001
>    1.501 2.664 1.501 2.664
> HK_SCAM_N2
>    3.249 0.001 3.249 0.001
>    3.099 2.696 3.099 2.696
> IMG_DIRECT_TO_MX
>    2.397 2.400 2.397 2.400
>    3.599 1.743 3.599 1.743
> LONG_HEX_URI
>    2.194 2.290 2.194 2.290
>    0.399 0.884 0.399 0.884
> LONG_IMG_URI
>    0.553 0.100 0.553 0.100
>    0.001 0.001 0.001 0.001
> LOTS_OF_MONEY
>    0.001 0.001 0.001 0.001
>    0.001 0.006 0.001 0.006
> MIMEOLE_DIRECT_TO_MX
>    1.445 0.381 1.445 0.381
>    2.399 0.737 2.399 0.737
> MIME_NO_TEXT
>    1.000 1.000 1.000 1.000
>    3.505 2.941 3.505 2.941
> MONEY_FRAUD_3
>    2.896 0.001 2.896 0.001
>    3.100 0.243 3.100 0.243
> MONEY_FRAUD_5
>    3.096 0.001 3.096 0.001
>    3.400 2.896 3.400 2.896
> MONEY_FRAUD_8
>    2.548 0.001 2.548 0.001
>    0.364 3.199 0.364 3.199
> NSL_RCVD_FROM_USER
>    0.548 0.001 0.548 0.001
>    0.001 0.159 0.001 0.159
> NSL_RCVD_HELO_USER
>    1.273 0.001 1.273 0.001
>    2.599 0.970 2.599 0.970
> PP_MIME_FAKE_ASCII_TEXT
>    0.429 0.001 0.429 0.001
>    0.899 0.001 0.899 0.001
> RCVD_IN_MSPIKE_H2
>    0.001 -2.800 0.001 -2.800
>    0.001 -1.271 0.001 -1.271
> RCVD_IN_MSPIKE_L3
>    0.001 0.001 0.001 0.001
>    0.001 1.282 0.001 1.282
> RCVD_IN_MSPIKE_L4
>    0.001 0.001 0.001 0.001
>    0.001 0.147 0.001 0.147
> RCVD_IN_MSPIKE_L5
>    0.001 0.001 0.001 0.001
>    0.001 2.283 0.001 2.283
> RCVD_IN_MSPIKE_ZBI
>    0.001 0.001 0.001 0.001
>    0.001 3.496 0.001 3.496
> SPOOFED_FREEM_REPTO
>    2.498 1.368 2.498 1.368
>    3.899 3.396 3.899 3.396
> STATIC_XPRIO_OLE
>    1.997 0.001 1.997 0.001
>    1.668 0.277 1.668 0.277
> STYLE_GIBBERISH
>    2.800 3.093 2.800 3.093
>    3.499 2.554 3.499 2.554
> THIS_AD
>    0.596 2.200 0.596 2.200
>    0.001 0.001 0.001 0.001
> TO_EQ_FM_DIRECT_MX
>    2.497 0.622 2.497 0.622
>    2.799 2.997 2.799 2.997
> TO_IN_SUBJ
>    0.099 0.099 0.099 0.099
>    2.099 0.001 2.099 0.001
> TO_NO_BRKTS_HTML_IMG
>    0.001 2.000 0.001 2.000
>    0.001 0.799 0.001 0.799
> TO_NO_BRKTS_HTML_ONLY
>    1.997 0.001 1.997 0.001
>    0.001 0.001 0.001 0.001
> TO_NO_BRKTS_MSFT
>    2.497 0.001 2.497 0.001
>    0.602 3.595 0.602 3.595
> TO_NO_BRKTS_NORDNS_HTML
>    0.398 0.001 0.398 0.001
>    0.001 0.339 0.001 0.339
> TO_NO_BRKTS_PCNT
>    2.497 0.001 2.497 0.001
>    2.699 2.196 2.699 2.196
> TVD_SPACE_ENCODED
>    2.497 0.001 2.497 0.001
>    3.599 3.096 3.599 3.096
> TVD_SPACE_ENC_FM_MIME
>    1.997 0.001 1.997 0.001
>    3.499 2.996 3.499 2.996
> TVD_SPACE_RATIO_MINFP
>    2.497 0.001 2.497 0.001
>    4.399 4.395 4.399 4.395
> URI_ONLY_MSGID_MALF
>    0.001 1.191 0.001 1.191
>    2.800 2.880 2.800 2.880
> URI_TRY_3LD
>    0.195 0.001 0.195 0.001
>    0.001 0.227 0.001 0.227
> URI_TRY_USME
>    0.001 0.001 0.001 0.001
>    3.299 2.896 3.299 2.896
> URI_WPADMIN
>    3.396 3.014 3.396 3.014
>    2.899 2.498 2.899 2.498
> URI_WP_DIRINDEX
>    1.000 1.000 1.000 1.000
>    4.399 3.995 4.399 3.995
> URI_WP_HACKED
>    2.996 3.000 2.996 3.000
>    4.099 3.146 4.099 3.146
> URI_WP_HACKED_2
>    1.187 1.764 1.187 1.764
>    1.479 3.412 1.479 3.412
> XPRIO
>    2.248 2.249 2.248 2.249
>    0.367 0.140 0.367 0.140
>


Re: comparing 72_scores.cf files

Posted by "Kevin A. McGrail" <ke...@mcgrail.com>.
Very Cool.  Is there a Tl;Dr result you came up with or is this more 
something prepped for when we get to the stage to review?

On 11/15/2017 7:27 AM, Merijn van den Kroonenberg wrote:
> I wanted to compare the 72_scores.cf file against the one from yesterday
> and the one from march. However the svn diff is not very readable and its
> very hard to see which rules were added or removed. I also wanted to
> compare the ranges.data file which also contains rules and score ranges.
>
> I patched together a small script for comparing score files.
>
> See attchement for the (perl) script.
>
> It produces output like this (comparing march against last night):
>
> ./compare-rulefiles 72_scores_20170315.cf 72_scores-1815282.cf
>
> Only in 1 (removed in 2)
> AC_HTML_NONSENSE_TAGS
> AC_SPAMMY_URI_PATTERNS1
> AC_SPAMMY_URI_PATTERNS10
> AC_SPAMMY_URI_PATTERNS11
> AC_SPAMMY_URI_PATTERNS12
> AC_SPAMMY_URI_PATTERNS2
> AC_SPAMMY_URI_PATTERNS3
> AC_SPAMMY_URI_PATTERNS4
> AC_SPAMMY_URI_PATTERNS8
> AC_SPAMMY_URI_PATTERNS9
> ADVANCE_FEE_2_NEW_FORM
> ADVANCE_FEE_4_NEW
> ADVANCE_FEE_5_NEW
> AXB_XMAILER_MIMEOLE_OL_1ECD5
> AXB_XM_FORGED_OL2600
> BODY_EMPTY
> CANT_SEE_AD
> CN_B2B_SPAMMER
> COMMENT_GIBBERISH
> ENCRYPTED_MESSAGE
> FBI_MONEY
> FBI_SPOOF
> FORM_LOW_CONTRAST
> FOUND_YOU
> FREEMAIL_DOC_PDF_BCC
> FROM_WORDY_SHORT
> FSL_HELO_BARE_IP_2
> GOOGLE_DOCS_PHISH
> GOOGLE_DOCS_PHISH_MANY
> GOOG_MALWARE_DNLD
> HDRS_LCASE
> HEXHASH_WORD
> HK_SCAM_N15
> HTML_OFF_PAGE
> LIST_PRTL_PUMPDUMP
> LIST_PRTL_SAME_USER
> LOTTO_AGENT
> LOTTO_DEPT
> LUCRATIVE
> MONEY_LOTTERY
> MSGID_NOFQDN1
> MSM_PRIO_REPTO
> PHP_NOVER_MUA
> PHP_ORIG_SCRIPT
> PHP_SCRIPT_MUA
> PP_TOO_MUCH_UNICODE02
> PP_TOO_MUCH_UNICODE05
> PUMPDUMP
> PUMPDUMP_MULTI
> RAND_HEADER_MANY
> RP_MATCHES_RCVD
> SHARE_50_50
> SPOOFED_FREEM_REPTO_CHN
> STOCK_LOW_CONTRAST
> STOCK_TIP
> SYSADMIN
> TO_NO_BRKTS_FROM_MSSP
> TW_GIBBERISH_MANY
> UC_GIBBERISH_OBFU
> URI_DATA
> URI_GOOGLE_PROXY
> URI_OPTOUT_3LD
> URI_PHISH
> XPRIO_SHORT_SUBJ
>
> Only in 2 (added in 2)
> ADVANCE_FEE_4_NEW_MONEY
> ADVANCE_FEE_5_NEW_FRM_MNY
> ADVANCE_FEE_5_NEW_MONEY
> APOSTROPHE_TOCC
> AXB_X_AOL_SEZ_S
> DEAR_BENEFICIARY
> FROM_MISSP_DYNIP
> FROM_MISSP_EH_MATCH
> FSL_HELO_FAKE
> FSL_MIME_NO_TEXT
> FUZZY_UNSUBSCRIBE
> HDRS_MISSP
> MANY_PILL_PRICE
> MILLION_USD
> MONEY_ATM_CARD
> MONEY_FORM_SHORT
> MONEY_FROM_41
> SERGIO_SUBJECT_VIAGRA01
> SHORTENED_URL_SRC
> SINGLETS_LOW_CONTRAST
> TO_NO_BRKTS_DYNIP
>
> Changed
> AC_BR_BONANZA
>    0.001 0.001 0.001 0.001
>    0.001 0.967 0.001 0.967
> AC_DIV_BONANZA
>    0.001 0.001 0.001 0.001
>    0.909 0.001 0.909 0.001
> ADVANCE_FEE_2_NEW_MONEY
>    1.997 0.001 1.997 0.001
>    0.001 0.039 0.001 0.039
> ADVANCE_FEE_3_NEW
>    3.496 0.001 3.496 0.001
>    2.099 1.697 2.099 1.697
> ADVANCE_FEE_3_NEW_MONEY
>    2.796 0.001 2.796 0.001
>    3.100 2.696 3.100 2.696
> AXB_XMAILER_MIMEOLE_OL_024C2
>    0.367 0.001 0.367 0.001
>    0.001 2.822 0.001 2.822
> BODY_URI_ONLY
>    0.998 0.001 0.998 0.001
>    3.500 3.595 3.500 3.595
> BOGUS_MSM_HDRS
>    0.909 0.001 0.909 0.001
>    1.912 0.677 1.912 0.677
> CK_HELO_DYNAMIC_SPLIT_IP
>    1.350 0.001 1.350 0.001
>    1.500 0.001 1.500 0.001
> CK_HELO_GENERIC
>    0.249 0.249 0.249 0.249
>    2.195 1.350 2.195 1.350
> DATE_IN_FUTURE_96_Q
>    3.296 3.299 3.296 3.299
>    2.999 2.696 2.999 2.696
> FILL_THIS_FORM
>    2.748 0.001 2.748 0.001
>    0.200 1.402 0.200 1.402
> FORM_FRAUD
>    0.998 0.001 0.998 0.001
>    3.099 1.841 3.099 1.841
> FORM_FRAUD_3
>    2.696 0.001 2.696 0.001
>    2.899 0.383 2.899 0.383
> FORM_FRAUD_5
>    0.209 0.001 0.209 0.001
>    1.531 1.802 1.531 1.802
> FREEMAIL_FORGED_FROMDOMAIN
>    0.001 0.199 0.001 0.199
>    0.001 0.001 0.001 0.001
> FROM_IN_TO_AND_SUBJ
>    0.287 0.262 0.287 0.262
>    1.001 0.001 1.001 0.001
> FROM_MISSP_FREEMAIL
>    3.595 0.001 3.595 0.001
>    2.965 3.393 2.965 3.393
> FROM_MISSP_MSFT
>    0.001 0.001 0.001 0.001
>    0.001 3.129 0.001 3.129
> FROM_MISSP_REPLYTO
>    0.001 0.001 0.001 0.001
>    3.799 0.001 3.799 0.001
> FROM_MISSP_SPF_FAIL
>    0.001 1.000 0.001 1.000
>    0.001 0.001 0.001 0.001
> FROM_MISSP_TO_UNDISC
>    1.438 0.001 1.438 0.001
>    1.102 0.275 1.102 0.275
> FROM_MISSP_USER
>    0.001 0.001 0.001 0.001
>    0.001 1.416 0.001 1.416
> FROM_MISSP_XPRIO
>    0.001 0.001 0.001 0.001
>    0.001 3.595 0.001 3.595
> FROM_WORDY
>    2.497 0.001 2.497 0.001
>    3.299 2.511 3.299 2.511
> FSL_CTYPE_WIN1251
>    0.001 0.001 0.001 0.001
>    0.001 0.588 0.001 0.588
> FSL_NEW_HELO_USER
>    0.083 0.001 0.083 0.001
>    0.001 0.755 0.001 0.755
> HELO_MISC_IP
>    0.248 0.250 0.248 0.250
>    2.600 1.357 2.600 1.357
> HK_RANDOM_FROM
>    0.998 0.001 0.998 0.001
>    1.501 2.664 1.501 2.664
> HK_SCAM_N2
>    3.249 0.001 3.249 0.001
>    3.099 2.696 3.099 2.696
> IMG_DIRECT_TO_MX
>    2.397 2.400 2.397 2.400
>    3.599 1.743 3.599 1.743
> LONG_HEX_URI
>    2.194 2.290 2.194 2.290
>    0.399 0.884 0.399 0.884
> LONG_IMG_URI
>    0.553 0.100 0.553 0.100
>    0.001 0.001 0.001 0.001
> LOTS_OF_MONEY
>    0.001 0.001 0.001 0.001
>    0.001 0.006 0.001 0.006
> MIMEOLE_DIRECT_TO_MX
>    1.445 0.381 1.445 0.381
>    2.399 0.737 2.399 0.737
> MIME_NO_TEXT
>    1.000 1.000 1.000 1.000
>    3.505 2.941 3.505 2.941
> MONEY_FRAUD_3
>    2.896 0.001 2.896 0.001
>    3.100 0.243 3.100 0.243
> MONEY_FRAUD_5
>    3.096 0.001 3.096 0.001
>    3.400 2.896 3.400 2.896
> MONEY_FRAUD_8
>    2.548 0.001 2.548 0.001
>    0.364 3.199 0.364 3.199
> NSL_RCVD_FROM_USER
>    0.548 0.001 0.548 0.001
>    0.001 0.159 0.001 0.159
> NSL_RCVD_HELO_USER
>    1.273 0.001 1.273 0.001
>    2.599 0.970 2.599 0.970
> PP_MIME_FAKE_ASCII_TEXT
>    0.429 0.001 0.429 0.001
>    0.899 0.001 0.899 0.001
> RCVD_IN_MSPIKE_H2
>    0.001 -2.800 0.001 -2.800
>    0.001 -1.271 0.001 -1.271
> RCVD_IN_MSPIKE_L3
>    0.001 0.001 0.001 0.001
>    0.001 1.282 0.001 1.282
> RCVD_IN_MSPIKE_L4
>    0.001 0.001 0.001 0.001
>    0.001 0.147 0.001 0.147
> RCVD_IN_MSPIKE_L5
>    0.001 0.001 0.001 0.001
>    0.001 2.283 0.001 2.283
> RCVD_IN_MSPIKE_ZBI
>    0.001 0.001 0.001 0.001
>    0.001 3.496 0.001 3.496
> SPOOFED_FREEM_REPTO
>    2.498 1.368 2.498 1.368
>    3.899 3.396 3.899 3.396
> STATIC_XPRIO_OLE
>    1.997 0.001 1.997 0.001
>    1.668 0.277 1.668 0.277
> STYLE_GIBBERISH
>    2.800 3.093 2.800 3.093
>    3.499 2.554 3.499 2.554
> THIS_AD
>    0.596 2.200 0.596 2.200
>    0.001 0.001 0.001 0.001
> TO_EQ_FM_DIRECT_MX
>    2.497 0.622 2.497 0.622
>    2.799 2.997 2.799 2.997
> TO_IN_SUBJ
>    0.099 0.099 0.099 0.099
>    2.099 0.001 2.099 0.001
> TO_NO_BRKTS_HTML_IMG
>    0.001 2.000 0.001 2.000
>    0.001 0.799 0.001 0.799
> TO_NO_BRKTS_HTML_ONLY
>    1.997 0.001 1.997 0.001
>    0.001 0.001 0.001 0.001
> TO_NO_BRKTS_MSFT
>    2.497 0.001 2.497 0.001
>    0.602 3.595 0.602 3.595
> TO_NO_BRKTS_NORDNS_HTML
>    0.398 0.001 0.398 0.001
>    0.001 0.339 0.001 0.339
> TO_NO_BRKTS_PCNT
>    2.497 0.001 2.497 0.001
>    2.699 2.196 2.699 2.196
> TVD_SPACE_ENCODED
>    2.497 0.001 2.497 0.001
>    3.599 3.096 3.599 3.096
> TVD_SPACE_ENC_FM_MIME
>    1.997 0.001 1.997 0.001
>    3.499 2.996 3.499 2.996
> TVD_SPACE_RATIO_MINFP
>    2.497 0.001 2.497 0.001
>    4.399 4.395 4.399 4.395
> URI_ONLY_MSGID_MALF
>    0.001 1.191 0.001 1.191
>    2.800 2.880 2.800 2.880
> URI_TRY_3LD
>    0.195 0.001 0.195 0.001
>    0.001 0.227 0.001 0.227
> URI_TRY_USME
>    0.001 0.001 0.001 0.001
>    3.299 2.896 3.299 2.896
> URI_WPADMIN
>    3.396 3.014 3.396 3.014
>    2.899 2.498 2.899 2.498
> URI_WP_DIRINDEX
>    1.000 1.000 1.000 1.000
>    4.399 3.995 4.399 3.995
> URI_WP_HACKED
>    2.996 3.000 2.996 3.000
>    4.099 3.146 4.099 3.146
> URI_WP_HACKED_2
>    1.187 1.764 1.187 1.764
>    1.479 3.412 1.479 3.412
> XPRIO
>    2.248 2.249 2.248 2.249
>    0.367 0.140 0.367 0.140
>