Posted to dev@spamassassin.apache.org by bu...@bugzilla.spamassassin.org on 2012/12/18 01:15:38 UTC

[Bug 6880] Use of rule name as its own description

https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6880

Adam Katz <an...@khopis.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |antispam@khopis.com

--- Comment #1 from Adam Katz <an...@khopis.com> ---
(Disclaimer:  I've been bad about keeping up with the mailing list and entirely
missed this thread.  My apologies in advance if others have already proposed
either of these two ideas.)


I wouldn't mind the fallback being more informative than the bare rule name,
e.g. the first N characters of the rule definition, where N is the remaining
available width (minus three if we're to add "..." on the end).

This may, however, be too much visibility.
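
To make the truncation idea concrete, here is a minimal sketch in POSIX shell.
It is not how SpamAssassin builds its report; the function name truncate_def
and the sample rule FOO are made up for illustration, and it assumes
single-byte (ASCII) rule text.

```shell
# Hypothetical helper, not part of SpamAssassin.
# usage: truncate_def WIDTH TEXT
# Prints TEXT unchanged if it fits in WIDTH columns; otherwise prints the
# first WIDTH-3 characters followed by "...".
truncate_def() {
    width=$1
    text=$2
    if [ "${#text}" -le "$width" ]; then
        printf '%s\n' "$text"
    else
        # Reserve three columns for the trailing "..."
        keep=$((width - 3))
        # Guard against widths too small to hold any text at all
        [ "$keep" -lt 0 ] && keep=0
        printf "%.${keep}s...\n" "$text"
    fi
}

# Example with an invented rule definition and 20 columns of space left:
truncate_def 20 'header FOO Subject =~ /lottery winner/i'
```

With 20 columns available, 17 characters of the definition survive and the
remaining three are spent on the ellipsis.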


But really, we can nip most of this in the bud by writing descriptions for the
more commonly hitting rules.

Here's a list (and the code to generate it) of every published rule that lacks
an English description and whose combined ham+spam percentage exceeds 2.0.
This assumes you have access to the freqs data; adjust the paths as needed:

$ awk '$1 == "describe" { print $2 }' trunk/rules/*.cf |
    grep -Fwvf- results/freqs |
    awk '$2 + $3 > 2.0 && $7 !~ /^[T_]_|\(/ || /SPAM%/ {
        printf "%9s %9s %6s %5s %5s  %s\n", $2,$3,$4,$5,$6,$7 }'

    SPAM%      HAM%    S/O  RANK SCORE  NAME
   8.1460         0  1.000  0.98  4.40  AXB_XMAILER_MIMEOLE_OL_024C2
   6.4331         0  1.000  0.96  0.99  MSOE_MID_WRONG_CASE
   3.8914         0  1.000  0.91  2.00  MONEY_FROM_41
   4.1578    0.0147  0.996  0.90  0.10  HDRS_LCASE
   2.9813    0.0047  0.998  0.88  3.55  MONEY_FORM_SHORT
   4.0997    0.0332  0.992  0.88  2.40  REPLYTO_WITHOUT_TO_CC
   8.9362    0.0979  0.989  0.87  0.85  FSL_NEW_HELO_USER
   3.0561    0.0189  0.994  0.86  3.90  FM_LOTTO_MONEY
   2.4960    0.0026  0.999  0.86  2.24  HK_SCAM_N2
   2.4883    0.0026  0.999  0.86  1.41  LOTTO_DEPT
   2.2830         0  1.000  0.86  2.43  KB_FAKED_THE_BAT
   2.2860    0.0058  0.997  0.85  3.80  KB_DATE_CONTAINS_TAB
   3.0306    0.0558  0.982  0.82  3.10  STOX_REPLY_TYPE_WITHOUT_QUOTES
   2.4462    0.0474  0.981  0.80  0.00  HK_LOTTO
   2.6508    0.1016  0.963  0.76  1.00  MONEY_PERCENT
   3.0412    0.1300  0.959  0.75  1.90  STOX_REPLY_TYPE
   4.0309    0.2852  0.934  0.69  1.00  MALFORMED_FREEMAIL
   1.6001    0.9593  0.625  0.48  0.00  TVD_SPACE_RATIO
   2.5939    2.8648  0.475  0.44  0.50  SUBJECT_NEEDS_ENCODING
   2.0024    4.9165  0.289  0.37  0.00  TVD_RCVD_SPACE_BRACKET

(That's a hasty bit of scripting; it doesn't account for the T_* rules that got
published as dependencies, nor does it help for non-English descriptions, which
will use syntax like "lang de describe RULE  Beispiel Beschreibung")
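
For the non-English case, the first awk stage could be swapped for something
like the sketch below. The function name described_rules is hypothetical, and
it assumes that a plain "describe" line is the English description and that
translations always take the "lang XX describe RULE ..." form shown above.

```shell
# Hypothetical helper: list the rules that have a description in one language.
# usage: described_rules LANGCODE FILE...
described_rules() {
    lang=$1
    shift
    awk -v lang="$lang" '
        # Plain "describe RULE text": treated here as the English description
        $1 == "describe" && lang == "en" { print $2 }
        # "lang XX describe RULE text": a description in language XX
        $1 == "lang" && $2 == lang && $3 == "describe" { print $4 }
    ' "$@"
}
```

Its output could then be fed to the same filter, e.g.
"described_rules de trunk/rules/*.cf | grep -Fwvf- results/freqs", to list
rules still lacking a German description.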
