You are viewing a plain text version of this content. The canonical link for it is here.
Posted to users@spamassassin.apache.org by Matthew Newton <mc...@leicester.ac.uk> on 2005/04/21 11:48:56 UTC
New rules, could someone please check?
Hi,
Would someone with a decent size corpus please be kind enough to check
the following rules for me?
I think these are all new ones since last time I asked. I'm interested
in the top five, mainly.
The entire rule set is at http://www.le.ac.uk/cc/mcn4/spam/uolcc.cf (and
includes one or two that I know are "bad", and need to be rewritten...
use at your own risk!).
Thanks!
Matthew
full UOLCC_HTM_L_URL_2 /\n(http:\/\/[a-z0-9-]+\.[a-z]{2,4}\/[[:alnum:]]{5,35}\/[[:alnum:]]{5,40}={0,3}\.htm)\s*\n\s*\n\s*([^\s]+)(\s+[^\s]+){1,}\s*\n\s*\n[^\s.]+(\s[^\s.]+){0,15}[^\.]\n\s*\n\1l/s
describe UOLCC_HTM_L_URL_2 Matches pattern of spam mail (2) (.htm .html)
score UOLCC_HTM_L_URL_2 3.8
full UOLCC_NOMORE /\n\s*no\s+more\?\s*\n/is
describe UOLCC_NOMORE Bad unsubscribe question
score UOLCC_NOMORE 0.1
full UOLCC_TOPGRADE /\n\s*top[-\s]*grade\s+quality\s*\n/is
describe UOLCC_TOPGRADE Spammy phrase
score UOLCC_TOPGRADE 0.1
full UOLCC_LOWPRICE /\n\s*low\s+prices?\s*\n/is
describe UOLCC_LOWPRICE Spammy phrase
score UOLCC_LOWPRICE 0.1
full UOLCC_FASTDELIV /\n\s*(?:swift|fast|quick)\s+delivery\s*\n/is
describe UOLCC_FASTDELIV Spammy phrase
score UOLCC_FASTDELIV 0.1
full UOLCC_RUSDELUXE /RusDeluxe.{0,5}Group/
describe UOLCC_RUSDELUXE Body contains spam phrase
score UOLCC_RUSDELUXE 5.0
full UOLCC_RUSDELUXE1 /12 Pushkinskaya street, office/
describe UOLCC_RUSDELUXE1 Body contains spam address
score UOLCC_RUSDELUXE1 5.0
full UOLCC_RUSDELUXE2 /33 Bolshaya Nikitskaya street, office/
describe UOLCC_RUSDELUXE2 Body contains spam address
score UOLCC_RUSDELUXE2 5.0
full UOLCC_RD_ICQ /ICQ\#\s*338818190/i
describe UOLCC_RD_ICQ Body contains bad ICQ number
score UOLCC_RD_ICQ 5.0
full UOLCC_MAKE_MONEY /\.make\.money\./i
describe UOLCC_MAKE_MONEY Body contains spam phrase
score UOLCC_MAKE_MONEY 4.5
header UOLCC_ZETA_TRADE Subject =~ /Zeta Trade/
describe UOLCC_ZETA_TRADE Subject contains spam phrase
score UOLCC_ZETA_TRADE 2.5
full UOLCC_ZETA_TRADE1 /Zeta Trade/
describe UOLCC_ZETA_TRADE1 Body contains spam phrase
score UOLCC_ZETA_TRADE1 2.5
body __UOLCC_DRUG1 /cialis\s+soft\s+tabs/i
body __UOLCC_DRUG2 /\bimpotence\b/i
body __UOLCC_DRUG3 /\btadalafil\b/i
body __UOLCC_DRUG4 /\bbest\s+erections?\b/i
body __UOLCC_DRUG5 /\bno\s+prior\s+prescription\s+needed\b/i
body __UOLCC_DRUG6 /\bless\s+sidebacks\b/i
body __UOLCC_DRUG7 /\bsex\b/i
meta UOLCC_DRUGS1 ((__UOLCC_DRUG1 + __UOLCC_DRUG2 + __UOLCC_DRUG3 + __UOLCC_DRUG4 + __UOLCC_DRUG5 + __UOLCC_DRUG6 + __UOLCC_DRUG7) > 4)
describe UOLCC_DRUGS1 Refers to drugs
score UOLCC_DRUGS1 3.5
--
Matthew Newton <mc...@le.ac.uk>
UNIX and e-mail Systems Administrator, Network Support Section,
Computer Centre, University of Leicester,
Leicester LE1 7RH, United Kingdom