You are viewing a plain text version of this content. The canonical link for it is here.
Posted to users@spamassassin.apache.org by Shelley Waltz <sh...@cabm.rutgers.edu> on 2005/03/22 18:22:14 UTC

Wordlist Rules

I am trying to use the following from a posting on Mar 2 2004
RHES3/spamassassin-2.63-1 ...

# match Bayes-poison lists of lowercase words without articles or common
#prepositions

body  PT_WORDLIST_10
/(?:\b(?!(?:from|that|have|this|were|with)\b)[a-z]{4,12}\s+){10}/
describe PT_WORDLIST_10  string of 10+ random words
score  PT_WORDLIST_10  1.0

body  PT_WORDLIST_13
/(?:\b(?!(?:from|that|have|this|were|with)\b)[a-z]{4,12}\s+){13}/
describe PT_WORDLIST_13  string of 13+ random words
score  PT_WORDLIST_13  3.0

body  PT_WORDLIST_30
/(?:\b(?!(?:from|that|have|this|were|with)\b)[a-z]{4,12}\s+){30}/
describe PT_WORDLIST_30  string of 30+ random words
score  PT_WORDLIST_30  10.0

# match Bayes-poison lists of lowercase words without articles or common
#prepositions

body  XX_WORDLIST_10
/(?:\b(?!(?:from|that|have|this|were|with)\b)[a-z\.\,\-\;]{4,18}\s+){10}/
describe XX_WORDLIST_10  string of 10+ random words
score  XX_WORDLIST_10  1.0


When I spamassassin -D --lint I get the following ...


Failed to parse line in SpamAssassin configuration, skipping: body 
PT_WORDLIST_10
Failed to parse line in SpamAssassin configuration, skipping:
/(?:\b(?!(?:from|that|have|this|were|with)\b)[a_z]{4,12}\s+){10}/
Failed to parse line in SpamAssassin configuration, skipping: body 
PT_WORDLIST_13
Failed to parse line in SpamAssassin configuration, skipping:
/(?:\b(?!(?:from|that|have|this|were|with)\b)[a_z]{4,12}\s+){13}/
Failed to parse line in SpamAssassin configuration, skipping: body 
PT_WORDLIST_30
Failed to parse line in SpamAssassin configuration, skipping:
/(?:\b(?!(?:from|that|have|this|were|with)\b)[a_z]{4,12}\s+){30}/
Failed to parse line in SpamAssassin configuration, skipping: body 
XX_WORDLIST_10
Failed to parse line in SpamAssassin configuration, skipping:
/(?:\b(?!(?:from|that|have|this|were|with)\b)[a_z\.\,\_\;]{4,18}\s+){10}/
Failed to parse line in SpamAssassin configuration, skipping: body 
XX_WORDLIST_13
Failed to parse line in SpamAssassin configuration, skipping:
/(?:\b(?!(?:from|that|have|this|were|with)\b)[a_z\.\,\_\;]{4,18}\s+){13}/
Failed to parse line in SpamAssassin configuration, skipping: body 
XX_WORDLIST_30
Failed to parse line in SpamAssassin configuration, skipping:
/(?:\b(?!(?:from|that|have|this|were|with)\b)[a_z\.\,\_\;]{4,18}\s+){30}/

Do these rules work for SA-2.63?


{ Shelley Waltz;
  Center for Advanced Biotechnology and Medicine;
  Rutgers University/UMDNJ;
  679 Hoes Lane;
  Piscataway, NJ 08854;
  732 235 3346 }

Re: Wordlist Rules

Posted by Theo Van Dinter <fe...@kluge.net>.
On Tue, Mar 22, 2005 at 12:22:14PM -0500, Shelley Waltz wrote:
> body  PT_WORDLIST_10
> /(?:\b(?!(?:from|that|have|this|were|with)\b)[a-z]{4,12}\s+){10}/
> describe PT_WORDLIST_10  string of 10+ random words
> score  PT_WORDLIST_10  1.0
> 
> Failed to parse line in SpamAssassin configuration, skipping: body 
> PT_WORDLIST_10
> Failed to parse line in SpamAssassin configuration, skipping:
> /(?:\b(?!(?:from|that|have|this|were|with)\b)[a_z]{4,12}\s+){10}/

Configuration parameters must be on 1 line.  Join the RE line onto the
line before it, and try again.

-- 
Randomly Generated Tagline:
"... and now we have a parallelogram, or at least we would if I could draw."
                                                    - Prof. Farr

Re: Wordlist Rules

Posted by Matt Kettler <mk...@evi-inc.com>.
Shelley Waltz wrote:

>
>When I spamassassin -D --lint I get the following ...
>
>
>Failed to parse line in SpamAssassin configuration, skipping: body 
>PT_WORDLIST_10
>Failed to parse line in SpamAssassin configuration, skipping:
>/(?:\b(?!(?:from|that|have|this|were|with)\b)[a_z]{4,12}\s+){10}/
>Do these rules work for SA-2.63?
>

Yes, but you've got linewraps inserted where they shouldn't be, probably
due to one mail client or another wrapping lines..

Each rule should only be 3 lines, not 4. The line beginning with /
should be a part of the "body" line.