Posted to users@spamassassin.apache.org by WalyBaly <wa...@softhome.net> on 2004/08/03 13:12:20 UTC

Re: training my SA for bayes for all users

Well, I made some changes.
Here's what I got (hope it's correct):


spamd -D
debug: Score set 0 chosen.
debug: Preloading modules with HOME=/tmp/spamd-24851-init
debug: ignore: test message to precompile patterns and load modules
debug: using "/usr/share/spamassassin" for default rules dir
debug: using "/etc/mail/spamassassin" for site rules dir
debug: bayes: 24851 tie-ing to DB file R/O /etc/mail/spamassassin/_toks
debug: bayes: 24851 tie-ing to DB file R/O /etc/mail/spamassassin/_seen
debug: bayes: found bayes db version 2
debug: bayes: Not available for scanning, only 160 ham(s) in Bayes DB < 200
debug: bayes: 24851 untie-ing
debug: bayes: 24851 untie-ing db_toks
debug: bayes: 24851 untie-ing db_seen
debug: Score set 1 chosen.
debug: Initialising learner
debug: bayes: 24851 tie-ing to DB file R/O /etc/mail/spamassassin/_toks
debug: bayes: 24851 tie-ing to DB file R/O /etc/mail/spamassassin/_seen
debug: bayes: found bayes db version 2
debug: bayes: Not available for scanning, only 160 ham(s) in Bayes DB < 200
debug: bayes: 24851 untie-ing
debug: bayes: 24851 untie-ing db_toks
debug: bayes: 24851 untie-ing db_seen
debug: is Net::DNS::Resolver available? yes
debug: trying (3) slashdot.org...
debug: looking up MX for 'slashdot.org'
debug: MX for 'slashdot.org' exists? 1
debug: MX lookup of slashdot.org succeeded => Dns available (set dns_available to hardcode)
debug: is DNS available? 1
debug: all '*From' addrs: ignore@compiling.spamassassin.taint.org
debug: running header regexp tests; score so far=0
debug: running body-text per-line regexp tests; score so far=1.27
debug: Razor2 is not available
debug: running raw-body-text per-line regexp tests; score so far=1.27
debug: running uri tests; score so far=1.27
debug: uri tests: Done uriRE
debug: running full-text regexp tests; score so far=1.27
debug: Razor2 is not available
debug: Current PATH is: /usr/kerberos/sbin:/usr/kerberos/bin:/usr/local/sbin:/usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin:/usr/X11R6/bin
debug: Pyzor is not available: pyzor not found
debug: DCCifd is not available: no r/w dccifd socket found.
debug: DCC is not available: no executable dccproc found.
debug: all '*To' addrs:
debug: RBL: success for 1 of 1 queries
debug: running meta tests; score so far=1.27
debug: is spam? score=1.27 required=7 tests=DATE_MISSING,NO_REAL_NAME
server started on port 783/tcp (running version 2.63)
server pid: 24851
logmsg: server started on port 783/tcp (running version 2.63)
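
The "only 160 ham(s) in Bayes DB < 200" lines above say that Bayes stays unavailable for scanning until the ham count reaches the threshold printed in the log. A minimal sketch for pulling that shortfall out of saved debug output, assuming the log wording shown above; ham_shortfall is a hypothetical helper name, not part of SpamAssassin:

```shell
# Hypothetical helper: reads spamd debug output on stdin and prints how
# many more ham messages are needed before Bayes becomes available.
# Prints nothing if no matching line is found.
ham_shortfall() {
    awk '
        match($0, /only [0-9]+ ham\(s\) in Bayes DB < [0-9]+/) {
            # Matched text splits into: only N ham(s) in Bayes DB < M
            split(substr($0, RSTART, RLENGTH), f, " ")
            print f[8] - f[2]   # threshold minus current ham count
            exit                # the first matching line is enough
        }
    '
}
```

With the debug output saved to a file (any name will do), "ham_shortfall < saved-debug.log" reports the number of ham messages still to be learned.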

rgds.
waly

Kris Deugau wrote:

>WalyBaly wrote:
>  
>
>>spamd -D
>>debug: Score set 0 chosen.
>>debug: Preloading modules with HOME=/tmp/spamd-18698-init
>                                 ^^^^^^^^^^^^^^^^^^^^^^^^^^
>>debug: ignore: test message to precompile patterns and load modules
>>debug: using "/usr/share/spamassassin" for default rules dir
>>debug: using "/etc/mail/spamassassin" for site rules dir
>>debug: bayes: no dbs present, cannot scan:
>>        /tmp/spamd-18698-init/.spamassassin/bayes_toks
>          ^^^^^^^^^^^^^^^^^^^^^^
>>debug: Score set 1 chosen.
>>debug: Initialising learner
>>debug: bayes: no dbs present, cannot scan:
>>        /tmp/spamd-18698-init/.spamassassin/bayes_toks
>          ^^^^^^^^^^^^^^^^^^^^^^
>Er, quite aside from what you're trying to do, this just looks very
>seriously WRONG.  /tmp is NOT somewhere you want to keep persistent data
>such as a global Bayes database...
>
>>and my whole objective is to apply bayes learning to all my users in
>>/var/spool/mail ??
>
>For 2.63, you'll need directives something like this:
>
>bayes_learn_to_journal  1
>bayes_auto_expire       0
>bayes_path      /var/SpamAssassin/bayes
>bayes_file_mode 0777
>
>in a .cf file in /etc/mail/spamassassin.  (I use local.cf.)  Change the
>path to suit yourself as to where you want the bayes_* files to end up. 
>Note that in the most "common/standard" systemwide setup (spamd run as
>root, spamc called from each user's .procmailrc as that user), this
>*MUST* be at least a group-writable directory, unless you also want to
>disable autolearn.  I have it setgid to the group that all mail users
>are members of;  early on I had permissions trouble that didn't go away
>otherwise.
>
>bayes_file_mode is set as it is so that autolearning can take place.
>
>With bayes_auto_expire set to 0 (off), you'll also need a cron job to do
>the expiry.  I use this in /etc/crontab (watch for linewrap):
>
>02 5 * * * root /usr/bin/sa-learn -p /root/.spamassassin/user_prefs --rebuild --force-expire
>
>Note that you may not require the " -p /root/.spamassassin/user_prefs "
>part;  I got odd behaviour when I left it off.  Strangely enough, I have
>no non-comment entries in that file...
>
>Then to learn from all mailboxes in /var/spool/mail (a BAD idea IMO -
>what about FNs that have slipped through any existing filtering?) run
>"sa-learn --ham --mbox --no-rebuild /var/spool/mail/*".  (I *think*
>that should work;  IIRC I've used something similar learning sets of
>mboxes all together.)
>
>-kgd
>  
>
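
The directory setup and bulk learning described in the quoted reply can be sketched in shell roughly as follows. This is only a sketch under assumptions: the function names setup_bayes_dir and print_learn_commands are hypothetical, the group name is whatever group the mail users share on a given system, and the second function only prints the sa-learn commands (a dry run) so they can be reviewed before anything is learned.

```shell
# Sketch only: both function names are hypothetical, not part of
# SpamAssassin.

# Create the directory that holds the shared bayes_* files. Mode 2775
# makes it group-writable and setgid, so new files inherit the group.
setup_bayes_dir() {
    dir=$1
    group=$2
    mkdir -p "$dir" || return 1
    chgrp "$group" "$dir" || return 1
    chmod 2775 "$dir"
}

# Dry run: print one sa-learn command per existing mbox file, then the
# single rebuild that --no-rebuild postpones. $1 is "ham" or "spam".
print_learn_commands() {
    class=$1
    shift
    for mbox in "$@"; do
        [ -f "$mbox" ] || continue   # skip unmatched globs and directories
        printf 'sa-learn --%s --mbox --no-rebuild %s\n' "$class" "$mbox"
    done
    printf 'sa-learn --rebuild\n'
}
```

For the paths used in the reply this would be something like "setup_bayes_dir /var/SpamAssassin mail" (group name assumed) followed by "print_learn_commands ham /var/spool/mail/*". Whether a mailbox is fed in as ham or spam has to match what it really contains, which is the concern raised above about spam that slipped past existing filtering.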