You are viewing a plain text version of this content. The canonical link for it is here.
Posted to users@spamassassin.apache.org by Sietse van Zanen <si...@wizdom.nu> on 2007/03/03 13:27:38 UTC

RE:

ALL, depending which user is the recipient.

If you want a site wide bayes, you should set that up. (run spamd with
-u 'username')

Users receive different e-mails, so their bayes db tokens will be
different too.

-Sietse


-----Original Message-----
From: Kelsey Forsythe [mailto:forsythe@chem.iupui.edu] 
Sent: Saturday, March 03, 2007 09:46
To: users@spamassassin.apache.org
Subject: 

I receive different results running the 'sa-learn --dump magic'  
command depending on the user.

Running as user 'atom':

sa-learn --dump magic
0.000          0          3          0  non-token data: bayes db version
0.000          0       8902          0  non-token data: nspam
0.000          0       2590          0  non-token data: nham
0.000          0     135271          0  non-token data: ntokens
0.000          0 1165600543          0  non-token data: oldest atime
0.000          0 1172187010          0  non-token data: newest atime
0.000          0          0          0  non-token data: last journal  
sync atime
0.000          0 1172188618          0  non-token data: last expiry  
atime
0.000          0    2764800          0  non-token data: last expire  
atime delta
0.000          0      30931          0  non-token data: last expire  
reduction count

And then repeat but as root:

sa-learn --dump magic
0.000          0          3          0  non-token data: bayes db version
0.000          0         95          0  non-token data: nspam
0.000          0         97          0  non-token data: nham
0.000          0      22563          0  non-token data: ntokens
0.000          0 1146776047          0  non-token data: oldest atime
0.000          0 1171272030          0  non-token data: newest atime
0.000          0          0          0  non-token data: last journal  
sync atime
0.000          0          0          0  non-token data: last expiry  
atime
0.000          0          0          0  non-token data: last expire  
atime delta
0.000          0          0          0  non-token data: last expire  
reduction count

And then repeat as clamav:

sa-learn --dump magic
0.000          0          3          0  non-token data: bayes db version
0.000          0       4625          0  non-token data: nspam
0.000          0      19114          0  non-token data: nham
0.000          0     124393          0  non-token data: ntokens
0.000          0 1172484157          0  non-token data: oldest atime
0.000          0 1172909678          0  non-token data: newest atime
0.000          0 1172909274          0  non-token data: last journal  
sync atime
0.000          0 1172829815          0  non-token data: last expiry  
atime
0.000          0     345600          0  non-token data: last expire  
atime delta
0.000          0      39910          0  non-token data: last expire  
reduction count

Which is truly the one used by the SpamAssassin program?

Thanks in advance,
Kelsey


Re:

Posted by Tom Q Citizen <to...@bay-online-media.com>.
Sietse van Zanen wrote:
> ALL, depending which user is the recipient.
>
> If you want a site wide bayes, you should set that up. (run spamd with
> -u 'username')
>
> Users receive different e-mails, so their bayes db tokens will be
> different too.
>   
Excellent!  This is something I've been wondering about as well.  I run 
spamd (3.1.8) like this:

/usr/bin/spamd -c -d -m 3 -H /etc/mail/spamassassin -v -u vpopmail -x 
--virtual-config-dir=/home/vpopmail/domains/%d/%l

I'm running SA in a Qmail/Vpopmail environment.  When I run "sa-learn 
--dump magic", I get larger numbers than when I run for individual users:

[root@linux tom]# sa-learn --dump magic
0.000          0          3          0  non-token data: bayes db version
0.000          0     306286          0  non-token data: nspam
0.000          0      51813          0  non-token data: nham
0.000          0     151436          0  non-token data: ntokens
0.000          0 1165821737          0  non-token data: oldest atime
0.000          0 1166496611          0  non-token data: newest atime
0.000          0 1166495449          0  non-token data: last journal 
sync atime
0.000          0 1166495477          0  non-token data: last expiry atime
0.000          0          0          0  non-token data: last expire 
atime delta
0.000          0          0          0  non-token data: last expire 
reduction count
[root@linux tom]#
[root@linux tom]# sa-learn --dbpath 
/home/vpopmail/domains/somedomain.com/someuser --dump magic
0.000          0          3          0  non-token data: bayes db version
0.000          0       2796          0  non-token data: nspam
0.000          0        113          0  non-token data: nham
0.000          0     139925          0  non-token data: ntokens
0.000          0 1169849102          0  non-token data: oldest atime
0.000          0 1172937996          0  non-token data: newest atime
0.000          0          0          0  non-token data: last journal 
sync atime
0.000          0 1172612681          0  non-token data: last expiry atime
0.000          0    2764800          0  non-token data: last expire 
atime delta
0.000          0      47402          0  non-token data: last expire 
reduction count
[root@linux tom]#

Will a combination of both bayes databases be used or ONLY the one for 
each individual user?

Thanks!

Peace...

Tom