Posted to users@spamassassin.apache.org by Kelsey Forsythe <fo...@chem.iupui.edu> on 2007/03/03 09:46:09 UTC

(Unknown)

I receive different results running the 'sa-learn --dump magic'  
command depending on the user.

Running as user ‘atom’:

sa-learn --dump magic
0.000          0          3          0  non-token data: bayes db version
0.000          0       8902          0  non-token data: nspam
0.000          0       2590          0  non-token data: nham
0.000          0     135271          0  non-token data: ntokens
0.000          0 1165600543          0  non-token data: oldest atime
0.000          0 1172187010          0  non-token data: newest atime
0.000          0          0          0  non-token data: last journal sync atime
0.000          0 1172188618          0  non-token data: last expiry atime
0.000          0    2764800          0  non-token data: last expire atime delta
0.000          0      30931          0  non-token data: last expire reduction count

Then the same command as root:

sa-learn --dump magic
0.000          0          3          0  non-token data: bayes db version
0.000          0         95          0  non-token data: nspam
0.000          0         97          0  non-token data: nham
0.000          0      22563          0  non-token data: ntokens
0.000          0 1146776047          0  non-token data: oldest atime
0.000          0 1171272030          0  non-token data: newest atime
0.000          0          0          0  non-token data: last journal sync atime
0.000          0          0          0  non-token data: last expiry atime
0.000          0          0          0  non-token data: last expire atime delta
0.000          0          0          0  non-token data: last expire reduction count

Then the same command as clamav:

sa-learn --dump magic
0.000          0          3          0  non-token data: bayes db version
0.000          0       4625          0  non-token data: nspam
0.000          0      19114          0  non-token data: nham
0.000          0     124393          0  non-token data: ntokens
0.000          0 1172484157          0  non-token data: oldest atime
0.000          0 1172909678          0  non-token data: newest atime
0.000          0 1172909274          0  non-token data: last journal sync atime
0.000          0 1172829815          0  non-token data: last expiry atime
0.000          0     345600          0  non-token data: last expire atime delta
0.000          0      39910          0  non-token data: last expire reduction count

Which one does SpamAssassin actually use?

Thanks in advance,
Kelsey
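
To compare dumps like these side by side, the output can be parsed into a dictionary. What follows is a minimal Python sketch, not part of SpamAssassin: the function name parse_dump_magic is hypothetical, it assumes the four-column "non-token data" layout shown in this message, and it tolerates labels that a mail client has wrapped onto a continuation line. The atime fields appear to be Unix epoch seconds, so the sketch also converts one to a UTC date.

```python
from datetime import datetime, timezone


def parse_dump_magic(text):
    """Parse 'sa-learn --dump magic' output into {label: int value}.

    Assumes each record looks like
    '0.000  0  <value>  0  non-token data: <label>' and that the end of
    a long label may have been wrapped onto the following line.
    """
    fields = {}
    last_label = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        parts = line.split(None, 4)
        if len(parts) == 5 and parts[4].startswith("non-token data:"):
            last_label = parts[4].split(":", 1)[1].strip()
            fields[last_label] = int(parts[2])
        elif last_label is not None:
            # Continuation of a wrapped label, e.g. "atime delta".
            value = fields.pop(last_label)
            last_label = f"{last_label} {line}"
            fields[last_label] = value
    return fields


# A few lines copied from the clamav dump in this message.
sample = """\
0.000          0          3          0  non-token data: bayes db version
0.000          0       4625          0  non-token data: nspam
0.000          0      19114          0  non-token data: nham
0.000          0 1172909678          0  non-token data: newest atime
0.000          0     345600          0  non-token data: last expire
atime delta
"""

stats = parse_dump_magic(sample)
newest = datetime.fromtimestamp(stats["newest atime"], tz=timezone.utc)
print(stats["nspam"], stats["nham"])
print(newest.isoformat())
print(stats["last expire atime delta"] / 86400, "days")
```

Read this way, the clamav database was last touched on the day of this post, which suggests it is the one in active use on this machine, though that alone does not prove which account the mail scanner runs as.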


Re:

Posted by Tom Q Citizen <to...@bay-online-media.com>.
Sietse van Zanen wrote:
> All of them, depending on which user is the recipient.
>
> If you want a site-wide Bayes database, you should set that up (run
> spamd with -u 'username').
>
> Users receive different e-mails, so their bayes db tokens will be
> different too.
>   
Excellent!  This is something I've been wondering about as well.  I run 
spamd (3.1.8) like this:

/usr/bin/spamd -c -d -m 3 -H /etc/mail/spamassassin -v -u vpopmail -x --virtual-config-dir=/home/vpopmail/domains/%d/%l

I'm running SA in a Qmail/Vpopmail environment.  When I run "sa-learn 
--dump magic" as root, I get larger numbers than when I run it against an 
individual user's database:

[root@linux tom]# sa-learn --dump magic
0.000          0          3          0  non-token data: bayes db version
0.000          0     306286          0  non-token data: nspam
0.000          0      51813          0  non-token data: nham
0.000          0     151436          0  non-token data: ntokens
0.000          0 1165821737          0  non-token data: oldest atime
0.000          0 1166496611          0  non-token data: newest atime
0.000          0 1166495449          0  non-token data: last journal sync atime
0.000          0 1166495477          0  non-token data: last expiry atime
0.000          0          0          0  non-token data: last expire atime delta
0.000          0          0          0  non-token data: last expire reduction count
[root@linux tom]#
[root@linux tom]# sa-learn --dbpath /home/vpopmail/domains/somedomain.com/someuser --dump magic
0.000          0          3          0  non-token data: bayes db version
0.000          0       2796          0  non-token data: nspam
0.000          0        113          0  non-token data: nham
0.000          0     139925          0  non-token data: ntokens
0.000          0 1169849102          0  non-token data: oldest atime
0.000          0 1172937996          0  non-token data: newest atime
0.000          0          0          0  non-token data: last journal sync atime
0.000          0 1172612681          0  non-token data: last expiry atime
0.000          0    2764800          0  non-token data: last expire atime delta
0.000          0      47402          0  non-token data: last expire reduction count
[root@linux tom]#

Will a combination of both bayes databases be used or ONLY the one for 
each individual user?

Thanks!

Peace...

Tom
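
For reference, the spamd documentation describes %d in --virtual-config-dir as the domain part and %l as the local part of the username the client passes in. A minimal Python sketch of that substitution shows which directory a given recipient would map to with the command line above. The helper expand_virtual_config_dir is hypothetical, not spamd's own code, and it handles only %u, %l, %d and %%.

```python
import re


def expand_virtual_config_dir(pattern, username):
    """Illustrative expansion of a spamd --virtual-config-dir pattern.

    Hypothetical helper, not spamd's implementation. Handles only
    %u (full username), %l (local part), %d (domain) and %% (literal %).
    """
    local, _, domain = username.partition("@")
    values = {"u": username, "l": local, "d": domain, "%": "%"}
    return re.sub(r"%([uld%])", lambda m: values[m.group(1)], pattern)


path = expand_virtual_config_dir(
    "/home/vpopmail/domains/%d/%l", "someuser@somedomain.com"
)
print(path)
```

For someuser@somedomain.com this gives the same directory that was passed to --dbpath above, which would be consistent with each virtual user getting its own database rather than a combination of the two.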



RE:

Posted by Sietse van Zanen <si...@wizdom.nu>.
All of them, depending on which user is the recipient.

If you want a site-wide Bayes database, you should set that up (run
spamd with -u 'username').

Users receive different e-mails, so their bayes db tokens will be
different too.

-Sietse




Re:

Posted by Matt Kettler <mk...@verizon.net>.
Kelsey Forsythe wrote:
> I receive different results running the 'sa-learn --dump magic'
> command depending on the user.
That's normal.

>
> Which one does SpamAssassin actually use?

Which user is actually being used to call SpamAssassin? The answer
depends on how you've set your system up.

So how do you call SA? From your previous message, it looks like you use
amavis. If so, none of those Bayes databases is used, because amavis
invokes SA as the user "amavis".
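
Since each invoking account gets its own database, one way to see which files a given account would use is to look under that account's home directory. The Python sketch below assumes the documented default bayes_path of ~/.spamassassin/bayes and no bayes_path override in local.cf or user_prefs. The helper name default_bayes_prefix is hypothetical, and the home directories in the loop are examples only, not taken from this thread.

```python
import os.path


def default_bayes_prefix(home_dir):
    """Return the default Bayes file prefix for an account's home directory.

    Assumes the default bayes_path (~/.spamassassin/bayes). With the
    default DBM storage, the database files typically sit next to this
    prefix with names such as bayes_toks and bayes_seen.
    """
    return os.path.join(home_dir, ".spamassassin", "bayes")


# Example home directories; these are assumptions, check your own system.
for user, home in [("root", "/root"), ("amavis", "/var/lib/amavis")]:
    print(user, default_bayes_prefix(home))
```

Running 'sa-learn --dump magic' as the account that actually invokes SA (for example via su or sudo) should then show the counts for the database that is really in use.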