You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@spamassassin.apache.org by Warren Togami <wt...@redhat.com> on 2009/08/10 08:30:35 UTC

Rescoring questions

On 08/05/2009 06:30 PM, Justin Mason wrote:
> it's distributed.
>
> Basically, it goes like this:
> http://wiki.apache.org/spamassassin/RescoreMassCheck

I am wondering...

* What is different about the rescoring mass checks that it cannot be 
done directly from the regular nightly?

* What is different about the rescoring mass checks than whatever is 
used to generate the data for sa-update?

* http://wiki.apache.org/spamassassin/SaUpdateBackend
Is this page update to date?  Some of the links are dead.

Warren Togami
wtogami@redhat.com

Re: Rescoring questions

Posted by Justin Mason <jm...@jmason.org>.
On Mon, Aug 10, 2009 at 07:30, Warren Togami<wt...@redhat.com> wrote:
> On 08/05/2009 06:30 PM, Justin Mason wrote:
>>
>> it's distributed.
>>
>> Basically, it goes like this:
>> http://wiki.apache.org/spamassassin/RescoreMassCheck
>
> I am wondering...
>
> * What is different about the rescoring mass checks that it cannot be done
> directly from the regular nightly?
>
> * What is different about the rescoring mass checks than whatever is used to
> generate the data for sa-update?

Initially, we did not mass-check sufficiently large quantities of mail
for the nightly, or use Bayes or network rules.  However I'm now
thinking we can just pick a nightly set of logs, and use that, since
we already made changes to the nightly process last year to support
generating network (set1/3) rulesets.

> * http://wiki.apache.org/spamassassin/SaUpdateBackend
> Is this page update to date?  Some of the links are dead.

probably out of date.  I'll take a look.

-- 
--j.