Posted to commits@spamassassin.apache.org by Apache Wiki <wi...@apache.org> on 2005/08/04 03:54:36 UTC

[Spamassassin Wiki] Update of "RescoreMassCheck" by JustinMason

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Spamassassin Wiki" for change notification.

The following page has been changed by JustinMason:
http://wiki.apache.org/spamassassin/RescoreMassCheck

The comment on the change is:
expand the post-perceptron end

------------------------------------------------------------------------------
  
  == 5. generate scores for score sets ==
  
- See RunningPerceptron.   Once this is complete, update rules/50_scores.cf with the generated scores.
+ See RunningPerceptron.
  
+ Once this is complete, rules/50_scores.cf should contain the generated scores, written there by runGA. (TODO: I think; needs confirming.)
+ 
+ Set aside the set of test logs created by runGA, for use in later statistics-generation steps, and install the new STATISTICS files:
+ 
+ {{{
+   cd masses
+   tar cvfz testlogs.tgz \
+           gen-set{0,1,2,3}-$LOGDIR/SPBASE/spam-test.log \
+           gen-set{0,1,2,3}-$LOGDIR/NSBASE/ham-test.log
+   cp gen-set0-$LOGDIR/statistics ../rules/STATISTICS-set0.txt
+   cp gen-set1-$LOGDIR/statistics ../rules/STATISTICS-set1.txt
+   cp gen-set2-$LOGDIR/statistics ../rules/STATISTICS-set2.txt
+   cp gen-set3-$LOGDIR/statistics ../rules/STATISTICS-set3.txt
+ }}}
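The commands above can also be wrapped in a small function that fails early if any expected file is missing. This is only a sketch: the function name save_rescore_outputs is hypothetical, it assumes it is run from the masses directory, and it assumes $LOGDIR contains no whitespace.

```shell
# Hypothetical helper (not part of the SpamAssassin tree): archive the
# test logs and install the STATISTICS files for score sets 0-3.
# Usage (from the masses directory): save_rescore_outputs "$LOGDIR"
save_rescore_outputs() {
  logdir=$1
  logs=""
  for n in 0 1 2 3; do
    dir="gen-set${n}-${logdir}"
    # Refuse to continue if runGA did not leave the expected files behind.
    for f in "$dir/SPBASE/spam-test.log" "$dir/NSBASE/ham-test.log" "$dir/statistics"; do
      [ -f "$f" ] || { echo "missing: $f" >&2; return 1; }
    done
    logs="$logs $dir/SPBASE/spam-test.log $dir/NSBASE/ham-test.log"
  done
  # $logs is deliberately unquoted so that it splits into separate paths
  # (this is why LOGDIR must not contain whitespace).
  tar cfz testlogs.tgz $logs || return 1
  for n in 0 1 2 3; do
    cp "gen-set${n}-${logdir}/statistics" "../rules/STATISTICS-set${n}.txt" || return 1
  done
}
```

Checking for the files first avoids ending up with a partial testlogs.tgz when one of the four runs did not complete.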
+ 
+ == 6. upload results ==
+ 
+ Attach the newly proposed 50_scores.cf and STATISTICS*.txt files as a patch to the rescoring bug in Bugzilla, for voting and comments. ;)
+ 
+ {{{
+   cd ..
+   svn diff rules/50_scores.cf rules/STAT* > ~/newscores.diff
+   # now upload ~/newscores.diff to the rescoring bug using your web browser
+ }}}
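Before uploading, it may be worth checking that the diff actually contains what is expected. The following is a minimal sketch; check_patch is a hypothetical helper, and it only looks for file names in the diff text rather than validating the patch itself.

```shell
# Hypothetical pre-upload sanity check for the patch made by "svn diff".
# Usage: check_patch ~/newscores.diff
check_patch() {
  patch=$1
  # An empty diff usually means nothing changed or the wrong directory was used.
  [ -s "$patch" ] || { echo "empty or missing: $patch" >&2; return 1; }
  grep -q '50_scores\.cf' "$patch" || {
    echo "no 50_scores.cf changes in $patch" >&2
    return 1
  }
  # A missing STATISTICS file only triggers a warning, not a failure.
  for n in 0 1 2 3; do
    grep -q "STATISTICS-set$n\.txt" "$patch" ||
      echo "warning: STATISTICS-set$n.txt not mentioned in $patch" >&2
  done
  return 0
}
```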
+ 
+ == 7. upload the test logs ==
+ 
+ Since files such as STATISTICS can never be regenerated without the (randomised) test logs, the logs need to be saved too. Currently, I think the best bet is to upload the {{{testlogs.tgz}}} file somewhere on spamassassin.zones.apache.org; it doesn't have to be in a public place, and a location that requires an ASF committer account is fine.
+