You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@spamassassin.apache.org by Apache Wiki <wi...@apache.org> on 2009/09/29 16:40:55 UTC

[Spamassassin Wiki] Update of "CorpusCleaning" by JustinMason

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Spamassassin Wiki" for change notification.

The "CorpusCleaning" page has been changed by JustinMason:
http://wiki.apache.org/spamassassin/CorpusCleaning?action=diff&rev1=16&rev2=17

  You can then take that mbox file, grep out the original MassCheck message id strings, and remove those lines from the 'spam.log' file:
  
  {{{
- grep X-Mass-Check-Id mbox | sed -e 's/^X-Mass-Check-Id: //' > id.fps
+ grep -a X-Mass-Check-Id mbox | sed -e 's/^X-Mass-Check-Id: //' > id.fps
  ./remove-ids-from-mclog id.fps < spam.log > spam.log.new
  mv spam.log.new spam.log
  }}}
@@ -80, +80 @@

  Delete the messages that are good, usable ham, leaving only spams, hams that include bits of spam, virus blowback, bounces, or whatever other undesirable messages you want to get rid of.  Quit and save.
  
  {{{
- grep X-Mass-Check-Id mbox | sed -e 's/^X-Mass-Check-Id: //' > id.fns
+ grep -a X-Mass-Check-Id mbox | sed -e 's/^X-Mass-Check-Id: //' > id.fns
  ./remove-ids-from-mclog id.fns < ham.log > ham.log.new
  mv ham.log.new ham.log
  }}}