You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@spamassassin.apache.org by Apache Wiki <wi...@apache.org> on 2006/10/02 12:18:37 UTC

[Spamassassin Wiki] Update of "FasterPerformance" by JustinMason

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Spamassassin Wiki" for change notification.

The following page has been changed by JustinMason:
http://wiki.apache.org/spamassassin/FasterPerformance

The comment on the change is:
reorder, and emphasise sa-blacklist; del mention of utf-8, it's obsolete

------------------------------------------------------------------------------
  
  == General Advice ==
  
- Use [http://www.spamassassin.org/full/3.0.x/dist/spamd/README spamd].
- 
- Ensure you are not using a locale that uses UTF-8; UTF-8 character sets have higher overhead for text-processing applications: Utf8Performance
- 
- If you are using network tests, install a local DNS server (BIND named, for example) on the same host to cache responses, and set the /etc/resolv.conf file to use that instead of one on another machine.  See CachingNameserver.
- 
- Examine the custom rule set files you use.
+ Examine the custom rule set files you use:
  
   * Avoid large rule sets, those over 100k or 150k in size. The more rules you have, the slower SA will run. 
-  * In particular, the {{{sa-blacklist}}} and {{{sa-blacklist-uri}}} rulesets are extremely heavyweight, and greatly affect performance.  If you're using these, and running into performance issues, ''remove them immediately''.  (Use network tests instead, since they are there as URIBL_WS_SURBL.  See [http://wiki.apache.org/spamassassin/OutOfMemoryProblems#head-198fc106917f358aea90b95047299e4de6c0443d OutOfMemoryProblems].)
+  * In particular, the {{{sa-blacklist}}} and {{{sa-blacklist-uri}}} rulesets are extremely heavyweight, and greatly affect performance.  If you're using these, and running into performance issues, '''remove them immediately'''.  (Use network tests instead, since they are there as URIBL_WS_SURBL.  See [http://wiki.apache.org/spamassassin/OutOfMemoryProblems#head-198fc106917f358aea90b95047299e4de6c0443d OutOfMemoryProblems].)
   * Pick rule set files that are more productive. In the SARE families published by Bob Menschel, use files 0 and 1 for productivity / efficiency, and avoid files 2 and 3. 
   * Remove and re-add rule set files one at a time, and check performance after each change. If one rule set file causes a huge change in performance, take appropriate action. 
  
  Examine the custom rules you create, or have downloaded from third parties.  Poorly-written regular expressions can use resources exponentially.  Avoid body, rawbody, or full rules that use {{{+}}} or {{{*}}} quantifiers. 
+ 
+ Use [http://www.spamassassin.org/full/3.0.x/dist/spamd/README spamd].
+ 
+ If you are using network tests, install a local DNS server (BIND named, for example) on the same host to cache responses, and set the /etc/resolv.conf file to use that instead of one on another machine.  See CachingNameserver.
  
  If you're having period load spikes, it could be because the Bayes database is re-syncing and expiring old tokens. See BayesForceExpire for a workaround.