You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@spamassassin.apache.org by Apache Wiki <wi...@apache.org> on 2006/10/02 12:18:37 UTC
[Spamassassin Wiki] Update of "FasterPerformance" by JustinMason
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Spamassassin Wiki" for change notification.
The following page has been changed by JustinMason:
http://wiki.apache.org/spamassassin/FasterPerformance
The comment on the change is:
reorder, and emphasise sa-blacklist; del mention of utf-8, it's obsolete
------------------------------------------------------------------------------
== General Advice ==
- Use [http://www.spamassassin.org/full/3.0.x/dist/spamd/README spamd].
-
- Ensure you are not using a locale that uses UTF-8; UTF-8 character sets have higher overhead for text-processing applications: Utf8Performance
-
- If you are using network tests, install a local DNS server (BIND named, for example) on the same host to cache responses, and set the /etc/resolv.conf file to use that instead of one on another machine. See CachingNameserver.
-
- Examine the custom rule set files you use.
+ Examine the custom rule set files you use:
* Avoid large rule sets, those over 100k or 150k in size. The more rules you have, the slower SA will run.
- * In particular, the {{{sa-blacklist}}} and {{{sa-blacklist-uri}}} rulesets are extremely heavyweight, and greatly affect performance. If you're using these, and running into performance issues, ''remove them immediately''. (Use network tests instead, since they are there as URIBL_WS_SURBL. See [http://wiki.apache.org/spamassassin/OutOfMemoryProblems#head-198fc106917f358aea90b95047299e4de6c0443d OutOfMemoryProblems].)
+ * In particular, the {{{sa-blacklist}}} and {{{sa-blacklist-uri}}} rulesets are extremely heavyweight, and greatly affect performance. If you're using these, and running into performance issues, '''remove them immediately'''. (Use network tests instead, since they are there as URIBL_WS_SURBL. See [http://wiki.apache.org/spamassassin/OutOfMemoryProblems#head-198fc106917f358aea90b95047299e4de6c0443d OutOfMemoryProblems].)
* Pick rule set files that are more productive. In the SARE families published by Bob Menschel, use files 0 and 1 for productivity / efficiency, and avoid files 2 and 3.
* Remove and re-add rule set files one at a time, and check performance after each change. If one rule set file causes a huge change in performance, take appropriate action.
Examine the custom rules you create, or have downloaded from third parties. Poorly-written regular expressions can use resources exponentially. Avoid body, rawbody, or full rules that use {{{+}}} or {{{*}}} quantifiers.
+
+ Use [http://www.spamassassin.org/full/3.0.x/dist/spamd/README spamd].
+
+ If you are using network tests, install a local DNS server (BIND named, for example) on the same host to cache responses, and set the /etc/resolv.conf file to use that instead of one on another machine. See CachingNameserver.
If you're having period load spikes, it could be because the Bayes database is re-syncing and expiring old tokens. See BayesForceExpire for a workaround.