Posted to users@spamassassin.apache.org by Jeff Chan <je...@surbl.org> on 2005/05/11 12:02:28 UTC

Re: [SPAM-TAG] RE: SpamCopURI not working, was RE: More Messed Up www URLs

On Tuesday, May 10, 2005, 9:32:45 AM, John Stewart wrote:

> SA lints fine... running it in debug mode, it appears to not be checking
> anything but the multi records. See below.

> I've grepped through /usr/share/spamassassin and /etc/mail/spamassassin, and
> the only URI_RBL reference I find in any .cf file is in
> /etc/mail/spamassassin/spamcop_uri.cf, which is the config file included with
> SpamCopURI-0.25 (which has rules and scores for 7 different _URI_RBL's). The
> only one I'm seeing *ever* hit in my logfiles is SPAMCOP_URL_RBL.

> This is really killing my spam scanning performance...!

> [...]
> debug: using "/usr/share/spamassassin" for default rules dir
> debug: using "/etc/mail/spamassassin" for site rules dir
> debug: using "/var/amavis/.spamassassin" for user state dir
> debug: using "/var/amavis/.spamassassin/user_prefs" for user prefs file    
> [...]
> debug: Razor2 results: spam? 0  highest cf score: 0
> debug: running raw-body-text per-line regexp tests; score so far=0
> debug: running uri tests; score so far=0
> debug: uri tests: Done uriRE
> debug: checking url: http://www.achat-montre-rolex.net./
> debug: querying for achat-montre-rolex.net.multi.surbl.org

> debug: Query failed for achat-montre-rolex.net.multi.surbl.org
> debug: Receieved match prefix: 127.0.0
> debug: Receieved mask: 2
> debug: no match
> debug: checking url: http://www.achat-montre-rolex.net./
> debug: returning cached data :  achat-montre-rolex.net.multi.surbl.org ->
> ARRAY(0x9b20414)
> debug: Receieved match prefix: 127.0.0
> debug: Receieved mask: 4
> debug: no match
[...]

Your configuration and installation are fine.  multi.surbl.org is
the only list that should be checked, as it's the combined list
with all other SURBL lists included:

  http://www.surbl.org/lists.html#multi
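
To illustrate why a single multi lookup is enough, here is a rough
Python sketch of how a combined-list answer can be tested against
several masks, as the "match prefix" and "mask" lines in the debug
output above suggest.  The function name and sample answers are
hypothetical, and this is not SpamCopURI's actual code; see the page
above for which bit belongs to which list.

```python
import ipaddress


def multi_answer_matches(answer: str, mask: int, prefix: str = "127.0.0") -> bool:
    """Return True if the A record `answer` has any bit of `mask` set.

    Assumes the combined list answers with prefix.X, where the last
    octet X is a bitmask with one bit per underlying list.
    """
    addr = ipaddress.IPv4Address(answer)  # raises ValueError if malformed
    head, _, last = str(addr).rpartition(".")
    return head == prefix and (int(last) & mask) != 0


# One DNS answer can be checked against every mask without another query.
sample = "127.0.0.6"  # hypothetical answer with bits 2 and 4 set
for mask in (2, 4, 8):
    print(mask, multi_answer_matches(sample, mask))
```

Each rule then only needs its own mask, and the cached answer from
the first query can be reused for the rest.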

It looks like the issue is that SpamCopURI is getting fooled by
the trailing dot in the URI, much as the ":" and other stray
characters formerly confused SA 3.0.  Let's ask Eric Kolve to
please update SpamCopURI to ignore these extra characters at the
end of the host portion of URIs, as SA 3.1 now does as a result
of this bug fix:

  http://bugzilla.spamassassin.org/show_bug.cgi?id=4191
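
As a rough illustration of the kind of cleanup I mean, here is a
minimal Python sketch that strips trailing dots from the host before
building the query name.  It is not the SpamCopURI or SA code (both
are Perl), the function name is made up for the example, and taking
the last two labels as the registered domain is a simplifying
assumption that is wrong for two-level ccTLDs.

```python
from urllib.parse import urlsplit


def surbl_query_name(uri: str, zone: str = "multi.surbl.org") -> str:
    """Build a lookup name from a URI, ignoring trailing dots on the host."""
    # .hostname lowercases the host and drops any port (including an
    # empty one after a stray ":"), but it keeps a trailing dot.
    host = urlsplit(uri).hostname or ""
    host = host.rstrip(".")
    # Simplifying assumption: the registered domain is the last two labels.
    domain = ".".join(host.split(".")[-2:])
    return f"{domain}.{zone}"


print(surbl_query_name("http://www.achat-montre-rolex.net./"))
print(surbl_query_name("http://www.achat-montre-rolex.net/"))
```

With the trailing dot removed, both forms of the URI produce the
same query name.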

Jeff C.
-- 
Jeff Chan
mailto:jeffc@surbl.org
http://www.surbl.org/