Posted to users@spamassassin.apache.org by Mike Brown <mi...@skew.org> on 2013/06/11 10:28:21 UTC

sa-update: MIRRORED.BY is 404 for any channel

I'm running 3.3.2 on two FreeBSD 8.3 systems on different networks. Both 
systems are configured roughly identically with regard to SpamAssassin. One 
system runs Perl 5.16 (not sure if that matters) and can run sa-update without 
error, but the other runs Perl 5.12 and gets 404s when it tries to update 
MIRRORED.BY for any channel. Well, updates.spamassassin.org or 
sought.rules.yerp.org are the ones I tried, at least:


Jun 11 00:35:14.769 [53689] dbg: http: GET http://yerp.org/rules/MIRRORED.BY" request failed, retrying: 404 Not Found: <?xml version="1.0" encoding="iso-8859-1"?> <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en"> <head> <title>404 - Not Found</title> </head> <body> <h1>404 - Not Found</h1> </body> </html>
(repeat 3X)

Jun 11 00:05:07.327 [43091] dbg: http: GET http://spamassassin.apache.org/updates/MIRRORED.BY" request failed, retrying: 404 Not Found: <!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN"> <html><head> <title>404 Not Found</title> </head><body> <h1>Not Found</h1> <p>The requested URL /updates/MIRRORED.BY&quot; was not found on this server.</p> <hr> <address>Apache/2.4.4 (Unix) OpenSSL/1.0.1e Server at spamassassin.apache.org Port 80</address> </body></html>
(repeat 3X)


Same thing happens even if I try only updates.spamassassin.org without gpg.

These 404 errors look just like those in bug 6914, but neither of my systems 
is IPv6-only. IPv6 isn't routing on either system (ping6 -c3 ::1 is the only 
thing that works).

Bug 6838 was no help, either, because the mirrors in question are alive and 
like I said, the one system can sa-update without any issues.

On both systems I get the same DNS results, nothing out of the ordinary. And 
when running curl or whatever, I can (via IPv4) fetch the MIRRORED.BY file 
from the same URLs just fine, as well as talk to the mirror sites listed 
therein. So connecting to the sites shouldn't be an issue. 

Sorry if this is a FAQ, user error, or typical newbie configuration oversight, 
but I searched for answers for quite a while before resorting to posting here. 
Any assistance would be much appreciated. How can I further diagnose this?

Re: sa-update: MIRRORED.BY is 404 for any channel

Posted by Quanah Gibson-Mount <qu...@zimbra.com>.
--On Wednesday, June 12, 2013 10:12 PM -0600 Mike Brown <mi...@skew.org> 
wrote:

> Martin wrote:
>> Do you have a MIRRORED.BY file in your spamassassin update directory? It
>> looks like it doesn't have the file with the mirrors in and instead is
>> using the file name.
>>
>> If so you could copy it over from your other box that's working.
>>
>
> Thanks; your suggestion worked.
>
> The way MIRRORED.BY files get used and updated(?) is a complete mystery
> to me.

I hit a bug a while back where, if you end up with an empty file (say because 
of a timeout), all future updates fail until the file is removed.  This is 
fixed in the current 3.4 SVN checkout.  At this point I use the SVN 3.4.0 
version on all my production servers (as do many others).
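
One way to look for the empty-file case described above is to scan the update
directory for MIRRORED.BY files that contain no usable mirror lines. This is
only a sketch, not part of sa-update: the function name is made up, and the
directory you pass in is an assumption, since the location of the update
directory varies by install.

```python
from pathlib import Path


def find_empty_mirror_files(update_dir):
    """Return MIRRORED.BY files under update_dir that have no usable lines.

    A line counts as usable if it is neither blank nor a '#' comment.
    update_dir is whatever directory your install keeps its updates in
    (an assumption; check your own system).
    """
    empty = []
    for path in sorted(Path(update_dir).rglob("MIRRORED.BY")):
        lines = path.read_text(errors="replace").splitlines()
        usable = [
            ln for ln in lines
            if ln.strip() and not ln.lstrip().startswith("#")
        ]
        if not usable:
            empty.append(path)
    return empty
```

Any path it returns is a candidate for removal or for replacement with a copy
from a working system, as suggested elsewhere in this thread.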

--Quanah


--

Quanah Gibson-Mount
Sr. Member of Technical Staff
Zimbra, Inc
A Division of VMware, Inc.
--------------------
Zimbra ::  the leader in open source messaging and collaboration

Re: sa-update: MIRRORED.BY is 404 for any channel

Posted by Mike Brown <mi...@skew.org>.
Martin wrote:
> Do you have a MIRRORED.BY file in your spamassassin update directory? It 
> looks like it doesn't have the file with the mirrors in and instead is using 
> the file name.
> 
> If so you could copy it over from your other box that's working.
> 

Thanks; your suggestion worked.

The way MIRRORED.BY files get used and updated(?) is a complete mystery to me. 

There was nothing in the old file that was wrong, syntax-wise. It was the same 
as the current, working one, but with the last line (the secnap mirror) 
commented out by me after seeing something about that site having problems a 
while back. How this difference results in the 404s I was seeing is not at all 
obvious.
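
To see exactly how an old MIRRORED.BY differs from a working copy, a unified
diff of the two files is enough. A minimal Python sketch, assuming both files
have already been read into strings; the function name and the file labels in
the diff header are hypothetical, not taken from either system.

```python
import difflib


def diff_mirror_files(old_text, new_text):
    """Return unified-diff lines between two MIRRORED.BY contents.

    An empty list means the two contents are line-for-line identical.
    """
    return list(
        difflib.unified_diff(
            old_text.splitlines(),
            new_text.splitlines(),
            fromfile="old/MIRRORED.BY",   # label only, not a real path
            tofile="working/MIRRORED.BY",  # label only, not a real path
            lineterm="",
        )
    )
```

A commented-out mirror would show up as one removed line starting with "-#"
and one added line starting with "+http". Anything else in the output (stray
characters, odd line endings) would be worth a closer look.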

RE: sa-update: MIRRORED.BY is 404 for any channel

Posted by Martin <ma...@ntlworld.com>.
 

> -----Original Message-----
> From: Mike Brown [mailto:mike@skew.org] 
> Sent: Tuesday, June 11, 2013 10:38 AM
> To: users@spamassassin.apache.org
> Subject: Re: sa-update: MIRRORED.BY is 404 for any channel
> 
> John Wilcock wrote:
> > > Jun 11 00:05:07.327 [43091] dbg: http: GET http://spamassassin.apache.org/updates/MIRRORED.BY" request failed, retrying: 404 Not Found: <!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN"> <html><head> <title>404 Not Found</title> </head><body> <h1>Not Found</h1> <p>The requested URL /updates/MIRRORED.BY&quot; was not found on this server.</p> <hr> <address>Apache/2.4.4 (Unix) OpenSSL/1.0.1e Server at spamassassin.apache.org Port 80</address> </body></html>
> > > (repeat 3X)
> > 
> > Note the trailing quote marks on those two URLs. I've no idea where 
> > they came from, but it could well be a simple config error...
> 
> Whoa, you're right. I'm so used to treating quotes as not part of a URL, I
> didn't even see they're part of what's being requested.
> 
> I note they appear in the bugzilla report I referred to, as well:
> https://issues.apache.org/SpamAssassin/show_bug.cgi?format=multiple&id=6914
> 
> MIRRORED.BY isn't something I fetch myself; I just run sa-update with various
> options set. So where are the quotes coming from?
> 

Do you have a MIRRORED.BY file in your spamassassin update directory? It 
looks like it doesn't have the file with the mirrors in and instead is using 
the file name.

If so you could copy it over from your other box that's working.


Re: sa-update: MIRRORED.BY is 404 for any channel

Posted by Mike Brown <mi...@skew.org>.
John Wilcock wrote:
> > Jun 11 00:05:07.327 [43091] dbg: http: GET http://spamassassin.apache.org/updates/MIRRORED.BY" request failed, retrying: 404 Not Found: <!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN"> <html><head> <title>404 Not Found</title> </head><body> <h1>Not Found</h1> <p>The requested URL /updates/MIRRORED.BY&quot; was not found on this server.</p> <hr> <address>Apache/2.4.4 (Unix) OpenSSL/1.0.1e Server at spamassassin.apache.org Port 80</address> </body></html>
> > (repeat 3X)
> 
> Note the trailing quote marks on those two URLs. I've no idea where they 
> came from, but it could well be a simple config error...

Whoa, you're right. I'm so used to treating quotes as not part of a URL, I 
didn't even see they're part of what's being requested.

I note they appear in the bugzilla report I referred to, as well:
https://issues.apache.org/SpamAssassin/show_bug.cgi?format=multiple&id=6914

MIRRORED.BY isn't something I fetch myself; I just run sa-update with various 
options set. So where are the quotes coming from?

Re: sa-update: MIRRORED.BY is 404 for any channel

Posted by John Wilcock <jo...@tradoc.fr>.
Le 11/06/2013 10:28, Mike Brown a écrit :
> I'm running 3.3.2 on two FreeBSD 8.3 systems on different networks. Both
> systems are configured roughly identically with regard to SpamAssassin. One
> system runs Perl 5.16 (not sure if that matters) and can run sa-update without
> error, but the other runs Perl 5.12 and gets 404s when it tries to update
> MIRRORED.BY for any channel. Well, updates.spamassassin.org or
> sought.rules.yerp.org are the ones I tried, at least:
>
>
> Jun 11 00:35:14.769 [53689] dbg: http: GET http://yerp.org/rules/MIRRORED.BY" request failed, retrying: 404 Not Found: <?xml version="1.0" encoding="iso-8859-1"?> <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en"> <head> <title>404 - Not Found</title> </head> <body> <h1>404 - Not Found</h1> </body> </html>
> (repeat 3X)
>
> Jun 11 00:05:07.327 [43091] dbg: http: GET http://spamassassin.apache.org/updates/MIRRORED.BY" request failed, retrying: 404 Not Found: <!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN"> <html><head> <title>404 Not Found</title> </head><body> <h1>Not Found</h1> <p>The requested URL /updates/MIRRORED.BY&quot; was not found on this server.</p> <hr> <address>Apache/2.4.4 (Unix) OpenSSL/1.0.1e Server at spamassassin.apache.org Port 80</address> </body></html>
> (repeat 3X)

Note the trailing quote marks on those two URLs. I've no idea where they 
came from, but it could well be a simple config error...
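
The `&quot;` in the 404 page is just the HTML-escaped form of a double quote,
so the server really was asked for a path ending in `"`. A small Python sketch
of how one might check a MIRRORED.BY file for that kind of stray character. It
assumes the usual layout of one mirror URL per line, optionally followed by
options such as weight=N, with '#' starting a comment line. The function names
are hypothetical, and the thread never establishes where the quote actually
came from.

```python
from urllib.parse import quote


def suspicious_mirror_lines(text):
    """Return (line_number, url) pairs for mirror entries that look wrong.

    An entry is flagged if its first token is not an http(s) URL or if
    that token contains a quote character.
    """
    problems = []
    for lineno, raw in enumerate(text.splitlines(), start=1):
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # blank or comment line
        url = line.split()[0]
        bad_scheme = not url.startswith(("http://", "https://"))
        has_quote = any(ch in url for ch in "\"'")
        if bad_scheme or has_quote:
            problems.append((lineno, url))
    return problems


def path_as_sent(path):
    """Percent-encode a path roughly the way an HTTP client would send it."""
    return quote(path)
```

With this, `path_as_sent('/updates/MIRRORED.BY"')` gives
`/updates/MIRRORED.BY%22`, which is a different resource from
`/updates/MIRRORED.BY` and would explain a 404 from an otherwise healthy
mirror.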


John.

-- 
-- Over 5000 webcams from ski resorts around the world - www.snoweye.com
-- Translate your technical documents and web pages    - www.tradoc.fr