You are viewing a plain text version of this content. The canonical link for it is here.
Posted to users@spamassassin.apache.org by yaps <ya...@mynet.com> on 2005/12/29 14:55:57 UTC

uri parsing problem

Hi,

 we have problems with  spamassassin uri parser. It doesn't parse the
correct domain form the given url below:

 <http://SKBYYYld>
%2Ei1ozPfpedzc1dicevdqp7i8spku77p%2EASAMPLEDOMAIN.COM?:3wPr4KKYR.net"> =20
=09

the spamassassin uri parse result is as follows:

spamd[14123]: uri: html uri found, http://SKBYYYld>
%2Ei1ozPfpedzc1dicevdqp7i8spku77p%2EASAMPLEDOMAIN.COM?:3wPr4KKYR.net
spamd[14123]: uri: cleaned html uri, http://SKBYYYld>
%2Ei1ozPfpedzc1dicevdqp7i8spku77p%2EASAMPLEDOMAIN.COM?:3wPr4KKYR.net
spamd[14123]: uri: cleaned html uri,
http://SKBYYYld%3e%20.i1ozPfpedzc1dicevdqp7i8spku77p.ASAMPLEDOMAIN.COM/?:3wPr4KKYR.net
spamd[14123]: uri: parsed uri found, 2EASAMPLEDOMAIN.COM?:3wPr4KKYR.net
spamd[14123]: uri: cleaned parsed uri,
http://2EASAMPLEDOMAIN.COM/?:3wPr4KKYR.net
spamd[14123]: uri: cleaned parsed uri, 2EASAMPLEDOMAIN.COM?:3wPr4KKYR.net
spamd[14123]: uri: parsed domain, 2eASAMPLEDOMAIN.COM
spamd[14123]: uri: parsed uri found,
http://2EASAMPLEDOMAIN.COM?:3wPr4KKYR.net
spamd[14123]: uri: cleaned parsed uri,
http://2EASAMPLEDOMAIN.COM?:3wPr4KKYR.net
spamd[14123]: uri: cleaned parsed uri,
http://2EASAMPLEDOMAIN.COM/?:3wPr4KKYR.net
spamd[14123]: uri: parsed domain, 2eASAMPLEDOMAIN.COM
spamd[14123]: uridnsbl: domains to query: buonanotte.com 2eASAMPLEDOMAIN.COM
spamd[14123]: rules: ran uri rule URI_NO_WWW_ANY_CGI ======> got hit:
"http://SKBYYYld%3e%20.i1ozPfpedzc1dicevdqp7i8spku77p.ASAMPLEDOMAIN.COM/?:3wPr4KKYR.net"
spamd[14123]: uridnsbl: query for 2eASAMPLEDOMAIN.COM took 0 seconds to
look up (multi.surbl.org.:2eASAMPLEDOMAIN.COM)



the expected result was ASAMPLEDOMAIN.COM. But as you can see above the
parsed domain is wrong.
This is tested with spamassassin 3.0.2 and 3.1.0.