Posted to user@nutch.apache.org by brad <br...@bcs-mail.net> on 2010/08/05 18:17:24 UTC

opic.OPICScoringFilter - java.net.MalformedURLException: no protocol

I changed my plugin.includes from
urlfilter-regex
to
urlfilter-automaton

And left everything else the same.
I am using the default automaton-urlfilter.txt as delivered,
with a few additional excludes like:
-http://(@\.)*twitter\.com(/@)*

Now the Hadoop log contains a larger number of these warnings:
WARN  opic.OPICScoringFilter - java.net.MalformedURLException: no protocol:

Does urlfilter-automaton let through something that urlfilter-regex does not?
How can I filter out whatever is causing the MalformedURLException?
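
For context on the warning itself, here is a minimal sketch of how java.net.URL produces a "no protocol" error. It is only an illustration under assumptions: the class name NoProtocolDemo, the helper parseError, and the sample strings are hypothetical and are not taken from the Nutch source or from my logs. It shows that a string without a scheme such as "http://" fails to parse, which may be the kind of entry that is now getting past the filter.

```java
import java.net.MalformedURLException;
import java.net.URL;

// Hypothetical demo class, not part of Nutch.
public class NoProtocolDemo {

    /**
     * Tries to parse the candidate as an absolute URL.
     * Returns null on success, otherwise the exception message.
     */
    public static String parseError(String candidate) {
        try {
            new URL(candidate);
            return null;
        } catch (MalformedURLException e) {
            return e.getMessage();
        }
    }

    public static void main(String[] args) {
        // Illustrative inputs only: one absolute URL and three strings with no scheme.
        String[] samples = {
            "http://twitter.com/foo",
            "twitter.com/foo",
            "/relative/path",
            ""
        };
        for (String s : samples) {
            String err = parseError(s);
            System.out.println("[" + s + "] -> " + (err == null ? "ok" : err));
        }
    }
}
```

If this is what is happening, the strings reaching the scoring filter would be relative or empty rather than absolute URLs, so an include rule that only accepts entries starting with a known scheme might be one way to keep them out.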


Thanks
Brad