You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by brad <br...@bcs-mail.net> on 2010/08/05 18:17:24 UTC
opic.OPICScoringFilter - java.net.MalformedURLException: no protocol
I changed my plugin.includes from
urlfilter-regex
To
urlfilter-automaton
And left everything else the same.
Using the default delivered automaton-urlfilter.txt
with a few addition excludes like:
-http://(@\.)*twitter\.com(/@)*
Now the hadoop log has a larger number of
WARN opic.OPICScoringFilter - java.net.MalformedURLException: no protocol:
Does urlfilter-automaton allow in something the urlfilter-regex does not?
How can I filter out the MalformedURLException
Thanks
Brad