You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by Earl Cahill <ca...@yahoo.com> on 2005/10/03 07:04:54 UTC

java.net.MalformedURLException: no protocol for parse-plugins.xml

I did a clean, full svn update, and ant on trunk, then
tried

bin/nutch crawl urls -dir crawl.test

and got

051002 224950 SEVERE Unable to load parse plugins file
from URL [parse-plugins.xml]
java.net.MalformedURLException: no protocol: ...

Likely missing file:/.  If I get rid of lines 617-622
of conf/nutch-default.xml

<property>
  <name>parse.plugin.file</name>
  <value>parse-plugins.xml</value>
  <description>The name of the file that defines the
associations between
  content-types and parsers.</description>
</property>

it at least lets me run my crawl.  Looks like that was
added in revision 292865 on friday by jerome.  Putting
in the full path works, as per this patch

Index: conf/nutch-default.xml
===================================================================
--- conf/nutch-default.xml      (revision 293226)
+++ conf/nutch-default.xml      (working copy)
@@ -616,7 +616,7 @@

 <property>
   <name>parse.plugin.file</name>
-  <value>parse-plugins.xml</value>
+ 
<value>file:/home/nutch/nutch/trunk/conf/parse-plugins.xml</value>
   <description>The name of the file that defines the
associations between
   content-types and parsers.</description>
 </property>

But yeah, that's not a good option.  I tried each
directory and none of them worked.

Hope this gets you at least close.

Earl


		
__________________________________ 
Yahoo! Mail - PC Magazine Editors' Choice 2005 
http://mail.yahoo.com

Re: java.net.MalformedURLException: no protocol for parse-plugins.xml

Posted by Jérôme Charron <je...@gmail.com>.
> Likely missing file:/. If I get rid of lines 617-622
> > of conf/nutch-default.xml
>
> Resolved and committed:
http://svn.apache.org/viewcvs.cgi?rev=293370&view=rev

Thanks Earl.

Jérôme

--
http://motrech.free.fr/
http://www.frutch.org/

Re: java.net.MalformedURLException: no protocol for parse-plugins.xml

Posted by Jérôme Charron <je...@gmail.com>.
> Likely missing file:/. If I get rid of lines 617-622
> of conf/nutch-default.xml

Oups, sorry.
I made this last change just after testing the whole patch.
And I doesn't test it once again since I was sure it was a minor change.
I correct this right now. Sorry.

Regards

Jérôme


--
http://motrech.free.fr/
http://www.frutch.org/