You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@nutch.apache.org by Andrzej Bialecki <ab...@getopt.org> on 2005/09/13 14:01:32 UTC

Re: [Nutch-cvs] svn commit: r280179 - in /lucene/nutch/trunk/src/plugin: clustering-carrot2/ creativecommons/ index-basic/ index-more/ languageidentifier/ ontology/ parse-ext/ parse-html/ parse-js/ parse-mp3/ parse-mspowerpoint/ parse-msword/ parse-pdf/ parse-rss/ par...

jerome@apache.org wrote:
> Author: jerome
> Date: Sun Sep 11 13:34:12 2005
> New Revision: 280179
> 
> URL: http://svn.apache.org/viewcvs?rev=280179&view=rev
> Log:
> Add a dependency to nutch-extensionpoints plugin

Looks like something broke after this commit. When I run a "nutch crawl" 
using the out-of-the-box configuration I get the following (with logging 
turned to ALL):

050913 125223 not including: creativecommons
050913 125223 not including: parse-pdf
050913 125223 not including: parse-ext
050913 125223 not including: ontology
050913 125223 not including: protocol-ftp
050913 125223 not including: protocol-http
050913 125223 not including: parse-zip
050913 125223 not including: nutch-extensionpoints
050913 125223 not including: index-more
050913 125223 not including: clustering-carrot2
050913 125223 not including: query-more
050913 125223 not including: language-identifier
050913 125223 not including: urlfilter-prefix
050913 125223 not including: parse-mspowerpoint
050913 125223 not including: parse-msword
050913 125223 not including: protocol-file
050913 125223 not including: lib-jakarta-poi
050913 125223 not including: parse-rss
050913 125223 Missing dependency nutch-extensionpoints for plugin query-url
050913 125223 Missing dependency nutch-extensionpoints for plugin query-site
050913 125223 Missing dependency nutch-extensionpoints for plugin 
protocol-httpc
lient
050913 125223 Missing dependency nutch-extensionpoints for plugin parse-html
050913 125223 Missing dependency nutch-extensionpoints for plugin 
index-basic
050913 125223 Missing dependency nutch-extensionpoints for plugin parse-text
050913 125223 Missing dependency nutch-extensionpoints for plugin parse-js
050913 125223 Missing dependency nutch-extensionpoints for plugin 
query-basic
050913 125223 Missing dependency nutch-extensionpoints for plugin 
urlfilter-rege
x
050913 125223 Plugin Auto-activation mode: [false]
050913 125223 Registered Plugins:
050913 125223   NONE
050913 125223 Registered Extension-Points:
050913 125223   NONE
Exception in thread "main" java.lang.ExceptionInInitializerError
         at 
org.apache.nutch.db.WebDBInjector.addPage(WebDBInjector.java:437)
         at 
org.apache.nutch.db.WebDBInjector.injectURLFile(WebDBInjector.java:37
8)
         at org.apache.nutch.db.WebDBInjector.main(WebDBInjector.java:535)
         at org.apache.nutch.tools.CrawlTool.main(CrawlTool.java:134)
Caused by: java.lang.RuntimeException: org.apache.nutch.net.URLFilter 
not found.

         at org.apache.nutch.net.URLFilters.<clinit>(URLFilters.java:44)
         ... 4 more


-- 
Best regards,
Andrzej Bialecki     <><
  ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com


Re: [Nutch-cvs] svn commit: r280179 - in /lucene/nutch/trunk/src/plugin: clustering-carrot2/ creativecommons/ index-basic/ index-more/ languageidentifier/ ontology/ parse-ext/ parse-html/ parse-js/ parse-mp3/ parse-mspowerpoint/ parse-msword/ parse-p

Posted by Jérôme Charron <je...@gmail.com>.
> Looks like something broke after this commit. When I run a "nutch crawl"
> using the out-of-the-box configuration I get the following (with logging
> turned to ALL):

OK, I see the problem: I committed the nutch-site.xml file with the property 
plugin.autoactivation setted to false, whereas it must be setted to true by 
default.
You have the choice to manually activate needed plugins, or simply to set 
the plugin.auto-activation property to true.
I commit the correction right now (plugin.auto-activation = true).
(In my test env, the nutch-site override this property with a true value!)

Thanks Andrzej.

Jérôme

-- 
http://motrech.free.fr/
http://www.frutch.org/