Posted to user@nutch.apache.org by Edoardo Causarano <ed...@gmail.com> on 2014/09/17 10:47:32 UTC

index command failing, no plugins found

Hi all,

I'm having difficulty running the index phase in standalone mode. Other commands such as "fetch" work fine, but when I run "index", plugins are not found. (I'm using stock Nutch 1.9, built from source.)

➜  deploy  bin/nutch index crawl/crawldb -linkdb crawl/linkdb -dir crawl/segments -filter -normalize
14/09/17 10:36:56 INFO indexer.IndexingJob: Indexer: starting at 2014-09-17 10:36:56
14/09/17 10:36:56 INFO indexer.IndexingJob: Indexer: deleting gone documents: false
14/09/17 10:36:56 INFO indexer.IndexingJob: Indexer: URL filtering: true
14/09/17 10:36:56 INFO indexer.IndexingJob: Indexer: URL normalizing: true
14/09/17 10:36:56 WARN plugin.PluginRepository: Plugins: directory not found: plugins
14/09/17 10:36:56 WARN plugin.PluginRepository: No plugins found on paths of property plugin.folders="plugins"
14/09/17 10:36:56 INFO plugin.PluginRepository: Plugin Auto-activation mode: [true]
14/09/17 10:36:56 INFO plugin.PluginRepository: Registered Plugins:
14/09/17 10:36:56 INFO plugin.PluginRepository: 	NONE
14/09/17 10:36:56 INFO plugin.PluginRepository: Registered Extension-Points:
14/09/17 10:36:56 INFO plugin.PluginRepository: 	NONE
14/09/17 10:36:56 ERROR indexer.IndexingJob: Indexer: java.lang.RuntimeException: org.apache.nutch.indexer.IndexWriter not found.
	at org.apache.nutch.indexer.IndexWriters.<init>(IndexWriters.java:51)
	at org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:91)
	at org.apache.nutch.indexer.IndexingJob.run(IndexingJob.java:176)
	at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
	at org.apache.nutch.indexer.IndexingJob.main(IndexingJob.java:186)
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
	at java.lang.reflect.Method.invoke(Method.java:606)
	at org.apache.hadoop.util.RunJar.main(RunJar.java:212)


Best,
Edoardo

RE: index command failing, no plugins found

Posted by Markus Jelsma <ma...@openindex.io>.
Hi - you must add either the Solr or the Elasticsearch indexing plugin to your nutch-site.xml configuration file. See conf/nutch-default.xml for examples. User-defined configuration belongs in nutch-site.xml.
Markus
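
For illustration, a minimal sketch of what such an override in conf/nutch-site.xml might look like. It assumes the relevant property is plugin.includes and that the Solr index writer plugin is named indexer-solr (indexer-elastic for Elasticsearch). The value shown is an assumption modeled on the stock default, so compare it with the plugin.includes entry in your own conf/nutch-default.xml before using it. The log above also warns that the directory named by plugin.folders was not found; this fragment does not address that warning.

```xml
<?xml version="1.0"?>
<!-- conf/nutch-site.xml: user overrides for settings in conf/nutch-default.xml -->
<configuration>
  <property>
    <name>plugin.includes</name>
    <!-- Assumed value, modeled on the stock default. "indexer-solr" enables the
         Solr index writer; replace it with "indexer-elastic" for Elasticsearch. -->
    <value>protocol-http|urlfilter-regex|parse-(html|tika)|index-(basic|anchor)|indexer-solr|scoring-opic|urlnormalizer-(pass|regex|basic)</value>
  </property>
</configuration>
```

Properties set in nutch-site.xml take precedence over those in nutch-default.xml, so only the properties you want to change need to appear here.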

 
 