You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by kaveh minooie <ka...@plutoz.com> on 2012/01/28 02:30:49 UTC

solrdedup error

we were having this discussion afew days ago thou I don't think we came 
up with any solution, but dedup seems to be having problems:

bin/nutch solrdedup  http://solr3:8983/solr/core8

results in this:

2012-01-27 17:19:13,667 INFO  solr.SolrDeleteDuplicates - 
SolrDeleteDuplicates: starting at 2012-01-27 17:19:13
2012-01-27 17:19:13,667 INFO  solr.SolrDeleteDuplicates - 
SolrDeleteDuplicates: Solr url: http://solr3:8983/solr/core8
2012-01-27 17:19:14,397 WARN  util.NativeCodeLoader - Unable to load 
native-hadoop library for your platform... using builtin-java classes 
where applicable
2012-01-27 17:19:15,402 WARN  mapred.FileOutputCommitter - Output path 
is null in cleanup
2012-01-27 17:19:15,403 WARN  mapred.LocalJobRunner - job_local_0001
java.lang.NullPointerException
	at 
org.apache.nutch.indexer.solr.SolrDeleteDuplicates$SolrRecord.readSolrDocument(SolrDeleteDuplicates.java:131)
	at 
org.apache.nutch.indexer.solr.SolrDeleteDuplicates$SolrInputFormat$1.next(SolrDeleteDuplicates.java:271)
	at 
org.apache.nutch.indexer.solr.SolrDeleteDuplicates$SolrInputFormat$1.next(SolrDeleteDuplicates.java:241)
	at 
org.apache.hadoop.mapred.MapTask$TrackedRecordReader.moveToNext(MapTask.java:236)
	at 
org.apache.hadoop.mapred.MapTask$TrackedRecordReader.next(MapTask.java:216)
	at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:48)
	at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:436)
	at org.apache.hadoop.mapred.MapTask.run(MapTask.java:372)
	at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:212)


and I don't know what this output  path is that it is complaining about. 
any body?

-- 
Kaveh Minooie

www.plutoz.com