You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by ArentJan Banck <aj...@planet.nl> on 2006/03/14 23:20:24 UTC

0.8: NullPointerException Optimizing index when crawling

I have been running 0.7.1  without problems.
Today i build a 0.8-dev version from trunk. and ported my settings from 
0.7.1

When doing a crawl crawling starts, but it ends with a nullpointer 
exception.
Any hints what is going wrong /  how I should resolve this?

commandline used:
sh bin/nutch crawl urls -dir /nutch-0.8-dev/crawl -depth 1



Tail of the stacktrace:
060314 231802   Nutch Query Filter (org.apache.nutch.searcher.QueryFilter)
060314 231802 parsing 
jar:file:/C:/nutch-0.8-dev/lib/hadoop-0.1-dev.jar!/hadoop-default.xml
060314 231803 parsing 
jar:file:/C:/nutch-0.8-dev/lib/hadoop-0.1-dev.jar!/mapred-default.xml
060314 231803 parsing \tmp\hadoop\mapred\local\job_u1rb8.xml\localRunner
060314 231803 parsing 
jar:file:/C:/nutch-0.8-dev/lib/hadoop-0.1-dev.jar!/mapred-default.xml
060314 231803 parsing file:/C:/nutch-0.8-dev/conf/hadoop-site.xml
060314 231803 found resource common-terms.utf8 at 
file:/C:/nutch-0.8-dev/conf/common-terms.utf8
060314 231803 found resource common-terms.utf8 at 
file:/C:/nutch-0.8-dev/conf/common-terms.utf8
060314 231803 Optimizing index.
java.lang.NullPointerException
        at 
org.apache.nutch.indexer.Indexer$OutputFormat$1.write(Indexer.java:109)
        at 
org.apache.hadoop.mapred.ReduceTask$3.collect(ReduceTask.java:270)
        at org.apache.nutch.indexer.Indexer.reduce(Indexer.java:242)
        at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:283)
        at 
org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:106)
060314 231803  map 100%  reduce 0%
Exception in thread "main" java.io.IOException: Job failed!
        at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:310)
        at org.apache.nutch.indexer.Indexer.index(Indexer.java:275)
        at org.apache.nutch.crawl.Crawl.main(Crawl.java:120)


Thanks,
Arent-Jan 



Re: 0.8: NullPointerException Optimizing index when crawling

Posted by Marko Bauhardt <mb...@media-style.com>.
Am 14.03.2006 um 23:20 schrieb ArentJan Banck:

> java.lang.NullPointerException
>        at org.apache.nutch.indexer.Indexer$OutputFormat$1.write 
> (Indexer.java:109)

What for index plugins do you have configured in your nutch- 
default.xml or nutch-site.xml? Be sure that the index-basic plugin is  
included.

Marko