Posted to user@nutch.apache.org by Emmanuel JOKE <jo...@gmail.com> on 2007/06/24 12:10:12 UTC

Indexer NPE

Hi Guys,

I've started a crawl and everything was working fine until it tried to index
the data. I got an NPE, as shown below:
2007-06-24 05:05:39,453 WARN  mapred.LocalJobRunner - job_agncso
java.lang.NullPointerException: value cannot be null
        at org.apache.lucene.document.Field.<init>(Field.java:195)
        at org.apache.lucene.document.Field.<init>(Field.java:171)
        at org.apache.nutch.indexer.Indexer.reduce(Indexer.java:198)
        at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:326)
        at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:155)
2007-06-24 05:05:39,592 FATAL indexer.Indexer - Indexer: java.io.IOException: Job failed!
        at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:604)
        at org.apache.nutch.indexer.Indexer.index(Indexer.java:275)
        at org.apache.nutch.indexer.Indexer.run(Indexer.java:297)
        at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:189)
        at org.apache.nutch.indexer.Indexer.main(Indexer.java:280)

I don't understand what the problem is. Do you have any idea?

Conf:
1 single machine
Linux 2.6, Java 1.6
Nutch nightly + Hadoop 0.12.3

Thanks in advance for your help.

Re: Indexer NPE

Posted by Doğacan Güney <do...@gmail.com>.
On 6/24/07, Emmanuel JOKE <jo...@gmail.com> wrote:
> Hi Guys,
>
> I've started a crawl and everything was working fine until it tried to index
> the data. I got an NPE, as shown below:
> 2007-06-24 05:05:39,453 WARN  mapred.LocalJobRunner - job_agncso
> java.lang.NullPointerException: value cannot be null
>         at org.apache.lucene.document.Field.<init>(Field.java:195)
>         at org.apache.lucene.document.Field.<init>(Field.java:171)
>         at org.apache.nutch.indexer.Indexer.reduce(Indexer.java:198)
>         at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:326)
>         at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:155)
> 2007-06-24 05:05:39,592 FATAL indexer.Indexer - Indexer: java.io.IOException: Job failed!
>         at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:604)
>         at org.apache.nutch.indexer.Indexer.index(Indexer.java:275)
>         at org.apache.nutch.indexer.Indexer.run(Indexer.java:297)
>         at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:189)
>         at org.apache.nutch.indexer.Indexer.main(Indexer.java:280)
>
> I don't understand what the problem is. Do you have any idea?

I just committed a fix for this one (NUTCH-504, rev 550196). See the discussion here:

http://www.nabble.com/Indexing-problems-in-nutch-nightly-tf3923427.html

for why the problem occurs.
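
For illustration only (this is not the NUTCH-504 patch): the stack trace shows
the Lucene Field constructor rejecting a null value passed from Indexer.reduce.
A minimal sketch of that failure mode and one defensive option, skipping null
values before building fields, might look like the following. SimpleField,
buildFields, and the "url"/"segment" keys are hypothetical stand-ins; no real
Lucene or Nutch classes are used, and which value was actually null in the
indexer is not stated in this thread.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class NullFieldGuard {

    // Hypothetical stand-in for a Lucene-style field. Like the constructor in
    // the stack trace, it throws if the value is null.
    static final class SimpleField {
        final String name;
        final String value;

        SimpleField(String name, String value) {
            if (value == null) {
                throw new NullPointerException("value cannot be null");
            }
            this.name = name;
            this.value = value;
        }
    }

    // Builds fields from raw metadata, skipping entries whose value is null
    // instead of letting the constructor throw and fail the whole job.
    static Map<String, SimpleField> buildFields(Map<String, String> raw) {
        Map<String, SimpleField> fields = new LinkedHashMap<String, SimpleField>();
        for (Map.Entry<String, String> e : raw.entrySet()) {
            if (e.getValue() == null) {
                continue; // assumption: a missing value is safe to omit
            }
            fields.put(e.getKey(), new SimpleField(e.getKey(), e.getValue()));
        }
        return fields;
    }

    public static void main(String[] args) {
        Map<String, String> raw = new LinkedHashMap<String, String>();
        raw.put("url", "http://example.com/");
        raw.put("segment", null); // hypothetical missing value
        System.out.println(buildFields(raw).keySet()); // prints [url]
    }
}
```

Whether skipping the value is appropriate depends on why it is null in the
first place; the linked discussion covers the actual cause.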

>
> Conf:
> 1 single machine
> Linux 2.6, Java 1.6
> Nutch nightly + Hadoop 0.12.3
>
> Thanks in advance for your help.
>


-- 
Doğacan Güney