You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Marko Bauhardt (JIRA)" <ji...@apache.org> on 2006/05/25 11:07:30 UTC

[jira] Commented: (NUTCH-284) NullPointerException during index

    [ http://issues.apache.org/jira/browse/NUTCH-284?page=comments#action_12413227 ] 

Marko Bauhardt commented on NUTCH-284:
--------------------------------------

I think the index-basic plugin is not included? Because
Line 111: .... doc.getField("url").stringValue() ....

The BasicIndexingFilter index the field "url".

 Verify your Logfile or the nutch-default.xml (or nutch-site.xml).

Marko



> NullPointerException during index
> ---------------------------------
>
>          Key: NUTCH-284
>          URL: http://issues.apache.org/jira/browse/NUTCH-284
>      Project: Nutch
>         Type: Bug

>   Components: indexer
>     Versions: 0.8-dev
>     Reporter: Stefan Neufeind

>
> For  quite a few this "reduce > sort" has been going on. Then it fails. What could be wrong with this?
> 060524 212613 reduce > sort
> 060524 212614 reduce > sort
> 060524 212615 reduce > sort
> 060524 212615 found resource common-terms.utf8 at file:/home/mm/nutch-nightly-prod/conf/common-terms.utf8
> 060524 212615 found resource common-terms.utf8 at file:/home/mm/nutch-nightly-prod/conf/common-terms.utf8
> 060524 212619 Optimizing index.
> 060524 212619 job_jlbhhm
> java.lang.NullPointerException
>         at org.apache.nutch.indexer.Indexer$OutputFormat$1.write(Indexer.java:111)
>         at org.apache.hadoop.mapred.ReduceTask$3.collect(ReduceTask.java:269)
>         at org.apache.nutch.indexer.Indexer.reduce(Indexer.java:253)
>         at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:282)
>         at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:114)
> Exception in thread "main" java.io.IOException: Job failed!
>         at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:341)
>         at org.apache.nutch.indexer.Indexer.index(Indexer.java:287)
>         at org.apache.nutch.indexer.Indexer.main(Indexer.java:304)

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira