Posted to dev@nutch.apache.org by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/04 13:19:21 UTC
[jira] [Updated] (NUTCH-993) NullPointerException at FetcherOutputFormat.checkOutputSpecs
[ https://issues.apache.org/jira/browse/NUTCH-993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Markus Jelsma updated NUTCH-993:
--------------------------------
Comment: was deleted
(was: There's an issue with ParseOutputFormat. It fails when running Nutch locally:
{code}
ParseSegment: segment: crawl/segments/20110704125233
Exception in thread "main" java.io.IOException: Segment already fetched!
at org.apache.nutch.parse.ParseOutputFormat.checkOutputSpecs(ParseOutputFormat.java:86)
at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:772)
at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:730)
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1249)
at org.apache.nutch.parse.ParseSegment.parse(ParseSegment.java:157)
at org.apache.nutch.parse.ParseSegment.run(ParseSegment.java:178)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
at org.apache.nutch.parse.ParseSegment.main(ParseSegment.java:164)
{code})
> NullPointerException at FetcherOutputFormat.checkOutputSpecs
> ------------------------------------------------------------
>
> Key: NUTCH-993
> URL: https://issues.apache.org/jira/browse/NUTCH-993
> Project: Nutch
> Issue Type: Bug
> Components: fetcher
> Affects Versions: 1.3
> Environment: Cloudera CDH3 Cluster (hadoop 0.20.2-cdh3u0)
> Reporter: Christian Guegi
> Assignee: Markus Jelsma
> Priority: Minor
> Fix For: 1.4, 2.0
>
> Attachments: FetcherOutputFormat.patch, ParseOutputFormat.patch
>
>
> When running Nutch as a MapReduce job on an existing cluster, I get a NullPointerException at org.apache.nutch.fetcher.FetcherOutputFormat.checkOutputSpecs.
> The reason is that the passed-in reference to the file system is null.
> The attached patch ignores the parameter 'fs' and creates a new reference to the file system.
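The approach described in the issue can be illustrated with a minimal, standard-library-only sketch. It is not the attached patch: java.nio.file types stand in for Hadoop's FileSystem and Path, and the class name OutputSpecCheck and the "crawl_fetch" directory name are assumptions made for illustration. The point is only that the check never dereferences the 'fs' argument and instead resolves the file system from the output path itself.

```java
import java.io.IOException;
import java.nio.file.FileSystem;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical stand-in for an output format's spec check; not Nutch code.
class OutputSpecCheck {
    // Assumed name of the fetch subdirectory inside a segment.
    static final String FETCH_DIR_NAME = "crawl_fetch";

    // 'fs' mirrors the parameter that may arrive as null on a cluster.
    static void checkOutputSpecs(FileSystem fs, Path out) throws IOException {
        // Ignore the passed-in 'fs' and obtain a fresh reference from the path.
        FileSystem resolved = out.getFileSystem();
        Path fetchDir = resolved.getPath(out.toString(), FETCH_DIR_NAME);
        if (Files.exists(fetchDir)) {
            throw new IOException("Segment already fetched!");
        }
    }

    public static void main(String[] args) throws IOException {
        Path segment = Files.createTempDirectory("segment");
        // A null 'fs' is harmless here because it is never dereferenced.
        checkOutputSpecs(null, segment);
        System.out.println("Output spec check passed for " + segment);
    }
}
```

In Hadoop itself the analogous step would presumably resolve the file system from the job's output path and configuration rather than from a java.nio path; that mapping is an assumption here, not something stated in the issue.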
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira