You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by spamsucks <sp...@rhoderunner.com> on 2006/12/22 17:32:21 UTC

PhasedFileSystem Exception in trunk build

I pulled from trunk yesterday, and doing a Intranet crawl, it fails.

Running on cygwin/XP

Any guidance?  Thanks.

2006-12-22 11:10:00,613 INFO  indexer.IndexingFilters - Adding 
org.apache.nutch.indexer.basic.BasicIndexingFilter
2006-12-22 11:10:01,363 WARN  mapred.LocalJobRunner - job_g7zf6p
java.lang.UnsupportedOperationException: Operation not supported
 at 
org.apache.hadoop.mapred.PhasedFileSystem.deleteRaw(PhasedFileSystem.java:291)
 at org.apache.hadoop.fs.FileSystem.delete(FileSystem.java:402)
 at 
org.apache.nutch.indexer.Indexer$OutputFormat.getRecordWriter(Indexer.java:97)
 at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:287)
 at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:137)




Re: PhasedFileSystem Exception in trunk build

Posted by Andrzej Bialecki <ab...@getopt.org>.
spamsucks wrote:
> I added the property to hadoop-site.xml, and it now fails with a 
> different error.  How about some more magical properties?

Nope ;) I think this indicates that one of your indexes is corrupt or 
absent.

-- 
Best regards,
Andrzej Bialecki     <><
 ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com



Re: PhasedFileSystem Exception in trunk build

Posted by spamsucks <sp...@rhoderunner.com>.
I added the property to hadoop-site.xml, and it now fails with a different 
error.  How about some more magical properties?
Thanks.
Phillip

<property>
  <name>mapred.speculative.execution</name>
  <value>false</value>
  <description>from the mailing list
  </description>
</property>


Stack trace:

2006-12-22 13:40:46,040 WARN  mapred.LocalJobRunner 
(LocalJobRunner.java:run(147)) - job_ambth9
java.lang.ArrayIndexOutOfBoundsException: -1
        at 
org.apache.lucene.index.MultiReader.isDeleted(MultiReader.java:109)
        at 
org.apache.nutch.indexer.DeleteDuplicates$InputFormat$DDRecordReader.next(DeleteDuplicates.java:177)
        at org.apache.hadoop.mapred.MapTask$3.next(MapTask.java:203)
        at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:46)
        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:215)
        at 
org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:109)
Exception in thread "main" java.io.IOException: Job failed!
        at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:399)
        at 
org.apache.nutch.indexer.DeleteDuplicates.dedup(DeleteDuplicates.java:433)
        at org.apache.nutch.crawl.Crawl.main(Crawl.java:131)



----- Original Message ----- 
From: "Andrzej Bialecki" <ab...@getopt.org>
To: <nu...@lucene.apache.org>
Sent: Friday, December 22, 2006 12:50 PM
Subject: Re: PhasedFileSystem Exception in trunk build


> spamsucks wrote:
>> I pulled from trunk yesterday, and doing a Intranet crawl, it fails.
>>
>> Running on cygwin/XP
>>
>> Any guidance?  Thanks.
>>
>> 2006-12-22 11:10:00,613 INFO  indexer.IndexingFilters - Adding 
>> org.apache.nutch.indexer.basic.BasicIndexingFilter
>> 2006-12-22 11:10:01,363 WARN  mapred.LocalJobRunner - job_g7zf6p
>> java.lang.UnsupportedOperationException: Operation not supported
>> at 
>> org.apache.hadoop.mapred.PhasedFileSystem.deleteRaw(PhasedFileSystem.java:291)
>> at org.apache.hadoop.fs.FileSystem.delete(FileSystem.java:402)
>> at 
>> org.apache.nutch.indexer.Indexer$OutputFormat.getRecordWriter(Indexer.java:97)
>> at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:287)
>> at 
>> org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:137)
>
> Please set "mapred.speculative.execution" to false, in hadoop-site.xml. 
> This is addressed in the upcoming update of Hadoop v. 0.9.2.
>
> -- 
> Best regards,
> Andrzej Bialecki     <><
> ___. ___ ___ ___ _ _   __________________________________
> [__ || __|__/|__||\/|  Information Retrieval, Semantic Web
> ___|||__||  \|  ||  |  Embedded Unix, System Integration
> http://www.sigram.com  Contact: info at sigram dot com
>
> 



Re: PhasedFileSystem Exception in trunk build

Posted by Andrzej Bialecki <ab...@getopt.org>.
spamsucks wrote:
> I pulled from trunk yesterday, and doing a Intranet crawl, it fails.
>
> Running on cygwin/XP
>
> Any guidance?  Thanks.
>
> 2006-12-22 11:10:00,613 INFO  indexer.IndexingFilters - Adding 
> org.apache.nutch.indexer.basic.BasicIndexingFilter
> 2006-12-22 11:10:01,363 WARN  mapred.LocalJobRunner - job_g7zf6p
> java.lang.UnsupportedOperationException: Operation not supported
> at 
> org.apache.hadoop.mapred.PhasedFileSystem.deleteRaw(PhasedFileSystem.java:291) 
>
> at org.apache.hadoop.fs.FileSystem.delete(FileSystem.java:402)
> at 
> org.apache.nutch.indexer.Indexer$OutputFormat.getRecordWriter(Indexer.java:97) 
>
> at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:287)
> at 
> org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:137)

Please set "mapred.speculative.execution" to false, in hadoop-site.xml. 
This is addressed in the upcoming update of Hadoop v. 0.9.2.

-- 
Best regards,
Andrzej Bialecki     <><
 ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com