You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by qi wu <ch...@gmail.com> on 2007/10/12 06:38:06 UTC

Possible for recovering the corrupted sequence file?

Hi,
I am using Nutch/Hadoop with single node mode.Nutch failed to generate a new segement and in the hadoop log I find 
the error message below:

007-10-12 11:09:53,961 INFO  crawl.Generator - Generator: jobtracker is 'local', generating exactly one partition.
2007-10-12 11:09:58,602 WARN  fs.FileSystem - Moving bad file /nutch/youjiDB/crawldb/current/part-00000/data to /nutch/bad_files/data.-934992143
2007-10-12 11:09:58,607 WARN  mapred.LocalJobRunner - job_2daorz
java.lang.NullPointerException
        at org.apache.hadoop.fs.FSDataInputStream$Buffer.seek(FSDataInputStream.java:74)
        at org.apache.hadoop.fs.FSDataInputStream.seek(FSDataInputStream.java:121)
        at org.apache.hadoop.fs.ChecksumFileSystem$FSInputChecker.readBuffer(ChecksumFileSystem.java:221)        at org.apache.hadoop.fs.ChecksumFileSystem$FSInputChecker.read(ChecksumFileSystem.java:167)        at org.apache.hadoop.fs.FSDataInputStream$PositionCache.read(FSDataInputStream.java:41)        at java.io.BufferedInputStream.read1(BufferedInputStream.java:256)
        at java.io.BufferedInputStream.read(BufferedInputStream.java:317)        at java.io.DataInputStream.readFully(DataInputStream.java:178)        at org.apache.hadoop.io.DataOutputBuffer$Buffer.write(DataOutputBuffer.java:57)        at org.apache.hadoop.io.DataOutputBuffer.write(DataOutputBuffer.java:91)
@       

Any one can tell me how to recover the corrupted file ?

Thanks
-Qi