You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Ensheng Wang <nu...@yahoo.com.cn> on 2006/04/27 07:19:13 UTC

IOException when generate fetch

I got the blow error when I run 'nutch generate db segments -topN 50000'
   
  Could anybody help me? thx...
   
   
  Exception in thread "main" java.io.IOException: Version: 5
ID: 1d130841bb01680588a43f803794ad23
DomainID: -4932006994747272718
URL: http://forums.torrentportal.com/fpost16294.html
AnchorText:
targetHasOutlink: false
 read 77 bytes, should read 8269
        at org.apache.nutch.io.SequenceFile$Reader.next(SequenceFile.java:261)
        at org.apache.nutch.io.SequenceFile$Reader.next(SequenceFile.java:275)
        at org.apache.nutch.io.MapFile$Reader.next(MapFile.java:349)
        at org.apache.nutch.db.WebDBReader.getLinks(WebDBReader.java:326)
        at org.apache.nutch.tools.DistributedAnalysisTool.computeRound(DistributedAnalysisTool.java:384)
        at org.apache.nutch.tools.LinkAnalysisTool.iterate(LinkAnalysisTool.java:59)
        at org.apache.nutch.tools.LinkAnalysisTool.main(LinkAnalysisTool.java:81)

__________________________________________________
赶快注册雅虎超大容量免费邮箱?
http://cn.mail.yahoo.com