You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Chong Li (JIRA)" <ji...@apache.org> on 2015/02/24 20:07:05 UTC

[jira] [Created] (NUTCH-1950) File name too long when bin/nutch dump

Chong Li created NUTCH-1950:
-------------------------------

             Summary: File name too long when bin/nutch dump
                 Key: NUTCH-1950
                 URL: https://issues.apache.org/jira/browse/NUTCH-1950
             Project: Nutch
          Issue Type: Improvement
          Components: segment
    Affects Versions: 1.10
            Reporter: Chong Li
            Priority: Minor
             Fix For: 1.10


When bin/dump in version 1.10-trunk, there will be an exception saying "File name too long". When crawling, the length of the url may be longer than 255 bytes and nutch save the file using the url as file name. It can be saved in segments but when dumping the files to local file system, the length of the filename can not be longer than 255 bytes. 
The FileDumper.java need to be changed to handle such exception.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)