You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Chong Li (JIRA)" <ji...@apache.org> on 2015/02/26 07:33:06 UTC

[jira] [Updated] (NUTCH-1950) File name too long when bin/nutch dump

     [ https://issues.apache.org/jira/browse/NUTCH-1950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chong Li updated NUTCH-1950:
----------------------------
    Issue Type: Bug  (was: Improvement)

> File name too long when bin/nutch dump
> --------------------------------------
>
>                 Key: NUTCH-1950
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1950
>             Project: Nutch
>          Issue Type: Bug
>          Components: segment
>    Affects Versions: 1.10
>            Reporter: Chong Li
>            Priority: Minor
>             Fix For: 1.10
>
>   Original Estimate: 48h
>  Remaining Estimate: 48h
>
> When bin/dump in version 1.10-trunk, there will be an exception saying "File name too long". When crawling, the length of the url may be longer than 255 bytes and nutch save the file using the url as file name. It can be saved in segments but when dumping the files to local file system, the length of the filename can not be longer than 255 bytes. 
> The FileDumper.java need to be changed to handle such exception.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)