Posted to dev@nutch.apache.org by "Chong Li (JIRA)" <ji...@apache.org> on 2015/02/24 20:07:05 UTC
[jira] [Created] (NUTCH-1950) File name too long when bin/nutch dump
Chong Li created NUTCH-1950:
-------------------------------
Summary: File name too long when bin/nutch dump
Key: NUTCH-1950
URL: https://issues.apache.org/jira/browse/NUTCH-1950
Project: Nutch
Issue Type: Improvement
Components: segment
Affects Versions: 1.10
Reporter: Chong Li
Priority: Minor
Fix For: 1.10
Running bin/nutch dump on 1.10-trunk throws an exception saying "File name too long". During a crawl, a URL may be longer than 255 bytes, and Nutch uses the URL as the file name when saving the file. This works inside segments, but when the files are dumped to the local file system, a file name cannot be longer than 255 bytes.
FileDumper.java needs to be changed to handle this exception.
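One possible way to handle it, shown only as a rough sketch and not as the actual FileDumper code or its eventual fix: if the UTF-8 encoding of a name exceeds 255 bytes, cut it short and append a SHA-1 hash of the full name, so that two long URLs sharing a prefix still map to different files. The class name SafeFileName and the method shorten are hypothetical, and the sketch assumes the 255-byte limit described above; it does not try to preserve a file extension.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

// Hypothetical helper, not part of Nutch.
public class SafeFileName {
    // Assumed per-name limit of the local file system, as described in the report.
    static final int MAX_NAME_BYTES = 255;

    // Hex-encoded SHA-1 of the input (40 ASCII characters).
    static String sha1Hex(String s) {
        try {
            MessageDigest md = MessageDigest.getInstance("SHA-1");
            byte[] digest = md.digest(s.getBytes(StandardCharsets.UTF_8));
            StringBuilder sb = new StringBuilder(digest.length * 2);
            for (byte b : digest) {
                sb.append(String.format("%02x", b));
            }
            return sb.toString();
        } catch (NoSuchAlgorithmException e) {
            // SHA-1 is required to be available on every Java platform.
            throw new IllegalStateException(e);
        }
    }

    // Returns the name unchanged if it fits; otherwise a truncated prefix
    // followed by "_" and the SHA-1 of the full name.
    static String shorten(String name) {
        if (name.getBytes(StandardCharsets.UTF_8).length <= MAX_NAME_BYTES) {
            return name;
        }
        String suffix = "_" + sha1Hex(name);           // 41 ASCII bytes
        int budget = MAX_NAME_BYTES - suffix.length(); // bytes left for the prefix
        StringBuilder prefix = new StringBuilder();
        int used = 0;
        // Walk whole code points so a multi-byte character is never split.
        for (int i = 0; i < name.length(); ) {
            int cp = name.codePointAt(i);
            int len = new String(Character.toChars(cp))
                    .getBytes(StandardCharsets.UTF_8).length;
            if (used + len > budget) {
                break;
            }
            prefix.appendCodePoint(cp);
            used += len;
            i += Character.charCount(cp);
        }
        return prefix + suffix;
    }

    public static void main(String[] args) {
        // Build an over-long, URL-like name for demonstration.
        StringBuilder longName = new StringBuilder("http_example.com_");
        for (int i = 0; i < 300; i++) {
            longName.append('a');
        }
        String shortName = shorten(longName.toString());
        System.out.println(shortName.getBytes(StandardCharsets.UTF_8).length);
    }
}
```

The trade-off in this sketch is that shortened names are no longer readable as the full URL, so a dumper using something like it would probably also want to record the mapping from the original URL to the shortened name.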
--
This message was sent by Atlassian JIRA (v6.3.4#6332)