You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-issues@hadoop.apache.org by "Jing Zhao (JIRA)" <ji...@apache.org> on 2014/01/27 23:45:39 UTC

[jira] [Created] (HADOOP-10295) Allow distcp to automatically identify the checksum type of source files and use it for the target

Jing Zhao created HADOOP-10295:
----------------------------------

             Summary: Allow distcp to automatically identify the checksum type of source files and use it for the target
                 Key: HADOOP-10295
                 URL: https://issues.apache.org/jira/browse/HADOOP-10295
             Project: Hadoop Common
          Issue Type: Improvement
    Affects Versions: 2.2.0
            Reporter: Jing Zhao
            Assignee: Jing Zhao


Currently while doing distcp, users can use "-Ddfs.checksum.type" to specify the checksum type in the target FS. This works fine if all the source files are using the same checksum type. If files in the source cluster have mixed types of checksum, users have to either use "-skipcrccheck" or have checksum mismatching exception. Thus we may need to consider adding a new option to distcp so that it can automatically identify the original checksum type of each source file and use the same checksum type in the target FS. 



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)