Posted to common-issues@hadoop.apache.org by "Dave Thompson (Updated) (JIRA)" <ji...@apache.org> on 2012/03/31 00:39:27 UTC
[jira] [Updated] (HADOOP-8233) Turn CRC checking off for 0 byte size and differing blocksizes
[ https://issues.apache.org/jira/browse/HADOOP-8233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Dave Thompson updated HADOOP-8233:
----------------------------------
Attachment: HADOOP-8233-branch-0.23.2.patch
The patch skips the CRC check for 0-byte files and when the source and target block sizes do not match.
> Turn CRC checking off for 0 byte size and differing blocksizes
> --------------------------------------------------------------
>
> Key: HADOOP-8233
> URL: https://issues.apache.org/jira/browse/HADOOP-8233
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 0.23.3
> Reporter: Dave Thompson
> Assignee: Dave Thompson
> Attachments: HADOOP-8233-branch-0.23.2.patch
>
>
> DistcpV2 (hadoop-tools/hadoop-distcp/..) can fail with a checksum error, sometimes when copying a 0-byte file. The root cause may be related to inconsistent HDFS behavior when creating 0-byte files; however, distcp can avoid the issue by skipping the CRC check when the file size is zero.
> Further, distcp fails the checksum comparison when copying between two clusters that use different block sizes. In this case it does not make sense to check the CRC, as the check is guaranteed to fail.
> We need to turn CRC checking off for the above two cases.
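
The two cases described above can be sketched as a single decision function. This is only an illustration of the rule and is not taken from the attached patch: the class name CrcCheckPolicy, the method shouldSkipCrc, and its parameters are hypothetical, and it assumes the caller has already obtained the source file length and both block sizes.

```java
/**
 * Illustrative sketch only; NOT the code from the attached patch.
 * The class, method, and parameter names are hypothetical.
 */
final class CrcCheckPolicy {

    private CrcCheckPolicy() {
        // Utility class, not meant to be instantiated.
    }

    /**
     * Returns true when the CRC comparison between source and target
     * should be skipped, following the two cases in the issue description.
     *
     * @param sourceLength    length of the source file in bytes
     * @param sourceBlockSize block size of the source file in bytes
     * @param targetBlockSize block size of the target file in bytes
     */
    static boolean shouldSkipCrc(long sourceLength,
                                 long sourceBlockSize,
                                 long targetBlockSize) {
        // Case 1: 0-byte files, where the checksum comparison
        // has been seen to fail.
        if (sourceLength == 0L) {
            return true;
        }
        // Case 2: differing block sizes, where the issue states
        // the comparison is a guaranteed failure.
        return sourceBlockSize != targetBlockSize;
    }
}
```

Under these assumptions, a copy step would call shouldSkipCrc before comparing checksums and compare them only when it returns false.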
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira