Posted to hdfs-dev@hadoop.apache.org by "Adam Antal (JIRA)" <ji...@apache.org> on 2018/10/04 15:46:00 UTC
[jira] [Created] (HDFS-13960) hdfs dfs -checksum command should optionally show block size in output
Adam Antal created HDFS-13960:
---------------------------------
Summary: hdfs dfs -checksum command should optionally show block size in output
Key: HDFS-13960
URL: https://issues.apache.org/jira/browse/HDFS-13960
Project: Hadoop HDFS
Issue Type: Improvement
Components: hdfs
Reporter: Adam Antal
The hdfs dfs -checksum command computes the checksum in a distributed manner, so the result takes the block size into account. In other words, the block size determines how the file is broken up into the pieces that are checksummed.
Therefore it can happen that the checksum command produces different outputs for two files with exactly the same content that differ only in block size: checksum(fileABlock1) + checksum(fileABlock2) != checksum(fileABlock1 + fileABlock2)
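As a rough illustration of the inequality above, here is a minimal Python sketch of a composite checksum in the spirit of an MD5-of-MD5-of-CRC scheme. It is not the HDFS implementation: the function name composite_checksum, the 512-byte chunk size, the tiny block sizes, and the use of zlib.crc32 (plain CRC32 rather than CRC32C) are all assumptions chosen to keep the example self-contained.

```python
import hashlib
import struct
import zlib


def composite_checksum(data: bytes, block_size: int, bytes_per_crc: int = 512) -> str:
    """Simplified MD5-of-MD5-of-CRC file checksum (illustrative only)."""
    block_digests = b""
    for start in range(0, len(data), block_size):
        block = data[start:start + block_size]
        # One CRC per fixed-size chunk inside the block.
        crcs = b"".join(
            struct.pack(">I", zlib.crc32(block[i:i + bytes_per_crc]))
            for i in range(0, len(block), bytes_per_crc)
        )
        # Per-block digest: MD5 over that block's chunk CRCs.
        block_digests += hashlib.md5(crcs).digest()
    # File-level digest: MD5 over the concatenated per-block digests.
    return hashlib.md5(block_digests).hexdigest()


data = bytes(range(256)) * 16  # 4096 bytes of sample content
small_blocks = composite_checksum(data, block_size=1024)
large_blocks = composite_checksum(data, block_size=2048)
print(small_blocks == large_blocks)  # False: same bytes, different block size
```

The two calls see identical bytes, but the block boundaries change which per-block digests feed the final MD5, so the file-level results differ.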
I suggest adding an option to the hdfs dfs -checksum command that displays the block size along with the output. This could also be helpful in other cases where this piece of information is needed.
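To show how the extra field could be used, here is a small hypothetical Python sketch that compares two checksum results only when their block sizes match. The ChecksumInfo type, the compare helper, and the sample values are invented for illustration and do not reflect any existing or proposed output format; the sketch also assumes that all other checksum parameters are equal.

```python
from typing import NamedTuple


class ChecksumInfo(NamedTuple):
    """Hypothetical pairing of a file checksum with its block size."""
    checksum: str
    block_size: int  # in bytes


def compare(a: ChecksumInfo, b: ChecksumInfo) -> str:
    # Checksums computed with different block sizes are not comparable,
    # so a difference between them says nothing about the file contents.
    if a.block_size != b.block_size:
        return "inconclusive"
    return "match" if a.checksum == b.checksum else "mismatch"


# Made-up sample values: same block size in the first pair, different in the second.
source = ChecksumInfo("aa11", 134217728)
copy_same_layout = ChecksumInfo("aa11", 134217728)
copy_other_layout = ChecksumInfo("bb22", 268435456)

print(compare(source, copy_same_layout))
print(compare(source, copy_other_layout))
```

Without the block size in the output, the second comparison would look like a content mismatch even though the files might be identical.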
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-dev-unsubscribe@hadoop.apache.org
For additional commands, e-mail: hdfs-dev-help@hadoop.apache.org