You are viewing a plain text version of this content. The canonical link for it is here.
Posted to hdfs-dev@hadoop.apache.org by "Gautam Gopalakrishnan (JIRA)" <ji...@apache.org> on 2015/08/28 05:42:45 UTC

[jira] [Created] (HDFS-8986) Add option to -du to calculate directory space usage excluding snapshots

Gautam Gopalakrishnan created HDFS-8986:
-------------------------------------------

             Summary: Add option to -du to calculate directory space usage excluding snapshots
                 Key: HDFS-8986
                 URL: https://issues.apache.org/jira/browse/HDFS-8986
             Project: Hadoop HDFS
          Issue Type: Improvement
          Components: snapshots
            Reporter: Gautam Gopalakrishnan


When running {{hadoop fs -du}} on a snapshotted directory (or one of its children), the report includes space consumed by blocks that are only present in the snapshots. This is confusing for end users.

{noformat}
$  hadoop fs -du -h -s /tmp/parent /tmp/parent/*
799.7 M  2.3 G  /tmp/parent
799.7 M  2.3 G  /tmp/parent/sub1

$ hdfs dfs -createSnapshot /tmp/parent snap1
Created snapshot /tmp/parent/.snapshot/snap1

$ hadoop fs -rm -skipTrash /tmp/parent/sub1/*
...

$ hadoop fs -du -h -s /tmp/parent /tmp/parent/*
799.7 M  2.3 G  /tmp/parent
799.7 M  2.3 G  /tmp/parent/sub1

$ hdfs dfs -deleteSnapshot /tmp/parent snap1
$ hadoop fs -du -h -s /tmp/parent /tmp/parent/*
0  0  /tmp/parent
0  0  /tmp/parent/sub1
{noformat}

It would be helpful if we had a flag, say -X, to exclude any snapshot related disk usage in the output




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)