You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-dev@hadoop.apache.org by "Tyler Hale (JIRA)" <ji...@apache.org> on 2015/06/23 22:29:45 UTC

[jira] [Created] (MAPREDUCE-6414) Distcp command very slow to enumerate files needing

Tyler Hale created MAPREDUCE-6414:
-------------------------------------

             Summary: Distcp command very slow to enumerate files needing
                 Key: MAPREDUCE-6414
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6414
             Project: Hadoop Map/Reduce
          Issue Type: Improvement
          Components: distcp
    Affects Versions: 2.5.0
         Environment: RHEL 6.5
            Reporter: Tyler Hale


When copying large amounts of data using distcp utility (100's of TBs), the distcp utility takes a large time to enumerate all of the files that have changed.  In my system, this corresponds to 14-16 hours before the actual copying of data begins. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)