You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-issues@hadoop.apache.org by "10000kang (Jira)" <ji...@apache.org> on 2022/12/21 04:34:00 UTC

[jira] [Created] (HADOOP-18582) No need to clean tmp files in ditcp direct mode

10000kang created HADOOP-18582:
----------------------------------

             Summary: No need to clean tmp files in ditcp direct mode
                 Key: HADOOP-18582
                 URL: https://issues.apache.org/jira/browse/HADOOP-18582
             Project: Hadoop Common
          Issue Type: Bug
          Components: tools/distcp
    Affects Versions: 3.3.4
            Reporter: 10000kang


it not necessary to do `cleanupTempFiles`  while ditcp commit job in direct , because it there is no temp files in direct mode.

This clean operation will increase the task execution time, because it goes back to get the list of files in the target path. When the number of files in the target path is very large, this operation will be very slow.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: common-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: common-issues-help@hadoop.apache.org