You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@impala.apache.org by "Joe McDonnell (JIRA)" <ji...@apache.org> on 2019/05/23 20:52:00 UTC

[jira] [Created] (IMPALA-8581) Switch hdfs commandline calls in dataload and tests to use "-d" on s3 where applicable

Joe McDonnell created IMPALA-8581:
-------------------------------------

             Summary: Switch hdfs commandline calls in dataload and tests to use "-d" on s3 where applicable
                 Key: IMPALA-8581
                 URL: https://issues.apache.org/jira/browse/IMPALA-8581
             Project: IMPALA
          Issue Type: Bug
          Components: Infrastructure
    Affects Versions: Impala 3.3.0
            Reporter: Joe McDonnell


When doing a put or copyFromLocal via the HDFS commandline, it puts the file into a temporary file and then moves it into place. The "-d" option skips the temporary file and copies the file directly into place. On S3, this avoids an extra copy and can also avoid consistency issues. We should look into using the "-d" option across our tests. Our usages of the HDFS commandline generally don't require atomicity of this file copy, so using "-d" is unlikely to impact HDFS tests.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)