You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Qiang.Kang (Jira)" <ji...@apache.org> on 2020/06/13 04:48:00 UTC

[jira] [Created] (HIVE-23685) Removing user's extra resources when executing File Merge Task

Qiang.Kang created HIVE-23685:
---------------------------------

             Summary: Removing user's extra resources when executing File Merge Task
                 Key: HIVE-23685
                 URL: https://issues.apache.org/jira/browse/HIVE-23685
             Project: Hive
          Issue Type: Bug
          Components: Physical Optimizer, Query Planning
            Reporter: Qiang.Kang
            Assignee: Qiang.Kang


Hi, we find that MapReduce's file merge map containers will download user's extra resources(such as: added jars, files, archives) before launching task. When these resources are large or the network is busy, file merge jobs will be timeout, causing the query be failed. As we all know, file merge task will run correctly just with hive-exec.jar and MapReduce framework. Therefore, there is no need to download user's resources. The patch below prevents setting `tmpjars` for FileMerge Task.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)