You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Chao Sun (JIRA)" <ji...@apache.org> on 2017/03/30 17:24:41 UTC

[jira] [Created] (HIVE-16337) HoS: use separate config for mapjoin hash table size limit rather than hive.auto.convert.join.noconditionaltask.size

Chao Sun created HIVE-16337:
-------------------------------

             Summary: HoS: use separate config for mapjoin hash table size limit rather than hive.auto.convert.join.noconditionaltask.size
                 Key: HIVE-16337
                 URL: https://issues.apache.org/jira/browse/HIVE-16337
             Project: Hive
          Issue Type: Improvement
          Components: Spark
            Reporter: Chao Sun


HoS currently share the same config {{hive.auto.convert.join.noconditionaltask.size}} with MR, however it interprets it differently: in MR this is the file size of the input, while in Spark it is the data size. This could cause Spark to be over conservative in many mapjoin optimizations. I suggest we use a separate config for HoS for this purpose.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)