You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@kylin.apache.org by "Shaofeng SHI (JIRA)" <ji...@apache.org> on 2017/01/18 01:29:26 UTC
[jira] [Created] (KYLIN-2404) Add "hive.merge.mapfiles" and
"hive.merge.mapredfiles" to kylin_hive_conf.xml
Shaofeng SHI created KYLIN-2404:
-----------------------------------
Summary: Add "hive.merge.mapfiles" and "hive.merge.mapredfiles" to kylin_hive_conf.xml
Key: KYLIN-2404
URL: https://issues.apache.org/jira/browse/KYLIN-2404
Project: Kylin
Issue Type: Improvement
Components: Job Engine
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI
Priority: Minor
Fix For: v2.0.0
Since 1.5.3, Kylin uses a "redistribute" step to merge the small files to proper size after creating the intermediate hive table. While in some users' environment, hive merge small files is enabled by default, that will cause additional CPU and will impact on the cube building performance (in extreme case the files will be merged to 256MB, then only very small number of mappers be started in building).
So Kylin should explicitly tell Hive to disable the merge small files feature when creating and redistributing the intermediate flat table. Will add "hive.merge.mapfiles" and "hive.merge.mapredfiles" to conf/kylin_hive_conf.xml with value "false".
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)