Posted to issues@kylin.apache.org by "Dayue Gao (JIRA)" <ji...@apache.org> on 2016/12/28 04:28:58 UTC
[jira] [Created] (KYLIN-2328) Reduce the size of metadata uploaded to distributed cache
Dayue Gao created KYLIN-2328:
--------------------------------
Summary: Reduce the size of metadata uploaded to distributed cache
Key: KYLIN-2328
URL: https://issues.apache.org/jira/browse/KYLIN-2328
Project: Kylin
Issue Type: Improvement
Components: Job Engine
Affects Versions: all
Reporter: Dayue Gao
Assignee: Dayue Gao
Fix For: v2.0.0
Currently, each MR job uploads all the metadata belonging to a cube to the distributed cache. As the total size of the metadata grows, the job submission time (shown as "MapReduce Waiting" in the Monitor UI) grows with it and can become noticeable.
We could reduce the amount of metadata uploaded according to the type of job. For example:
* CuboidJob needs only the dictionary of the segment being built
* CubeHFileJob doesn't need any dictionary
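
The idea above could be sketched roughly as follows. This is only an illustration, not Kylin's actual code: the class MetadataSelector, the JobType enum, and the assumption that dictionary resources can be recognized by a "/dict/" path component are all hypothetical. Job types not mentioned above keep the current behaviour and receive everything.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch: choose which metadata paths a job uploads. */
public class MetadataSelector {

    /** Hypothetical job categories, named after the two jobs mentioned above. */
    public enum JobType { CUBOID_JOB, CUBE_HFILE_JOB, OTHER }

    /** Assumption for illustration: dictionary resources contain "/dict/". */
    static boolean isDictionary(String path) {
        return path.contains("/dict/");
    }

    /**
     * Returns the subset of allPaths that a job of the given type needs.
     * segmentDictPaths lists the dictionaries of the segment being built.
     */
    public static List<String> select(JobType type,
                                      List<String> allPaths,
                                      List<String> segmentDictPaths) {
        List<String> result = new ArrayList<>();
        for (String path : allPaths) {
            if (!isDictionary(path)) {
                // Non-dictionary metadata is always uploaded.
                result.add(path);
            } else if (type == JobType.OTHER) {
                // Unclassified jobs keep the current behaviour: upload everything.
                result.add(path);
            } else if (type == JobType.CUBOID_JOB && segmentDictPaths.contains(path)) {
                // CuboidJob keeps only the dictionaries of the building segment.
                result.add(path);
            }
            // CUBE_HFILE_JOB: every dictionary is skipped.
        }
        return result;
    }
}
```

With this sketch, a CubeHFileJob over a cube with many segments would upload none of the dictionaries, while a CuboidJob would upload only those of its own segment.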