Posted to dev@kylin.apache.org by "ShengJun Zheng (Jira)" <ji...@apache.org> on 2021/06/28 10:02:00 UTC
[jira] [Created] (KYLIN-5019) Avoid building global dictionary from
all data of fact table each time
ShengJun Zheng created KYLIN-5019:
-------------------------------------
Summary: Avoid building global dictionary from all data of fact table each time
Key: KYLIN-5019
URL: https://issues.apache.org/jira/browse/KYLIN-5019
Project: Kylin
Issue Type: New Feature
Reporter: ShengJun Zheng
In each cubing job, global dictionaries are resized so that no single dictionary bucket exceeds its threshold. To decide whether a dictionary should be resized, Kylin reads all data from the fact table and runs a count distinct operation, then rebuilds the dictionary from all historical data.
A more lightweight approach would be better, since in most cases a dictionary does not need to be resized for an incremental build.
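
One possible shape for such a lightweight check, as a rough sketch only and not Kylin's actual implementation: look only at the distinct values of the new segment, keep those not yet in the dictionary, and test whether adding them would push any bucket over its threshold. The names below (bucket_of, needs_resize), the CRC32-based bucket assignment, and the in-memory set standing in for the existing dictionary are all assumptions made for illustration.

```python
import zlib
from collections import Counter


def bucket_of(value: str, num_buckets: int) -> int:
    # Hypothetical bucket assignment: a stable hash modulo the bucket count.
    # Kylin's real partitioning may differ.
    return zlib.crc32(value.encode("utf-8")) % num_buckets


def needs_resize(bucket_sizes, new_values, existing_values, threshold):
    """Return True if encoding the new segment would overflow any bucket.

    bucket_sizes    -- current entry count of each bucket (list of int)
    new_values      -- column values from the incremental segment only
    existing_values -- values already encoded (a set here, for illustration)
    threshold       -- maximum number of entries allowed per bucket
    """
    num_buckets = len(bucket_sizes)
    # Only values absent from the dictionary will add entries.
    fresh = set(new_values) - existing_values
    added = Counter(bucket_of(v, num_buckets) for v in fresh)
    return any(bucket_sizes[b] + n > threshold for b, n in added.items())


if __name__ == "__main__":
    existing = {"a", "b", "c", "d"}
    sizes = [2, 2]
    # Segment contains only known values, so no resize is needed.
    print(needs_resize(sizes, ["a", "b", "a"], existing, threshold=3))
    # Three unseen values across two buckets must overflow at least one.
    print(needs_resize(sizes, ["e", "f", "g"], existing, threshold=3))
```

With a check like this, the full scan of the fact table and the rebuild from historical data would only be needed when the function returns True; otherwise the new values could simply be appended to the existing buckets.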
--
This message was sent by Atlassian Jira
(v8.3.4#803005)