Posted to issues@kylin.apache.org by "liyang (JIRA)" <ji...@apache.org> on 2016/07/08 07:38:11 UTC
[jira] [Updated] (KYLIN-1844) Hold huge dictionary in 2nd storage like disk/hbase
[ https://issues.apache.org/jira/browse/KYLIN-1844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
liyang updated KYLIN-1844:
--------------------------
Summary: Hold huge dictionary in 2nd storage like disk/hbase (was: High cardinality dimensions in memory)
> Hold huge dictionary in 2nd storage like disk/hbase
> ---------------------------------------------------
>
> Key: KYLIN-1844
> URL: https://issues.apache.org/jira/browse/KYLIN-1844
> Project: Kylin
> Issue Type: Improvement
> Components: Query Engine
> Affects Versions: v1.2, v1.5.2
> Reporter: Abhilash L L
> Assignee: liyang
>
> A whole dimension is kept in memory.
> We should have a way to keep only a certain number or size of rows in memory. An LRU cache for the rows of a dimension would help keep memory in check.
> Why not store all the dimension data in HBase, in a separate table keyed with a dimensionid prefix, so that all calls to a dimension (a get based on the dim key) are mapped to HBase?
> This does mean it will cost more time on a miss.
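
The proposal above could be sketched roughly as follows. This is only an illustration in Python, not Kylin code: the class name `LruDictionaryCache`, the `dimension_id:dim_key` row-key layout, and the in-memory dict standing in for the HBase table are all assumptions made for the example.

```python
from collections import OrderedDict


class LruDictionaryCache:
    """Bounded LRU cache in front of a slower secondary store (hypothetical sketch)."""

    def __init__(self, capacity, load_from_store):
        if capacity < 1:
            raise ValueError("capacity must be at least 1")
        self.capacity = capacity
        # Callable that fetches one row from secondary storage by row key.
        self._load = load_from_store
        # Insertion order doubles as recency order: oldest entry first.
        self._rows = OrderedDict()
        self.hits = 0
        self.misses = 0

    def get(self, dimension_id, dim_key):
        # Assumed row-key layout: dimension id as a prefix, then the dim key.
        row_key = f"{dimension_id}:{dim_key}"
        if row_key in self._rows:
            self._rows.move_to_end(row_key)  # mark as most recently used
            self.hits += 1
            return self._rows[row_key]
        # Miss: pay the cost of a lookup in the secondary store.
        self.misses += 1
        value = self._load(row_key)
        self._rows[row_key] = value
        if len(self._rows) > self.capacity:
            self._rows.popitem(last=False)  # evict the least recently used row
        return value


# A plain dict stands in for the secondary table in this sketch.
backing_table = {f"country:{i}": f"value-{i}" for i in range(5)}
cache = LruDictionaryCache(capacity=2, load_from_store=backing_table.__getitem__)

cache.get("country", 0)  # miss
cache.get("country", 1)  # miss
cache.get("country", 0)  # hit, "country:0" becomes most recent
cache.get("country", 2)  # miss, evicts "country:1"
```

Memory stays bounded by `capacity` regardless of the dimension's cardinality, and the `misses` counter makes the extra cost of going to secondary storage visible.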
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)