You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@carbondata.apache.org by "Manish Gupta (JIRA)" <ji...@apache.org> on 2016/07/19 12:05:20 UTC
[jira] [Created] (CARBONDATA-80) Dictionary values should be
equally distributed in buckets while loading in memory
Manish Gupta created CARBONDATA-80:
--------------------------------------
Summary: Dictionary values should be equally distributed in buckets while loading in memory
Key: CARBONDATA-80
URL: https://issues.apache.org/jira/browse/CARBONDATA-80
Project: CarbonData
Issue Type: Improvement
Reporter: Manish Gupta
Assignee: Manish Gupta
Priority: Minor
Whenever a query is executed, dictionary for columns queried is loaded in memory. For incremental loads dictionary values are loaded incrementally and thus one list contains several sub lists with dictionary values.
The dictionary values on incremental load may not be equally distributed in the sub buckets and this might increase the search time of a value if there are too many incremental loads.
Therefore the dictionary values should be divided equally in the sub buckets.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)