Posted to dev@hive.apache.org by "Kevin Wilfong (JIRA)" <ji...@apache.org> on 2013/03/28 20:01:17 UTC

[jira] [Assigned] (HIVE-4244) Make string dictionaries adaptive in ORC

     [ https://issues.apache.org/jira/browse/HIVE-4244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Kevin Wilfong reassigned HIVE-4244:
-----------------------------------

    Assignee: Kevin Wilfong  (was: Owen O'Malley)
    
> Make string dictionaries adaptive in ORC
> ----------------------------------------
>
>                 Key: HIVE-4244
>                 URL: https://issues.apache.org/jira/browse/HIVE-4244
>             Project: Hive
>          Issue Type: Bug
>          Components: Serializers/Deserializers
>            Reporter: Owen O'Malley
>            Assignee: Kevin Wilfong
>
> The ORC writer should adaptively switch between dictionary and direct encoding. I'd propose looking at the first 100,000 values in each column and deciding whether there is sufficient loading in the dictionary to justify dictionary encoding.
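
The proposal above could be sketched roughly as follows. This is only an illustration in Python, not the ORC writer's code: the function name choose_encoding, the "distinct values / sampled values" measure of dictionary loading, and the 0.5 cutoff are all assumptions made for the example. Only the 100,000-value sample size comes from the issue description.

```python
from itertools import islice
from typing import Iterable

SAMPLE_SIZE = 100_000       # sample size proposed in the issue description
MAX_DISTINCT_RATIO = 0.5    # hypothetical cutoff, not taken from the issue


def choose_encoding(
    values: Iterable[str],
    sample_size: int = SAMPLE_SIZE,
    max_distinct_ratio: float = MAX_DISTINCT_RATIO,
) -> str:
    """Return "dictionary" or "direct" based on the first sample_size values.

    Assumption: the dictionary is considered well loaded when the share of
    distinct values in the sample is at or below max_distinct_ratio.
    """
    distinct = set()
    seen = 0
    # Only the leading sample_size values of the column are inspected.
    for value in islice(values, sample_size):
        distinct.add(value)
        seen += 1
    if seen == 0:
        # An empty column gains nothing from a dictionary.
        return "direct"
    ratio = len(distinct) / seen
    return "dictionary" if ratio <= max_distinct_ratio else "direct"
```

For example, a column that repeats a handful of strings would be sent to dictionary encoding under these assumptions, while a column of unique identifiers would fall back to direct encoding. A real writer would likely also weigh the byte size of the dictionary, which this sketch ignores.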

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators.
For more information on JIRA, see: http://www.atlassian.com/software/jira