Posted to issues@hive.apache.org by "Sergey Shelukhin (JIRA)" <ji...@apache.org> on 2017/10/30 18:21:00 UTC

[jira] [Updated] (HIVE-15670) column_stats_accurate may not fit in PARTITION_PARAMS.VALUE

     [ https://issues.apache.org/jira/browse/HIVE-15670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sergey Shelukhin updated HIVE-15670:
------------------------------------
    Description: 
The JSON can be too big with many columns (see the setColumnStatsState method).
Even within JSON this can be stored more efficiently; it could also be stored in some other, more compact representation, potentially outside the table properties.


  was:
The JSON can be too big with many columns (see setColumnStatsState method).
We can make the JSON more compact by storing only the list of columns with true values. Alternatively, we can store a bitmask in a dedicated column and adjust it when altering the table (which is rare enough). Or we can change the VALUE column to a text blob, though that might be a painful change with respect to upgrade scripts and to supporting the varied blob implementations of all the DBs, especially in directsql.
Storing denormalized flags in a separate table would probably be comparatively slow.
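
To make the size trade-off between these options concrete, here is a rough sketch in Python (not Hive code). It assumes the stored value is a JSON object with one "true" flag per column, and it assumes a 4000-character limit on VALUE purely for illustration. The function names, the JSON key names, and the limit are all hypothetical and should be checked against the actual metastore schema and setColumnStatsState.

```python
import json

# Assumed limit, for illustration only; the real width of the VALUE column
# depends on the metastore schema and the backing database.
ASSUMED_VALUE_LIMIT = 4000


def verbose_state(columns, accurate):
    """Per-column flag map: one "true" entry for every accurate column."""
    flags = {c: "true" for c in columns if c in accurate}
    return json.dumps({"BASIC_STATS": "true", "COLUMN_STATS": flags},
                      separators=(",", ":"))


def compact_list_state(columns, accurate):
    """Still JSON, but stores only the names of the accurate columns."""
    names = [c for c in columns if c in accurate]
    return json.dumps({"BASIC_STATS": "true", "COLUMN_STATS": names},
                      separators=(",", ":"))


def bitmask_state(columns, accurate):
    """One bit per column, indexed by schema position, hex-encoded."""
    mask = 0
    for i, c in enumerate(columns):
        if c in accurate:
            mask |= 1 << i
    return format(mask, "x")


def columns_from_bitmask(columns, encoded):
    """Decode the hex bitmask back into column names, in schema order."""
    mask = int(encoded, 16)
    return [c for i, c in enumerate(columns) if (mask >> i) & 1]


if __name__ == "__main__":
    # A wide table with made-up column names, all with accurate stats.
    columns = [f"column_name_{i:04d}" for i in range(500)]
    accurate = set(columns)
    for label, encode in [("flag map", verbose_state),
                          ("name list", compact_list_state),
                          ("bitmask", bitmask_state)]:
        value = encode(columns, accurate)
        print(label, len(value), "fits" if len(value) <= ASSUMED_VALUE_LIMIT else "too big")
```

In this sketch the name list shrinks the value but still grows with the column names, so it only postpones the overflow. The bitmask grows by one hex digit per four columns, but it is tied to column positions, which is why it has to be adjusted whenever the table is altered.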



> column_stats_accurate may not fit in PARTITION_PARAMS.VALUE
> -----------------------------------------------------------
>
>                 Key: HIVE-15670
>                 URL: https://issues.apache.org/jira/browse/HIVE-15670
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Sergey Shelukhin
>
> The JSON can be too big with many columns (see the setColumnStatsState method).
> Even within JSON this can be stored more efficiently; it could also be stored in some other, more compact representation, potentially outside the table properties.


