You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Kevin Wilfong (JIRA)" <ji...@apache.org> on 2013/01/15 01:54:13 UTC

[jira] [Updated] (HIVE-3897) Add a way to get the uncompressed/compressed sizes of columns from an RC File

     [ https://issues.apache.org/jira/browse/HIVE-3897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Kevin Wilfong updated HIVE-3897:
--------------------------------

    Attachment: HIVE-3897.1.patch.txt
    
> Add a way to get the uncompressed/compressed sizes of columns from an RC File
> -----------------------------------------------------------------------------
>
>                 Key: HIVE-3897
>                 URL: https://issues.apache.org/jira/browse/HIVE-3897
>             Project: Hive
>          Issue Type: New Feature
>    Affects Versions: 0.11.0
>            Reporter: Kevin Wilfong
>            Assignee: Kevin Wilfong
>         Attachments: HIVE-3897.1.patch.txt
>
>
> The uncompressed, compressed size of each column of an RCFile is stored in the header of an RCFile block.  Currently, we have no convenient way to get at this data.  This would be useful for identifying where RCFile is doing a poor job of compression, so that we can better focus our efforts.
> RCFileCat seems like a logical tool to extend to add this.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira