You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Kevin Liew (JIRA)" <ji...@apache.org> on 2016/07/14 15:39:20 UTC

[jira] [Commented] (HIVE-13680) HiveServer2: Provide a way to compress ResultSets

    [ https://issues.apache.org/jira/browse/HIVE-13680?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15377135#comment-15377135 ] 

Kevin Liew commented on HIVE-13680:
-----------------------------------

I attached an updated proposal. I think we can forward with this but additional feedback is welcome.

[~alangates] I looked more into the ORC format. ORC files are optimized through indexes and metadata for retrieving subsets of stored data. When serving remote clients: we have already extracted the final result-set, so ORC metadata is just overhead. ORC also does not provide plugglable compressors. 

> HiveServer2: Provide a way to compress ResultSets
> -------------------------------------------------
>
>                 Key: HIVE-13680
>                 URL: https://issues.apache.org/jira/browse/HIVE-13680
>             Project: Hive
>          Issue Type: Sub-task
>          Components: HiveServer2, JDBC
>            Reporter: Vaibhav Gumashta
>            Assignee: Kevin Liew
>         Attachments: proposal.pdf
>
>
> With HIVE-12049 in, we can provide an option to compress ResultSets before writing to disk. The user can specify a compression library via a config param which can be used in the tasks.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)