You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Alex McLintock (JIRA)" <ji...@apache.org> on 2014/11/05 12:18:33 UTC

[jira] [Commented] (HIVE-7858) Parquet compression should be configurable via table property

    [ https://issues.apache.org/jira/browse/HIVE-7858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14198231#comment-14198231 ] 

Alex McLintock commented on HIVE-7858:
--------------------------------------

(I'm trying to understand the scope of this bug/bugfix)

For internal tables within Hive we can only create parquet files with the default compression format - up until Hive version 0.15 when this fix will be generally available. 

Can I ask how this affects external tables? 
Assuming that Hive can read a parquet file supplied to it does it need to be told what compression format was used?

 

> Parquet compression should be configurable via table property
> -------------------------------------------------------------
>
>                 Key: HIVE-7858
>                 URL: https://issues.apache.org/jira/browse/HIVE-7858
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Brock Noland
>            Assignee: Ferdinand Xu
>              Labels: TODOC15
>             Fix For: 0.15.0
>
>         Attachments: HIVE-7858.1.patch, HIVE-7858.patch, HIVE-7858.patch
>
>
> ORC supports the orc.compress table property:
> https://cwiki.apache.org/confluence/display/Hive/LanguageManual+ORC
> {noformat}
> create table Addresses (
>   name string,
>   street string,
>   city string,
>   state string,
>   zip int
> ) stored as orc tblproperties ("orc.compress"="NONE");
> {noformat}
> I think it'd be great to support the same for Parquet.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)