Posted to dev@hive.apache.org by "Szehon Ho (JIRA)" <ji...@apache.org> on 2014/04/01 00:07:27 UTC

[jira] [Commented] (HIVE-6783) Incompatible schema for maps between parquet-hive and parquet-pig

    [ https://issues.apache.org/jira/browse/HIVE-6783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13955782#comment-13955782 ] 

Szehon Ho commented on HIVE-6783:
---------------------------------

Thanks. My only concern was not breaking existing stored maps, so I'm OK if that's the case. I do wonder, though: if that is the case, shouldn't Pig be able to read Hive's maps, since we are now using Pig's schema to read? Or is there some difference there?


> Incompatible schema for maps between parquet-hive and parquet-pig
> -----------------------------------------------------------------
>
>                 Key: HIVE-6783
>                 URL: https://issues.apache.org/jira/browse/HIVE-6783
>             Project: Hive
>          Issue Type: Bug
>          Components: File Formats
>    Affects Versions: 0.13.0
>            Reporter: Tongjie Chen
>             Fix For: 0.13.0
>
>         Attachments: HIVE-6783.1.patch.txt, HIVE-6783.2.patch.txt, HIVE-6783.3.patch.txt, HIVE-6783.4.patch.txt
>
>
> See also the following Parquet issue:
> https://github.com/Parquet/parquet-mr/issues/290
> The schema written for maps isn't compatible between hive and pig. This means any files written in one cannot be properly read in the other.
> More specifically, for the same map column c1, parquet-pig generates the schema:
> message pig_schema {
>   optional group c1 (MAP) {
>     repeated group map (MAP_KEY_VALUE) {
>       required binary key (UTF8);
>       optional binary value;
>     }   
>   }
> }
> while parquet-hive generates the schema:
> message hive_schema {
>   optional group c1 (MAP_KEY_VALUE) {
>     repeated group map {
>       required binary key;
>       optional binary value;
>     }
>   }
> }
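
To make the mismatch in the quoted description concrete, here is a minimal Python sketch that extracts the annotation on each field of the two schemas above and lists where they differ. It is illustrative only: it is not part of parquet-mr or of the attached patches, the helper names annotations() and annotation_diff() are hypothetical, and the line-based parser only handles the small subset of the schema syntax shown in this issue.

```python
import re

PIG_SCHEMA = """
message pig_schema {
  optional group c1 (MAP) {
    repeated group map (MAP_KEY_VALUE) {
      required binary key (UTF8);
      optional binary value;
    }
  }
}
"""

HIVE_SCHEMA = """
message hive_schema {
  optional group c1 (MAP_KEY_VALUE) {
    repeated group map {
      required binary key;
      optional binary value;
    }
  }
}
"""

# Matches: repetition, type, field name, and an optional "(ANNOTATION)".
FIELD = re.compile(
    r"^(required|optional|repeated)\s+(\w+)\s+(\w+)(?:\s+\((\w+)\))?"
)


def annotations(schema_text):
    """Map each dotted field path to its annotation, or None if absent.

    Hypothetical helper: assumes one field declaration per line.
    """
    result = {}
    stack = []  # names of the enclosing groups
    for raw in schema_text.splitlines():
        line = raw.strip()
        if line.startswith("}"):
            if stack:  # the last "}" closes the message itself
                stack.pop()
            continue
        match = FIELD.match(line)
        if not match:
            continue  # blank line or the "message ... {" header
        name, annotation = match.group(3), match.group(4)
        result[".".join(stack + [name])] = annotation
        if line.endswith("{"):
            stack.append(name)
    return result


def annotation_diff(left, right):
    """Return {path: (left_annotation, right_annotation)} where they differ."""
    paths = sorted(set(left) | set(right))
    return {
        p: (left.get(p), right.get(p))
        for p in paths
        if left.get(p) != right.get(p)
    }


if __name__ == "__main__":
    diff = annotation_diff(annotations(PIG_SCHEMA), annotations(HIVE_SCHEMA))
    for path, (pig, hive) in diff.items():
        print(f"{path}: pig={pig} hive={hive}")
```

On the two schemas quoted above, the differences are confined to the annotations on c1, c1.map and c1.map.key; the field names, types and repetitions are identical.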



--
This message was sent by Atlassian JIRA
(v6.2#6252)