You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Mickael Lacour (JIRA)" <ji...@apache.org> on 2014/11/17 19:04:34 UTC

[jira] [Updated] (HIVE-8359) Map containing null values are not correctly written in Parquet files

     [ https://issues.apache.org/jira/browse/HIVE-8359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Mickael Lacour updated HIVE-8359:
---------------------------------
    Attachment: HIVE-8359.2.patch

I only did two minor changes on :

* ql/src/java/org/apache/hadoop/hive/ql/io/parquet/convert/ArrayWritableGroupConverter.java
* ql/src/java/org/apache/hadoop/hive/ql/io/parquet/serde/ParquetHiveSerDe.java

And added a qtest (same as HIVE-6994). I used the patch available on the review link.

> Map containing null values are not correctly written in Parquet files
> ---------------------------------------------------------------------
>
>                 Key: HIVE-8359
>                 URL: https://issues.apache.org/jira/browse/HIVE-8359
>             Project: Hive
>          Issue Type: Bug
>          Components: File Formats
>    Affects Versions: 0.13.1
>            Reporter: Frédéric TERRAZZONI
>            Assignee: Sergio Peña
>         Attachments: HIVE-8359.1.patch, HIVE-8359.2.patch, map_null_val.avro
>
>
> Tried write a map<string,string> column in a Parquet file. The table should contain :
> {code}
> {"key3":"val3","key4":null}
> {"key3":"val3","key4":null}
> {"key1":null,"key2":"val2"}
> {"key3":"val3","key4":null}
> {"key3":"val3","key4":null}
> {code}
> ... and when you do a query like {code}SELECT * from mytable{code}
> We can see that the table is corrupted :
> {code}
> {"key3":"val3"}
> {"key4":"val3"}
> {"key3":"val2"}
> {"key4":"val3"}
> {"key1":"val3"}
> {code}
> I've not been able to read the Parquet file in our software afterwards, and consequently I suspect it to be corrupted. 
> For those who are interested, I generated this Parquet table from an Avro file. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)