You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by "Christian Nguyen Van Than (JIRA)" <ji...@apache.org> on 2015/08/03 18:06:04 UTC
[jira] [Created] (PARQUET-354) Question on parquet-protobuf and
parquet-pig
Christian Nguyen Van Than created PARQUET-354:
-------------------------------------------------
Summary: Question on parquet-protobuf and parquet-pig
Key: PARQUET-354
URL: https://issues.apache.org/jira/browse/PARQUET-354
Project: Parquet
Issue Type: Bug
Components: parquet-mr
Affects Versions: 1.8.0
Reporter: Christian Nguyen Van Than
Hi,
I have a question about protobuf to parquet conversion.
I have a message like this (simplified) :
{code}
message MyMessage {
repeated string language = 1;
}
{code}
parquet-protobuf convert it to the following schema :
{code}
message MyMessage {
repeated binary language (UTF8);
}
{code}
But, according to TestPigSchameConverter.java, the correct schema should be :
{code}
message MyMessage {
optional group language (LIST) {
repeated binary value (UTF8);
}
}
{code}
Language is an optional list of language, i want to store zero or more language.
Who have the correct schema for my case ? parquet-protobuf or parquet-pig ?
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)