You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by "Christian Nguyen Van Than (JIRA)" <ji...@apache.org> on 2015/08/03 18:06:04 UTC

[jira] [Created] (PARQUET-354) Question on parquet-protobuf and parquet-pig

Christian Nguyen Van Than created PARQUET-354:
-------------------------------------------------

             Summary: Question on parquet-protobuf and parquet-pig
                 Key: PARQUET-354
                 URL: https://issues.apache.org/jira/browse/PARQUET-354
             Project: Parquet
          Issue Type: Bug
          Components: parquet-mr
    Affects Versions: 1.8.0
            Reporter: Christian Nguyen Van Than


Hi,

I have a question about protobuf to parquet conversion.
I have a message like this (simplified) :
{code} 
message MyMessage {
  repeated string language = 1;
}
{code}

parquet-protobuf convert it to the following schema :

{code}
message MyMessage {
  repeated binary language (UTF8);
}
{code}

But, according to TestPigSchameConverter.java, the correct schema should be :

{code}
message MyMessage {
  optional group language (LIST) {
    repeated binary value (UTF8);
  }
}
{code}

Language is an optional list of language, i want to store zero or more language.

Who have the correct schema for my case ? parquet-protobuf or parquet-pig ?




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)