You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by "Benoit Hanotte (JIRA)" <ji...@apache.org> on 2018/02/14 18:19:00 UTC

[jira] [Created] (PARQUET-1216) Parquet protobuf: List wrapper should be optional

Benoit Hanotte created PARQUET-1216:
---------------------------------------

             Summary: Parquet protobuf: List wrapper should be optional
                 Key: PARQUET-1216
                 URL: https://issues.apache.org/jira/browse/PARQUET-1216
             Project: Parquet
          Issue Type: Bug
          Components: parquet-mr
    Affects Versions: 1.9.0
            Reporter: Benoit Hanotte


Following [https://github.com/apache/parquet-mr/pull/411] , parquet-protobuf will conform to the specs[1] regarding the definition of lists and will adopt the 3-level structure:
{code:java}
*required* group my_list (LIST) {
  repeated group list {
    optional binary element (UTF8);
  }
}{code}
The first level has the repetition level *required*, however this won't allow making the distinction between a null list and an empty one.

This ticket is for changing the root level repetition to *optional* as following:
{code:java}
*optional* group my_list (LIST) {
  repeated group list {
    optional binary element (UTF8);
  }
}
{code}
This will allow differentiating between null and empty lists.

[1] https://github.com/apache/parquet-format/blob/master/LogicalTypes.md#lists



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)