You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by Laurynas Katkus <la...@mambu.com> on 2023/02/01 16:09:31 UTC

Parquet array schema incompatibilities

Hello,

I wanted to raise attention to incompatibilities when it comes to Parquet,
Avro and parquet-cli. My main findings can be found here:
https://github.com/MrR0807/Notes/blob/master/parquet-not-working-cases.md#simple-schema-with-array.
But in short, recommended schema definition for Lists as per parquet-format
(https://github.com/apache/parquet-format/blob/master/LogicalTypes.md#lists)
does not work well with Avro, parquet-cli or just in general. I wonder what
you think about this? Is it something that should be explicitly addressed
at least in documentation? Are you aware of these problems? I can create PR
into documentation, but before that I wanted to validate it with you.

Thank you,
Laurynas