You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Jonathan diamant (Jira)" <ji...@apache.org> on 2022/07/31 12:26:00 UTC

[jira] [Created] (FLINK-28756) Can't write nested object to parquet flink 1.15.1

Jonathan diamant created FLINK-28756:
----------------------------------------

             Summary: Can't write nested object to parquet flink 1.15.1
                 Key: FLINK-28756
                 URL: https://issues.apache.org/jira/browse/FLINK-28756
             Project: Flink
          Issue Type: New Feature
          Components: Formats (JSON, Avro, Parquet, ORC, SequenceFile), Table SQL / API
    Affects Versions: 1.15.1
         Environment: Local running and Flink on docker
            Reporter: Jonathan diamant


Hey, I'm trying to write nested objects to parquet files with Table api (by specifying the schema). I did manage to write flattened objects to parquet files. The objects I want to write contain array of objects. The schema I used is 

 
{code:java}
 DataTypes. ARRAY(DataTypes.ROW({code}
{code:java}
DataTypes.FIELD(...),...)){code}
 

When I tried to run it locally I got the exception:

 

empty fields are illegal, the field should be ommited completely instead

 

And when I debugged the code I saw that the method 
{code:java}
 Public void write(Array Data array Data, int ordinal) {code}
 

 

In the class RowWriter in ParquetRowDataWriter is not implemented and thus the content of the array cannot be written.

 

I have wondered if the method can be implemented or what it takes to implement it. 

(I have thought that this can be solved by call the write(RowData row) method of the same class with the correct parameters from the arrayData).



--
This message was sent by Atlassian Jira
(v8.20.10#820010)