You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Nishanth (Jira)" <ji...@apache.org> on 2022/08/17 18:29:00 UTC

[jira] [Created] (ARROW-17450) Arrow-Parquet cannot read columns with Run Length Encoding (RLE)

Nishanth created ARROW-17450:
--------------------------------

             Summary: Arrow-Parquet cannot read columns with Run Length Encoding (RLE)
                 Key: ARROW-17450
                 URL: https://issues.apache.org/jira/browse/ARROW-17450
             Project: Apache Arrow
          Issue Type: Bug
          Components: C++
    Affects Versions: 9.0.0
            Reporter: Nishanth


Reading from Arrow-Parquet c++, Parquet files with RLE encoding in columns error out with 
{code:java}
"Unknown encoding type."{code}

The error is thrown only in arrow-parquet c++ and error is due to RLE encoding not defined in the decoder. 

 

[https://github.com/apache/arrow/blob/master/cpp/src/parquet/column_reader.cc#L769|http://example.com/]

The files were generated from Athena using Iceberg, with the following query. 



 
{code:java}
create table test (d_bool1 boolean)
LOCATION 's3://'
TBLPROPERTIES (
'table_type'='ICEBERG',
'format'='parquet'
);
insert into  test VALUES (true);
{code}
 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)