You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Nishanth (Jira)" <ji...@apache.org> on 2022/08/17 18:29:00 UTC
[jira] [Created] (ARROW-17450) Arrow-Parquet cannot read columns with Run Length Encoding (RLE)
Nishanth created ARROW-17450:
--------------------------------
Summary: Arrow-Parquet cannot read columns with Run Length Encoding (RLE)
Key: ARROW-17450
URL: https://issues.apache.org/jira/browse/ARROW-17450
Project: Apache Arrow
Issue Type: Bug
Components: C++
Affects Versions: 9.0.0
Reporter: Nishanth
Reading from Arrow-Parquet c++, Parquet files with RLE encoding in columns error out with
{code:java}
"Unknown encoding type."{code}
The error is thrown only in arrow-parquet c++ and error is due to RLE encoding not defined in the decoder.
[https://github.com/apache/arrow/blob/master/cpp/src/parquet/column_reader.cc#L769|http://example.com/]
The files were generated from Athena using Iceberg, with the following query.
{code:java}
create table test (d_bool1 boolean)
LOCATION 's3://'
TBLPROPERTIES (
'table_type'='ICEBERG',
'format'='parquet'
);
insert into test VALUES (true);
{code}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)