Posted to issues@spark.apache.org by "Kazuyuki Tanimura (Jira)" <ji...@apache.org> on 2022/11/10 09:00:00 UTC

[jira] [Created] (SPARK-41096) Support reading parquet FIXED_LEN_BYTE_ARRAY type

Kazuyuki Tanimura created SPARK-41096:
-----------------------------------------

             Summary: Support reading parquet FIXED_LEN_BYTE_ARRAY type
                 Key: SPARK-41096
                 URL: https://issues.apache.org/jira/browse/SPARK-41096
             Project: Spark
          Issue Type: Improvement
          Components: SQL
    Affects Versions: 3.4.0
            Reporter: Kazuyuki Tanimura


Parquet has a FIXED_LEN_BYTE_ARRAY (FLBA) data type. However, the Spark Parquet reader currently cannot handle it.
The proposal is to read FLBA columns as BinaryType in Spark.
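
For illustration only, here is a minimal Python sketch of why FLBA values map naturally onto a binary type. In Parquet's PLAIN encoding, FLBA values are stored back to back with no length prefix (the length comes from the column's declared type length), whereas BYTE_ARRAY values each carry a 4-byte little-endian length prefix. The function names below are hypothetical and are not part of Spark or any Parquet library; a real reader works on pages and column vectors rather than Python lists.

```python
from typing import List


def decode_plain_flba(buf: bytes, type_length: int) -> List[bytes]:
    """Split a PLAIN-encoded FIXED_LEN_BYTE_ARRAY buffer into values.

    Hypothetical helper: every value is exactly type_length bytes and
    values are laid out back to back with no length prefix.
    """
    if type_length <= 0:
        raise ValueError("type_length must be positive")
    if len(buf) % type_length != 0:
        raise ValueError("buffer size is not a multiple of type_length")
    return [buf[i:i + type_length] for i in range(0, len(buf), type_length)]


def decode_plain_byte_array(buf: bytes) -> List[bytes]:
    """Decode PLAIN-encoded BYTE_ARRAY values, shown for contrast.

    Hypothetical helper: each value is preceded by a 4-byte
    little-endian length.
    """
    values = []
    pos = 0
    while pos < len(buf):
        if pos + 4 > len(buf):
            raise ValueError("truncated length prefix")
        n = int.from_bytes(buf[pos:pos + 4], "little")
        pos += 4
        if pos + n > len(buf):
            raise ValueError("truncated value")
        values.append(buf[pos:pos + n])
        pos += n
    return values


# Two 4-byte FLBA values; each one is an opaque byte string, which is
# the kind of value a binary column would hold.
flba_values = decode_plain_flba(b"abcdwxyz", 4)

# The same idea for variable-length BYTE_ARRAY: "hi" then "x".
ba_values = decode_plain_byte_array(b"\x02\x00\x00\x00hi\x01\x00\x00\x00x")
```

Both decoders yield plain byte strings, so under this sketch the only difference a reader needs to handle is how value boundaries are found.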

The Iceberg Parquet reader, for example, can handle FLBA. This improvement should narrow the gap between the Spark and Iceberg Parquet readers.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)