You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/11/10 12:57:00 UTC

[jira] [Commented] (PARQUET-1928) Interpret Parquet INT96 type as FIXED[12] AVRO Schema

    [ https://issues.apache.org/jira/browse/PARQUET-1928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17229195#comment-17229195 ] 

ASF GitHub Bot commented on PARQUET-1928:
-----------------------------------------

anantdamle commented on pull request #831:
URL: https://github.com/apache/parquet-mr/pull/831#issuecomment-724684358


   Adding @rdblue  @tomwhite for review
   
   
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


> Interpret Parquet INT96 type as FIXED[12] AVRO Schema
> -----------------------------------------------------
>
>                 Key: PARQUET-1928
>                 URL: https://issues.apache.org/jira/browse/PARQUET-1928
>             Project: Parquet
>          Issue Type: Bug
>          Components: parquet-avro
>    Affects Versions: 1.11.0
>            Reporter: Anant Damle
>            Priority: Minor
>              Labels: patch
>             Fix For: 1.12.0
>
>
> Reading Parquet files in Apache Beam using ParquetIO uses `AvroParquetReader` causing it to throw `IllegalArgumentException("INT96 not implemented and is deprecated")`
> Customers have large datasets which can't be reprocessed again to convert into a supported type. An easier approach would be to convert into a byte array of 12 bytes, that can then be interpreted by the developer in any way they want to interpret it.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)