You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by "Timothy Miller (Jira)" <ji...@apache.org> on 2022/04/08 16:35:00 UTC

[jira] [Commented] (PARQUET-2133) Support Int8 and Int16 as basic type

    [ https://issues.apache.org/jira/browse/PARQUET-2133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17519690#comment-17519690 ] 

Timothy Miller commented on PARQUET-2133:
-----------------------------------------

Have you started working on implementing this? What is your progress. I'd be happy to work with you on it.

> Support Int8 and Int16 as basic type
> ------------------------------------
>
>                 Key: PARQUET-2133
>                 URL: https://issues.apache.org/jira/browse/PARQUET-2133
>             Project: Parquet
>          Issue Type: Improvement
>          Components: parquet-format
>            Reporter: Jackey Lee
>            Priority: Major
>
>  Int8 and Int16 are not supported as basic in previos version. Using 4 bytes to store int8 seems not a good idea, which means requiring more storage and read and write very slow. Besides, it is not friendly with regular computing format, such as velox, arrow, vector and so on.
> With Int8 and Int16 supported, we can get less storage and better performance on reading and writing. As for forward compatible, we can use version in FileMetaData to choose how to read parquet data.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)