You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by "Lars Volker (JIRA)" <ji...@apache.org> on 2017/03/09 00:24:38 UTC

[jira] [Commented] (PARQUET-323) INT96 should be marked as deprecated

    [ https://issues.apache.org/jira/browse/PARQUET-323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15902218#comment-15902218 ] 

Lars Volker commented on PARQUET-323:
-------------------------------------

We discussed this issue in today's Parquet sync and agreed to deprecate INT96. As a replacement to store timestamps (the most common use for INT96) we will encourage all projects who currently use INT96 to switch to INT64 and either use the TIMESTAMP_MILLIS or TIMESTAMP_MICROS logical types.

We will not fix the ordering issues around INT96 that resulted in parquet-mr writing wrong min/max statistics.

> INT96 should be marked as deprecated
> ------------------------------------
>
>                 Key: PARQUET-323
>                 URL: https://issues.apache.org/jira/browse/PARQUET-323
>             Project: Parquet
>          Issue Type: Bug
>          Components: parquet-format
>            Reporter: Cheng Lian
>
> As discussed in the mailing list, {{INT96}} is only used to represent nanosec timestamp in Impala for some historical reasons, and should be deprecated. Since nanosec precision is rarely a real requirement, one possible and simple solution would be replacing {{INT96}} with {{INT64 (TIMESTAMP_MILLIS)}} or {{INT64 (TIMESTAMP_MICROS)}}.
> Several projects (Impala, Hive, Spark, ...) support INT96.
> We need a clear spec of the replacement and the path to deprecation.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)