You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/06/06 00:24:01 UTC

[jira] [Work logged] (HIVE-23036) Incorrect ORC PPD eval with sub-millisecond timestamps

     [ https://issues.apache.org/jira/browse/HIVE-23036?focusedWorklogId=442096&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-442096 ]

ASF GitHub Bot logged work on HIVE-23036:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 06/Jun/20 00:23
            Start Date: 06/Jun/20 00:23
    Worklog Time Spent: 10m 
      Work Description: github-actions[bot] closed pull request #956:
URL: https://github.com/apache/hive/pull/956


   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 442096)
    Time Spent: 0.5h  (was: 20m)

> Incorrect ORC PPD eval with sub-millisecond timestamps
> ------------------------------------------------------
>
>                 Key: HIVE-23036
>                 URL: https://issues.apache.org/jira/browse/HIVE-23036
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Panagiotis Garefalakis
>            Assignee: Panagiotis Garefalakis
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> See [ORC-611|https://issues.apache.org/jira/browse/ORC-611] for more details
> ORC stores timestamps with:
>  - nanosecond precision for the data itself
>  - milliseconds precision for min-max statistics
> As both min and max are rounded to the same value,  timestamps with ns precision will not pass the PPD evaluator.
> {code:java}
> create table tsstat (ts timestamp) stored as orc;
> insert into tsstat values ("1970-01-01 00:00:00.0005");
> select * from tsstat where ts = "1970-01-01 00:00:00.0005";
> -- returned 0 rows{code}
> ORC PPD evaluation currently happens as part of OrcInputFormat [https://github.com/apache/hive/blob/7e39a2c13711f9377c9ce1edb4224880421b1ea5/ql/src/java/org/apache/hadoop/hive/ql/io/orc/OrcInputFormat.java#L2314]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)