You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "Csaba Ringhofer (Jira)" <ji...@apache.org> on 2019/12/05 13:27:00 UTC

[jira] [Work started] (IMPALA-8184) Add timestamp validation to Orc scanner

     [ https://issues.apache.org/jira/browse/IMPALA-8184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Work on IMPALA-8184 started by Csaba Ringhofer.
-----------------------------------------------
> Add timestamp validation to Orc scanner
> ---------------------------------------
>
>                 Key: IMPALA-8184
>                 URL: https://issues.apache.org/jira/browse/IMPALA-8184
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Backend
>            Reporter: Csaba Ringhofer
>            Assignee: Csaba Ringhofer
>            Priority: Critical
>
> Similarly to Parquet, Orc can also contain timestamps that are not valid in Impala, e.g. Hive can insert timestamps before 1400 while these are invalid in Impala. These invalid timestamps are often handled similarly to NULL, bur are actually not "real" NULLs, which can lead to some some weird behavior:
> Hive:
> create table orcts (ts timestamp) stored as orc;
> insert into orcts values ("1200-01-01");
> Impala:
> select * from orcts where ts is not null;
> Returns 1 row:
> NULL



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org