You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@impala.apache.org by "Csaba Ringhofer (JIRA)" <ji...@apache.org> on 2019/02/11 18:22:00 UTC

[jira] [Created] (IMPALA-8184) Add timestamp validation to Orc scanner

Csaba Ringhofer created IMPALA-8184:
---------------------------------------

             Summary: Add timestamp validation to Orc scanner
                 Key: IMPALA-8184
                 URL: https://issues.apache.org/jira/browse/IMPALA-8184
             Project: IMPALA
          Issue Type: Bug
          Components: Backend
            Reporter: Csaba Ringhofer


Similarly to Parquet, Orc can also contain timestamps that are not valid in Impala, e.g. Hive can insert timestamps before 1400 while these are invalid in Impala. These invalid timestamps are often handled similarly to NULL, bur are actually not "real" NULLs, which can lead to some some weird behavior:

Hive:
create table orcts (ts timestamp) stored as orc;
insert into orcts values ("1200-01-01");

Impala:
select * from orcts where ts is not null;
Returns 1 row:
NULL



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)