Posted to issues@impala.apache.org by "Csaba Ringhofer (JIRA)" <ji...@apache.org> on 2019/02/11 18:22:00 UTC
[jira] [Created] (IMPALA-8184) Add timestamp validation to Orc scanner
Csaba Ringhofer created IMPALA-8184:
---------------------------------------
Summary: Add timestamp validation to Orc scanner
Key: IMPALA-8184
URL: https://issues.apache.org/jira/browse/IMPALA-8184
Project: IMPALA
Issue Type: Bug
Components: Backend
Reporter: Csaba Ringhofer
As with Parquet, Orc files can contain timestamps that are not valid in Impala. For example, Hive can insert timestamps before the year 1400, which Impala treats as invalid. These invalid timestamps are often handled similarly to NULL, but are not actually "real" NULLs, which can lead to some weird behavior:
Hive:
create table orcts (ts timestamp) stored as orc;
insert into orcts values ("1200-01-01");
Impala:
select * from orcts where ts is not null;
Returns 1 row:
NULL
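
The validation asked for here can be sketched roughly as follows. This is only an illustration in Python, not the Impala backend code: the names validate_timestamp and scan_not_null are hypothetical, and the bounds assume Impala's documented TIMESTAMP range of 1400-01-01 to 9999-12-31. The idea is to turn an out-of-range value into a real NULL at scan time, so that "ts is not null" filters it out.

```python
from datetime import datetime
from typing import Iterable, List, Optional

# Assumed bounds: Impala's documented TIMESTAMP range.
MIN_VALID = datetime(1400, 1, 1)
MAX_VALID = datetime(9999, 12, 31, 23, 59, 59, 999999)


def validate_timestamp(ts: Optional[datetime]) -> Optional[datetime]:
    """Return ts if it is inside the valid range, otherwise a real NULL (None)."""
    if ts is None:
        return None
    if ts < MIN_VALID or ts > MAX_VALID:
        # Out-of-range value: convert to a real NULL instead of passing
        # along something that is only displayed as NULL.
        return None
    return ts


def scan_not_null(values: Iterable[Optional[datetime]]) -> List[datetime]:
    """Hypothetical stand-in for 'select * from orcts where ts is not null'."""
    validated = (validate_timestamp(v) for v in values)
    return [v for v in validated if v is not None]


# The row inserted by Hive in the example above.
rows = [datetime(1200, 1, 1)]
print(scan_not_null(rows))
```

With validation applied before the predicate is evaluated, the row from the example would no longer be returned by the "is not null" query.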
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)