You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@impala.apache.org by "Csaba Ringhofer (JIRA)" <ji...@apache.org> on 2017/09/15 16:36:00 UTC

[jira] [Created] (IMPALA-5942) Dateless timestamps (e.g. "10:00:00") are handled inconsistently

Csaba Ringhofer created IMPALA-5942:
---------------------------------------

             Summary: Dateless timestamps (e.g. "10:00:00") are handled inconsistently 
                 Key: IMPALA-5942
                 URL: https://issues.apache.org/jira/browse/IMPALA-5942
             Project: IMPALA
          Issue Type: Bug
            Reporter: Csaba Ringhofer


Impala cannot read back these timestamps from Parquet, while it  can read them back from textfiles.

According to https://www.cloudera.com/documentation/enterprise/5-9-x/topics/impala_timestamp.html , Impala should be able to handle these values somehow.

{code}

text:

create table TT1 (t timestamp);
insert into TT1 (t) values ("10:00:00");
select * from TT1;
+----------+
| t        |
+----------+
| 10:00:00 |
+----------+

parquet:

create table TT2(t timestamp) STORED AS PARQUET;
insert into TT2 (t) values ("10:00:00");
select * from TT2;
+------+
| t    |
+------+
| NULL |
+------+
WARNINGS: Parquet file 'hdfs://localhost:20500/test-warehouse/tt2/714d741212df3180-cd4e670800000000_226739479_data.0.parq' column 't' contains an out of range timestamp. The valid date range is 1400-01-01..9999-12-31.
{code}

I think that this is a side effect of the fix of IMPALA-4363, but I did not check what happens in versions that did not contain this fix.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)