You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Krystal (JIRA)" <ji...@apache.org> on 2015/11/10 20:41:11 UTC

[jira] [Created] (DRILL-4060) CTAS to csv or json files gives incorrect time values

Krystal created DRILL-4060:
------------------------------

             Summary: CTAS to csv or json files gives incorrect time values
                 Key: DRILL-4060
                 URL: https://issues.apache.org/jira/browse/DRILL-4060
             Project: Apache Drill
          Issue Type: Bug
          Components: Storage - Writer
            Reporter: Krystal


I have a csv file with the following data:
select columns[8], columns[10] from `interval_data.csv`;
+-----------+----------------------+
|  EXPR$0   |        EXPR$1        |
+-----------+----------------------+
| 01:35:56  | 2015-01-24 07:27:05  |
| 12:22:07  | 2014-05-25 03:41:54  |
| 00:01:28  | 2014-07-30 08:03:11  |
| 00:00:01  | 2014-09-15 02:33:11  |
| 06:59:59  | 2014-06-17 13:04:09  |
| 23:38:16  | 2015-02-01 02:02:37  |
| 15:00:00  | 2014-08-16 13:11:12  |
| 05:55:36  | 2014-04-13 15:06:36  |
| 10:48:36  | 2013-03-23 01:24:20  |
+-----------+----------------------+

I created a json file using CTAS:
alter session set `store.format` = 'json';
create table `test1.json` as select cast(columns[8] as time) c_time,cast(columns[10] as timestamp) c_timestamp from `interval_data.csv`;

select c_time, c_timestamp from `test1.json`;
+---------------+--------------------------+
|    c_time     |       c_timestamp        |
+---------------+--------------------------+
| 09:35:56.000  | 2015-01-24 15:27:05.000  |
| 20:22:07.000  | 2014-05-25 10:41:54.000  |
| 08:01:28.000  | 2014-07-30 15:03:11.000  |
| 08:00:01.000  | 2014-09-15 09:33:11.000  |
| 14:59:59.000  | 2014-06-17 20:04:09.000  |
| 07:38:16.000  | 2015-02-01 10:02:37.000  |
| 23:00:00.000  | 2014-08-16 20:11:12.000  |
| 13:55:36.000  | 2014-04-13 22:06:36.000  |
| 18:48:36.000  | 2013-03-23 08:24:20.000  |
+---------------+--------------------------+

Notice that the times have 8 hours added to the original values.
I got the same result when create another CSV file using CTAS on the same data.
For CTAS as parquet, however, the resulting data is the same as the orginal data. 




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)