You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Victoria Markman (JIRA)" <ji...@apache.org> on 2015/11/10 23:26:11 UTC

[jira] [Updated] (DRILL-4060) CTAS to csv or json files gives incorrect time values

     [ https://issues.apache.org/jira/browse/DRILL-4060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Victoria Markman updated DRILL-4060:
------------------------------------
    Priority: Critical  (was: Major)

> CTAS to csv or json files gives incorrect time values
> -----------------------------------------------------
>
>                 Key: DRILL-4060
>                 URL: https://issues.apache.org/jira/browse/DRILL-4060
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Storage - Writer
>            Reporter: Krystal
>            Priority: Critical
>
> I have a csv file with the following data:
> select columns[8], columns[10] from `interval_data.csv`;
> +-----------+----------------------+
> |  EXPR$0   |        EXPR$1        |
> +-----------+----------------------+
> | 01:35:56  | 2015-01-24 07:27:05  |
> | 12:22:07  | 2014-05-25 03:41:54  |
> | 00:01:28  | 2014-07-30 08:03:11  |
> | 00:00:01  | 2014-09-15 02:33:11  |
> | 06:59:59  | 2014-06-17 13:04:09  |
> | 23:38:16  | 2015-02-01 02:02:37  |
> | 15:00:00  | 2014-08-16 13:11:12  |
> | 05:55:36  | 2014-04-13 15:06:36  |
> | 10:48:36  | 2013-03-23 01:24:20  |
> +-----------+----------------------+
> I created a json file using CTAS:
> alter session set `store.format` = 'json';
> create table `test1.json` as select cast(columns[8] as time) c_time,cast(columns[10] as timestamp) c_timestamp from `interval_data.csv`;
> select c_time, c_timestamp from `test1.json`;
> +---------------+--------------------------+
> |    c_time     |       c_timestamp        |
> +---------------+--------------------------+
> | 09:35:56.000  | 2015-01-24 15:27:05.000  |
> | 20:22:07.000  | 2014-05-25 10:41:54.000  |
> | 08:01:28.000  | 2014-07-30 15:03:11.000  |
> | 08:00:01.000  | 2014-09-15 09:33:11.000  |
> | 14:59:59.000  | 2014-06-17 20:04:09.000  |
> | 07:38:16.000  | 2015-02-01 10:02:37.000  |
> | 23:00:00.000  | 2014-08-16 20:11:12.000  |
> | 13:55:36.000  | 2014-04-13 22:06:36.000  |
> | 18:48:36.000  | 2013-03-23 08:24:20.000  |
> +---------------+--------------------------+
> Notice that the times have 8 hours added to the original values.
> I got the same result when create another CSV file using CTAS on the same data.
> For CTAS as parquet, however, the resulting data is the same as the orginal data. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)