You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Nishant Bangarwa (JIRA)" <ji...@apache.org> on 2018/06/27 19:38:00 UTC

[jira] [Resolved] (HIVE-20014) Druid SECOND/HOUR/MINUTE does not return correct values when applied to String Columns

     [ https://issues.apache.org/jira/browse/HIVE-20014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Nishant Bangarwa resolved HIVE-20014.
-------------------------------------
    Resolution: Invalid

Verified that this is the same behavior with orc tables. 
This is an expected behavior change in 3.0

> Druid SECOND/HOUR/MINUTE does not return correct values when applied to String Columns
> --------------------------------------------------------------------------------------
>
>                 Key: HIVE-20014
>                 URL: https://issues.apache.org/jira/browse/HIVE-20014
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Nishant Bangarwa
>            Assignee: Nishant Bangarwa
>            Priority: Major
>
> Query SELECT  MINUTE(`time1`) FROM calcs; returns null when the String column only contains timestamp and does not contain any date information in the column. The Druid parser fails to parse the time string values and returns null. 
> {code} 
> 1: jdbc:hive2://ctr-e138-1518143905142-379982> SELECT  MINUTE(`time1`) FROM calcs;
> INFO  : Compiling command(queryId=hive_20180627145215_05147329-b8d8-491c-9bab-6fd5045542db): SELECT  MINUTE(`time1`) FROM calcs
> INFO  : Semantic Analysis Completed (retrial = false)
> INFO  : Returning Hive schema: Schema(fieldSchemas:[FieldSchema(name:vc, type:int, comment:null)], properties:null)
> INFO  : Completed compiling command(queryId=hive_20180627145215_05147329-b8d8-491c-9bab-6fd5045542db); Time taken: 0.134 seconds
> INFO  : Executing command(queryId=hive_20180627145215_05147329-b8d8-491c-9bab-6fd5045542db): SELECT  MINUTE(`time1`) FROM calcs
> INFO  : Completed executing command(queryId=hive_20180627145215_05147329-b8d8-491c-9bab-6fd5045542db); Time taken: 0.002 seconds
> INFO  : OK
> +-------+
> |  vc   |
> +-------+
> | NULL  |
> | NULL  |
> | NULL  |
> | NULL  |
> | NULL  |
> | NULL  |
> | NULL  |
> | NULL  |
> | NULL  |
> | NULL  |
> | NULL  |
> | NULL  |
> | NULL  |
> | NULL  |
> | NULL  |
> | NULL  |
> | NULL  |
> +-------+
> 17 rows selected (0.266 seconds)
> 1: jdbc:hive2://ctr-e138-1518143905142-379982> SELECT time1 from calcs;
> INFO  : Compiling command(queryId=hive_20180627145225_93b872de-a698-4859-9730-983eede6935d): SELECT time1 from calcs
> INFO  : Semantic Analysis Completed (retrial = false)
> INFO  : Returning Hive schema: Schema(fieldSchemas:[FieldSchema(name:time1, type:string, comment:null)], properties:null)
> INFO  : Completed compiling command(queryId=hive_20180627145225_93b872de-a698-4859-9730-983eede6935d); Time taken: 0.116 seconds
> INFO  : Executing command(queryId=hive_20180627145225_93b872de-a698-4859-9730-983eede6935d): SELECT time1 from calcs
> INFO  : Completed executing command(queryId=hive_20180627145225_93b872de-a698-4859-9730-983eede6935d); Time taken: 0.003 seconds
> INFO  : OK
> +-----------+
> |   time1   |
> +-----------+
> | 22:20:14  |
> | 22:50:16  |
> | 19:36:22  |
> | 19:48:23  |
> | 00:05:57  |
> | NULL      |
> | 04:48:07  |
> | NULL      |
> | 19:57:33  |
> | NULL      |
> | 04:40:49  |
> | 02:05:25  |
> | NULL      |
> | NULL      |
> | 12:33:57  |
> | 18:58:41  |
> | 09:33:31  |
> +-----------+
> 17 rows selected (0.202 seconds)
> 1: jdbc:hive2://ctr-e138-1518143905142-379982> EXPLAIN SELECT  MINUTE(`time1`) FROM calcs;
> INFO  : Compiling command(queryId=hive_20180627145237_39e53a7e-35cb-4e17-8ccb-884c6f6358cd): EXPLAIN SELECT  MINUTE(`time1`) FROM calcs
> INFO  : Semantic Analysis Completed (retrial = false)
> INFO  : Returning Hive schema: Schema(fieldSchemas:[FieldSchema(name:Explain, type:string, comment:null)], properties:null)
> INFO  : Completed compiling command(queryId=hive_20180627145237_39e53a7e-35cb-4e17-8ccb-884c6f6358cd); Time taken: 0.107 seconds
> INFO  : Executing command(queryId=hive_20180627145237_39e53a7e-35cb-4e17-8ccb-884c6f6358cd): EXPLAIN SELECT  MINUTE(`time1`) FROM calcs
> INFO  : Starting task [Stage-1:EXPLAIN] in serial mode
> INFO  : Completed executing command(queryId=hive_20180627145237_39e53a7e-35cb-4e17-8ccb-884c6f6358cd); Time taken: 0.003 seconds
> INFO  : OK
> +----------------------------------------------------+
> |                      Explain                       |
> +----------------------------------------------------+
> | Plan optimized by CBO.                             |
> |                                                    |
> | Stage-0                                            |
> |   Fetch Operator                                   |
> |     limit:-1                                       |
> |     Select Operator [SEL_1]                        |
> |       Output:["_col0"]                             |
> |       TableScan [TS_0]                             |
> |         Output:["vc"],properties:{"druid.fieldNames":"vc","druid.fieldTypes":"int","druid.query.json":"{\"queryType\":\"scan\",\"dataSource\":\"druid_tableau.calcs\",\"intervals\":[\"1900-01-01T00:00:00.000Z/3000-01-01T00:00:00.000Z\"],\"virtualColumns\":[{\"type\":\"expression\",\"name\":\"vc\",\"expression\":\"timestamp_extract(timestamp_parse(\\\"time1\\\",null,'UTC'),'MINUTE','UTC')\",\"outputType\":\"LONG\"}],\"columns\":[\"vc\"],\"resultFormat\":\"compactedList\"}","druid.query.type":"scan"} |
> |                                                    |
> +----------------------------------------------------+
> 10 rows selected (0.136 seconds)
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)