Posted to issues@hive.apache.org by "Nishant Bangarwa (JIRA)" <ji...@apache.org> on 2018/06/01 14:20:00 UTC

[jira] [Commented] (HIVE-19762) Druid Queries containing Joins gives wrong results.

    [ https://issues.apache.org/jira/browse/HIVE-19762?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16498051#comment-16498051 ] 

Nishant Bangarwa commented on HIVE-19762:
-----------------------------------------

+cc [~ashutoshc]

> Druid Queries containing Joins gives wrong results. 
> ----------------------------------------------------
>
>                 Key: HIVE-19762
>                 URL: https://issues.apache.org/jira/browse/HIVE-19762
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Nishant Bangarwa
>            Assignee: Nishant Bangarwa
>            Priority: Major
>         Attachments: HIVE-19762.patch
>
>
> Druid queries that join a table against itself give wrong results.
> e.g. 
> {code} 
> SELECT
>   username AS `username`,
>   SUM(double1) AS `sum_double1`
> FROM druid_table_with_nulls `tbl1`
>   JOIN (
>     SELECT
>       username AS `username`,
>       SUM(double1) AS `sum_double2`
>     FROM druid_table_with_nulls
>     GROUP BY `username`
>     ORDER BY `sum_double2` DESC
>     LIMIT 10
>   ) `tbl2`
>     ON (`tbl1`.`username` = `tbl2`.`username`)
> GROUP BY `tbl1`.`username`;
> {code} 
> In this case, one of the queries is a Druid scan query and the other is a groupBy query.
> During planning, the properties of these queries are set on the tableDesc and serdeInfo. While setting up the map work, we overwrite those properties with the properties present in serdeInfo. This causes the scan query results to be deserialized using the wrong column names, which results in NULL values.
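
The overwrite described above can be illustrated with a minimal sketch. This is not Hive code: the `deserialize` helper, the property dictionaries, and the sample row are all hypothetical, and the sketch only assumes that each query carries its own comma-separated column list and that a row is read by looking up those column names.

```python
# Hypothetical, simplified model of the bug; none of these names are Hive APIs.

def deserialize(row, properties):
    """Read a result row using the column names listed in the properties.

    Columns that are not present in the row come back as None (NULL).
    """
    columns = properties["columns"].split(",")
    return {col: row.get(col) for col in columns}


# Per-query properties as they might look after planning (assumed shape).
scan_props = {"columns": "username,double1"}
group_by_props = {"columns": "username,sum_double2"}

# A row as the scan query might return it.
scan_row = {"username": "alice", "double1": 1.5}

# Expected behaviour: the scan row is read with the scan query's columns.
correct = deserialize(scan_row, scan_props)

# Buggy behaviour: the scan query's properties are overwritten with the
# properties that belong to the other (groupBy) query.
overwritten = dict(scan_props)
overwritten.update(group_by_props)
wrong = deserialize(scan_row, overwritten)

print(correct)  # {'username': 'alice', 'double1': 1.5}
print(wrong)    # {'username': 'alice', 'sum_double2': None}
```

In the sketch, the row itself is intact; the NULL appears only because the deserializer looks for a column name that belongs to the other query.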



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)