Posted to dev@hive.apache.org by "Nicholas Brenwald (JIRA)" <ji...@apache.org> on 2015/07/30 13:27:04 UTC

[jira] [Created] (HIVE-11410) Join with subquery containing a group by incorrectly returns no results

Nicholas Brenwald created HIVE-11410:
----------------------------------------

             Summary: Join with subquery containing a group by incorrectly returns no results
                 Key: HIVE-11410
                 URL: https://issues.apache.org/jira/browse/HIVE-11410
             Project: Hive
          Issue Type: Bug
          Components: Hive
    Affects Versions: 1.1.0
            Reporter: Nicholas Brenwald
            Priority: Minor


Start by creating a table *t* with columns *c1* and *c2* and populating it with one row of data. For example, create *t* from an existing table *Y* that contains at least one row by running:
{code}
create table t as select 'abc' as c1, 0 as c2 from Y limit 1; 
{code}

Table *t* looks like the following:
||c1||c2||
|abc|0|

Running the following query against *t* returns zero rows:
{code}
SELECT 
  t1.c1
FROM 
  t t1
JOIN
(SELECT 
   t2.c1,
   MAX(t2.c2) AS c2
 FROM 
   t t2 
 GROUP BY 
   t2.c1
) t3
ON t1.c2=t3.c2
{code}

However, we expected to see the following:
||c1||
|abc|
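
As a sanity check (a sketch, not part of the original reproduction), the subquery can be run on its own. With the single row in *t*, standard SQL semantics give one row (*abc*, 0), so the join on *c2* should find a match:
{code}
-- Subquery from the failing query, run standalone against table t
SELECT
  t2.c1,
  MAX(t2.c2) AS c2
FROM
  t t2
GROUP BY
  t2.c1;
{code}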

The problem appears to be that the subquery groups by column *c1*, but *c1* is not then used in the join condition.
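
One way to test this hypothesis is to also join on *c1*, so that the grouped column appears in the join condition. This is an untested sketch: the extra predicate is an assumption added for diagnosis, not a confirmed workaround.
{code}
-- Same query as above, with the grouped column c1 added to the join condition
SELECT
  t1.c1
FROM
  t t1
JOIN
(SELECT
   t2.c1,
   MAX(t2.c2) AS c2
 FROM
   t t2
 GROUP BY
   t2.c1
) t3
ON t1.c2 = t3.c2 AND t1.c1 = t3.c1;
{code}
If this variant returns *abc* while the original query returns nothing, that would support the explanation above.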
