You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Navis (JIRA)" <ji...@apache.org> on 2014/03/05 06:09:42 UTC

[jira] [Created] (HIVE-6551) group by after join with skew join optimization references invalid task sometimes

Navis created HIVE-6551:
---------------------------

             Summary: group by after join with skew join optimization references invalid task sometimes
                 Key: HIVE-6551
                 URL: https://issues.apache.org/jira/browse/HIVE-6551
             Project: Hive
          Issue Type: Bug
            Reporter: Navis
            Assignee: Navis
            Priority: Trivial


For example,
{noformat}
hive> set hive.auto.convert.join = true;
hive> set hive.optimize.skewjoin = true;
hive> set hive.skewjoin.key = 3;
hive> 
    > EXPLAIN FROM 
    > (SELECT src.* FROM src) x
    > JOIN 
    > (SELECT src.* FROM src) Y
    > ON (x.key = Y.key)
    > SELECT sum(hash(Y.key)), sum(hash(Y.value));
OK
STAGE DEPENDENCIES:
  Stage-8 is a root stage
  Stage-6 depends on stages: Stage-8
  Stage-5 depends on stages: Stage-6 , consists of Stage-4, Stage-2
  Stage-4
  Stage-2 depends on stages: Stage-4, Stage-1
  Stage-0 is a root stage
...
{noformat}

Stage-2 references not-existing Stage-1



--
This message was sent by Atlassian JIRA
(v6.2#6252)