You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Navis (JIRA)" <ji...@apache.org> on 2014/03/05 08:11:42 UTC

[jira] [Updated] (HIVE-6551) group by after join with skew join optimization references invalid task sometimes

     [ https://issues.apache.org/jira/browse/HIVE-6551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Navis updated HIVE-6551:
------------------------

    Status: Patch Available  (was: Open)

> group by after join with skew join optimization references invalid task sometimes
> ---------------------------------------------------------------------------------
>
>                 Key: HIVE-6551
>                 URL: https://issues.apache.org/jira/browse/HIVE-6551
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Navis
>            Assignee: Navis
>            Priority: Trivial
>         Attachments: HIVE-6551.1.patch.txt
>
>
> For example,
> {noformat}
> hive> set hive.auto.convert.join = true;
> hive> set hive.optimize.skewjoin = true;
> hive> set hive.skewjoin.key = 3;
> hive> 
>     > EXPLAIN FROM 
>     > (SELECT src.* FROM src) x
>     > JOIN 
>     > (SELECT src.* FROM src) Y
>     > ON (x.key = Y.key)
>     > SELECT sum(hash(Y.key)), sum(hash(Y.value));
> OK
> STAGE DEPENDENCIES:
>   Stage-8 is a root stage
>   Stage-6 depends on stages: Stage-8
>   Stage-5 depends on stages: Stage-6 , consists of Stage-4, Stage-2
>   Stage-4
>   Stage-2 depends on stages: Stage-4, Stage-1
>   Stage-0 is a root stage
> ...
> {noformat}
> Stage-2 references not-existing Stage-1



--
This message was sent by Atlassian JIRA
(v6.2#6252)