You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@hive.apache.org by "Szehon Ho (JIRA)" <ji...@apache.org> on 2014/11/26 04:34:12 UTC

[jira] [Updated] (HIVE-8924) Investigate test failure for join_empty.q [Spark Branch]

     [ https://issues.apache.org/jira/browse/HIVE-8924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Szehon Ho updated HIVE-8924:
----------------------------
    Status: Patch Available  (was: Open)

> Investigate test failure for join_empty.q [Spark Branch]
> --------------------------------------------------------
>
>                 Key: HIVE-8924
>                 URL: https://issues.apache.org/jira/browse/HIVE-8924
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Spark
>    Affects Versions: spark-branch
>            Reporter: Chao
>            Assignee: Szehon Ho
>         Attachments: HIVE-8924-spark.patch
>
>
> This query has an interesting case where the big table work is empty. Here's the MR plan:
> {noformat}
> STAGE DEPENDENCIES:
>   Stage-4 is a root stage
>   Stage-3 depends on stages: Stage-4
>   Stage-0 depends on stages: Stage-3
> STAGE PLANS:
>   Stage: Stage-4
>     Map Reduce Local Work
>       Alias -> Map Local Tables:
>         b 
>           Fetch Operator
>             limit: -1
>       Alias -> Map Local Operator Tree:
>         b 
>           TableScan
>             alias: b
>             Statistics: Num rows: 29 Data size: 5812 Basic stats: COMPLETE Column stats: NONE
>             Filter Operator
>               predicate: UDFToDouble(key) is not null (type: boolean)
>               Statistics: Num rows: 15 Data size: 3006 Basic stats: COMPLETE Column stats: NONE
>               HashTable Sink Operator
>                 condition expressions:
>                   0 {key}
>                   1 {value}
>                 keys:
>                   0 UDFToDouble(key) (type: double)
>                   1 UDFToDouble(key) (type: double)
>   Stage: Stage-3
>     Map Reduce
>       Local Work:
>         Map Reduce Local Work
>   Stage: Stage-0
>     Fetch Operator
>       limit: -1
>       Processor Tree:
>         ListSink
> {noformat}
> The plan for Spark is not correct. We need to investigate the issue.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)