Posted to dev@hive.apache.org by "Szehon Ho (JIRA)" <ji...@apache.org> on 2014/11/26 04:34:12 UTC
[jira] [Updated] (HIVE-8924) Investigate test failure for join_empty.q [Spark Branch]
[ https://issues.apache.org/jira/browse/HIVE-8924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Szehon Ho updated HIVE-8924:
----------------------------
Status: Patch Available (was: Open)
> Investigate test failure for join_empty.q [Spark Branch]
> --------------------------------------------------------
>
> Key: HIVE-8924
> URL: https://issues.apache.org/jira/browse/HIVE-8924
> Project: Hive
> Issue Type: Sub-task
> Components: Spark
> Affects Versions: spark-branch
> Reporter: Chao
> Assignee: Szehon Ho
> Attachments: HIVE-8924-spark.patch
>
>
> This query has an interesting case where the big table work is empty. Here's the MR plan:
> {noformat}
> STAGE DEPENDENCIES:
>   Stage-4 is a root stage
>   Stage-3 depends on stages: Stage-4
>   Stage-0 depends on stages: Stage-3
> STAGE PLANS:
>   Stage: Stage-4
>     Map Reduce Local Work
>       Alias -> Map Local Tables:
>         b
>           Fetch Operator
>             limit: -1
>       Alias -> Map Local Operator Tree:
>         b
>           TableScan
>             alias: b
>             Statistics: Num rows: 29 Data size: 5812 Basic stats: COMPLETE Column stats: NONE
>             Filter Operator
>               predicate: UDFToDouble(key) is not null (type: boolean)
>               Statistics: Num rows: 15 Data size: 3006 Basic stats: COMPLETE Column stats: NONE
>               HashTable Sink Operator
>                 condition expressions:
>                   0 {key}
>                   1 {value}
>                 keys:
>                   0 UDFToDouble(key) (type: double)
>                   1 UDFToDouble(key) (type: double)
>   Stage: Stage-3
>     Map Reduce
>       Local Work:
>         Map Reduce Local Work
>   Stage: Stage-0
>     Fetch Operator
>       limit: -1
>       Processor Tree:
>         ListSink
> {noformat}
> The plan for Spark is not correct. We need to investigate the issue.
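To make the "big table work is empty" case concrete, here is a minimal Python sketch of a map join in general. It is not Hive or Spark code: the function names (`to_double`, `build_hash_table`, `map_join`), the row layout, and the sample data are all hypothetical. The build side mirrors the `UDFToDouble(key) is not null` filter from the plan above, and the probe side streams the big table. With an empty big table the hash table is still built, but the join produces no rows.

```python
from collections import defaultdict


def to_double(value):
    """Hypothetical stand-in for a cast to double: None when the cast fails."""
    try:
        return float(value)
    except (TypeError, ValueError):
        return None


def build_hash_table(small_rows):
    """Build side: index small-table rows by join key, dropping null keys."""
    table = defaultdict(list)
    for row in small_rows:
        k = to_double(row.get("key"))
        if k is not None:  # analogous to the 'is not null' filter in the plan
            table[k].append(row)
    return table


def map_join(big_rows, hash_table):
    """Probe side: stream big-table rows and look each one up in the hash table."""
    for big_row in big_rows:
        k = to_double(big_row.get("key"))
        if k is None:
            continue
        for small_row in hash_table.get(k, []):
            yield (big_row, small_row)


# Hypothetical data: the small table has rows, the big table has none.
small_table = [{"key": "1", "value": "a"}, {"key": None, "value": "b"}]
hash_table = build_hash_table(small_table)

empty_result = list(map_join([], hash_table))
nonempty_result = list(map_join([{"key": "1.0", "value": "x"}], hash_table))
```

In this sketch the empty probe side is handled naturally because the loop body never runs. An execution plan that assumes the big-table side always contributes operators has to treat this case explicitly.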
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)