You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Gopal V (JIRA)" <ji...@apache.org> on 2015/11/27 10:03:11 UTC
[jira] [Updated] (HIVE-12535) Dynamic Hash Join: Key references are
cyclic
[ https://issues.apache.org/jira/browse/HIVE-12535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Gopal V updated HIVE-12535:
---------------------------
Attachment: philz_26.txt
> Dynamic Hash Join: Key references are cyclic
> --------------------------------------------
>
> Key: HIVE-12535
> URL: https://issues.apache.org/jira/browse/HIVE-12535
> Project: Hive
> Issue Type: Bug
> Components: Query Planning
> Affects Versions: 2.0.0
> Reporter: Gopal V
> Attachments: philz_26.txt
>
>
> MAPJOIN_4227 is inside "Reducer 2", but refers back to "Reducer 2" in its keys.
> {code}
> | |<-Reducer 2 [SIMPLE_EDGE] vectorized, llap |
> | Reduce Output Operator [RS_4189] |
> | key expressions:_col0 (type: string), _col1 (type: int) |
> | Map-reduce partition columns:_col0 (type: string), _col1 (type: int) |
> | sort order:++ |
> | Statistics:Num rows: 83 Data size: 9213 Basic stats: COMPLETE Column stats: COMPLETE |
> | value expressions:_col2 (type: double) |
> | Group By Operator [OP_4229] |
> | aggregations:["sum(_col2)"] |
> | keys:_col0 (type: string), _col1 (type: int) |
> | outputColumnNames:["_col0","_col1","_col2"] |
> | Statistics:Num rows: 83 Data size: 9213 Basic stats: COMPLETE Column stats: COMPLETE |
> | Select Operator [OP_4228] |
> | outputColumnNames:["_col0","_col1","_col2"] |
> | Statistics:Num rows: 166 Data size: 26394 Basic stats: COMPLETE Column stats: COMPLETE |
> | Map Join Operator [MAPJOIN_4227] |
> | | condition map:[{"":"Inner Join 0 to 1"}] |
> | | keys:{"Reducer 2":"KEY.reducesinkkey0 (type: bigint), KEY.reducesinkkey1 (type: int), KEY.reducesinkkey2 (type: int)","Map 5":"KEY.reducesinkkey0 (type: bigint), KEY.reducesinkkey1 (type: int), KEY.reducesinkkey2 (type: int)"} |
> | | outputColumnNames:["_col1","_col3","_col5"] |
> | | Statistics:Num rows: 166 Data size: 26394 Basic stats: COMPLETE Column stats: COMPLETE |
> | |<-Map 5 [CUSTOM_SIMPLE_EDGE] vectorized, llap |
> | | Reduce Output Operator [RS_4226] |
> | | key expressions:_col1 (type: bigint), year(_col2) (type: int), month(_col2) (type: int) |
> | | Map-reduce partition columns:_col1 (type: bigint), year(_col2) (type: int), month(_col2) (type: int) |
> | | sort order:+++ |
> | | Statistics:Num rows: 74973886 Data size: 5098224248 Basic stats: COMPLETE Column stats: COMPLETE |
> | | value expressions:_col0 (type: float), _col2 (type: date) |
> | | Select Operator [OP_4225] |
> | | outputColumnNames:["_col0","_col1","_col2"] |
> | | Statistics:Num rows: 74973886 Data size: 5098224248 Basic stats: COMPLETE Column stats: COMPLETE |
> | | Filter Operator [FIL_4224] |
> | | predicate:((account_id is not null and month(effective_date) BETWEEN 4 AND 7) and month(effective_date) is not null) (type: boolean) |
> | | Statistics:Num rows: 74973886 Data size: 5098224248 Basic stats: COMPLETE Column stats: COMPLETE |
> | | TableScan [TS_4171] |
> | | alias:t |
> | | Statistics:Num rows: 149947772 Data size: 10196448496 Basic stats: COMPLETE Column stats: COMPLETE |
> | |<-Map 1 [CUSTOM_SIMPLE_EDGE] vectorized, llap |
> | Reduce Output Operator [RS_4223] |
> | key expressions:_col0 (type: bigint), year(_col2) (type: int), month(_col2) (type: int) |
> | Map-reduce partition columns:_col0 (type: bigint), year(_col2) (type: int), month(_col2) (type: int) |
> | sort order:+++ |
> | Statistics:Num rows: 50289673 Data size: 8197216699 Basic stats: COMPLETE Column stats: COMPLETE |
> | value expressions:_col1 (type: string) |
> | Map Join Operator [MAPJOIN_4222] |
> | | condition map:[{"":"Left Semi Join 0 to 1"}] |
> | | keys:{"Map 1":"_col1 (type: string)","Map 4":"_col0 (type: string)"} |
> | | outputColumnNames:["_col0","_col1","_col2"] |
> | | Statistics:Num rows: 50289673 Data size: 8197216699 Basic stats: COMPLETE Column stats: COMPLETE |
> | |<-Map 4 [BROADCAST_EDGE] vectorized, llap |
> | | Reduce Output Operator [RS_4179] |
> | | key expressions:_col0 (type: string) |
> | | Map-reduce partition columns:_col0 (type: string) |
> | | sort order:+ |
> | | Statistics:Num rows: 1 Data size: 99 Basic stats: COMPLETE Column stats: COMPLETE |
> | | Group By Operator [OP_4219] |
> | | keys:_col0 (type: string) |
> | | outputColumnNames:["_col0"] |
> | | Statistics:Num rows: 1 Data size: 99 Basic stats: COMPLETE Column stats: COMPLETE |
> | | Select Operator [OP_4218] |
> | | outputColumnNames:["_col0"] |
> | | Statistics:Num rows: 3 Data size: 297 Basic stats: COMPLETE Column stats: COMPLETE |
> | | Filter Operator [FIL_4217] |
> | | predicate:(account_type = 'order ahead') (type: boolean) |
> | | Statistics:Num rows: 3 Data size: 294 Basic stats: COMPLETE Column stats: COMPLETE |
> | | TableScan [TS_4168] |
> | | alias:at |
> | | Statistics:Num rows: 13 Data size: 1274 Basic stats: COMPLETE Column stats: COMPLETE |
> | |<-Select Operator [OP_4221] |
> | outputColumnNames:["_col0","_col1","_col2"] |
> | Statistics:Num rows: 50289673 Data size: 8197216699 Basic stats: COMPLETE Column stats: COMPLETE |
> | Filter Operator [FIL_4220] |
> | predicate:(((account_id is not null and (account_type = 'order ahead')) and year(effective_date) is not null) and month(effective_date) is not null) (type: boolean) |
> | Statistics:Num rows: 50289673 Data size: 8197216699 Basic stats: COMPLETE Column stats: COMPLETE |
> | TableScan [TS_4165] |
> | alias:a |
> | Statistics:Num rows: 201158695 Data size: 32788867285 Basic stats: COMPLETE Column stats: COMPLETE
> {code}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)