You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@tez.apache.org by "Rajesh Balamohan (JIRA)" <ji...@apache.org> on 2014/10/27 02:13:33 UTC

[jira] [Commented] (TEZ-1702) Hive : With Auto reduce parallelism enabled TPC-DS query 31 gets stuck in Reducer 12

    [ https://issues.apache.org/jira/browse/TEZ-1702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14184728#comment-14184728 ] 

Rajesh Balamohan commented on TEZ-1702:
---------------------------------------

Can you please upload the AM container log?  

> Hive : With Auto reduce parallelism enabled TPC-DS query 31 gets stuck in Reducer 12 
> -------------------------------------------------------------------------------------
>
>                 Key: TEZ-1702
>                 URL: https://issues.apache.org/jira/browse/TEZ-1702
>             Project: Apache Tez
>          Issue Type: Bug
>    Affects Versions: 0.5.2
>            Reporter: Mostafa Mokhtar
>            Priority: Critical
>             Fix For: 0.5.2
>
>         Attachments: Logs for container_1414029100044_0150_01_000001.zip
>
>
> Issue found in branch-0.5 , with latest commit as 
> {code}
> commit 2e65de88af709d30207403fea881b697a4853dd6
> Author: Bikas Saha <bi...@apache.org>
> Date:   Tue Oct 21 14:59:56 2014 -0700
> {code}
> Running TPC-DS Query 31 with Auto reduce parallelism enabled the query gets stuck in Reducer 12 
> Call Stack for stuck thread
> {code}
> Thread 14575: (state = BLOCKED)
>  - sun.misc.Unsafe.park(boolean, long) @bci=0 (Interpreted frame)
>  - java.util.concurrent.locks.LockSupport.park(java.lang.Object) @bci=14, line=186 (Interpreted frame)
>  - java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.await() @bci=42, line=2043 (Interpreted frame)
>  - java.util.concurrent.LinkedBlockingQueue.take() @bci=29, line=442 (Interpreted frame)
>  - org.apache.tez.runtime.library.shuffle.common.impl.ShuffleManager.getNextInput() @bci=67, line=663 (Interpreted frame)
>  - org.apache.tez.runtime.library.common.readers.UnorderedKVReader.moveToNextInput() @bci=26, line=176 (Interpreted frame)
>  - org.apache.tez.runtime.library.common.readers.UnorderedKVReader.next() @bci=30, line=117 (Interpreted frame)
>  - org.apache.hadoop.hive.ql.exec.tez.HashTableLoader.load(org.apache.hadoop.hive.ql.exec.persistence.MapJoinTableContainer[], org.apache.hadoop.hive.ql.exec.persistence.MapJoinTableContainerSerDe[]) @bci=259, line=112 (Compiled frame)
>  - org.apache.hadoop.hive.ql.exec.MapJoinOperator.loadHashTable() @bci=86, line=190 (Interpreted frame)
>  - org.apache.hadoop.hive.ql.exec.MapJoinOperator.processOp(java.lang.Object, int) @bci=12, line=244 (Interpreted frame)
>  - org.apache.hadoop.hive.ql.exec.Operator.forward(java.lang.Object, org.apache.hadoop.hive.serde2.objectinspector.ObjectInspector) @bci=63, line=815 (Interpreted frame)
>  - org.apache.hadoop.hive.ql.exec.SelectOperator.processOp(java.lang.Object, int) @bci=121, line=84 (Interpreted frame)
>  - org.apache.hadoop.hive.ql.exec.Operator.forward(java.lang.Object, org.apache.hadoop.hive.serde2.objectinspector.ObjectInspector) @bci=63, line=815 (Interpreted frame)
>  - org.apache.hadoop.hive.ql.exec.GroupByOperator.forward(java.lang.Object[], org.apache.hadoop.hive.ql.udf.generic.GenericUDAFEvaluator$AggregationBuffer[]) @bci=97, line=1072 (Interpreted frame)
>  - org.apache.hadoop.hive.ql.exec.GroupByOperator.processAggr(java.lang.Object, org.apache.hadoop.hive.serde2.objectinspector.ObjectInspector, org.apache.hadoop.hive.ql.exec.KeyWrapper) @bci=71, line=881 (Interpreted frame)
>  - org.apache.hadoop.hive.ql.exec.GroupByOperator.processKey(java.lang.Object, org.apache.hadoop.hive.serde2.objectinspector.ObjectInspector) @bci=34, line=741 (Interpreted frame)
>                                                                                                                                                                                          222,0-1       79%
>  - org.apache.hadoop.hive.ql.exec.GroupByOperator.processAggr(java.lang.Object, org.apache.hadoop.hive.serde2.objectinspector.ObjectInspector, org.apache.hadoop.hive.ql.exec.KeyWrapper) @bci=71, line=881 (Interpreted frame)
>  - org.apache.hadoop.hive.ql.exec.GroupByOperator.processKey(java.lang.Object, org.apache.hadoop.hive.serde2.objectinspector.ObjectInspector) @bci=34, line=741 (Interpreted frame)
>  - org.apache.hadoop.hive.ql.exec.GroupByOperator.processOp(java.lang.Object, int) @bci=457, line=809 (Interpreted frame)
>  - org.apache.hadoop.hive.ql.exec.tez.ReduceRecordSource.processKeyValues(java.lang.Iterable, byte) @bci=174, line=308 (Interpreted frame)
>  - org.apache.hadoop.hive.ql.exec.tez.ReduceRecordSource.pushRecord() @bci=218, line=252 (Interpreted frame)
>  - org.apache.hadoop.hive.ql.exec.tez.ReduceRecordProcessor.run() @bci=155, line=168 (Interpreted frame)
>  - org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(java.util.Map, java.util.Map) @bci=224, line=163 (Interpreted frame)
>  - org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(java.util.Map, java.util.Map) @bci=86, line=138 (Interpreted frame)
>  - org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run() @bci=76, line=324 (Interpreted frame)
>  - org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run() @bci=119, line=180 (Interpreted frame)
>  - org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run() @bci=1, line=172 (Interpreted frame)
>  - java.security.AccessController.doPrivileged(java.security.PrivilegedExceptionAction, java.security.AccessControlContext) @bci=0 (Interpreted frame)
>  - javax.security.auth.Subject.doAs(javax.security.auth.Subject, java.security.PrivilegedExceptionAction) @bci=42, line=415 (Interpreted frame)
>  - org.apache.hadoop.security.UserGroupInformation.doAs(java.security.PrivilegedExceptionAction) @bci=14, line=1548 (Interpreted frame)
>  - org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.call() @bci=15, line=172 (Interpreted frame)
>  - org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.call() @bci=1, line=167 (Interpreted frame)
>  - java.util.concurrent.FutureTask.run() @bci=42, line=262 (Interpreted frame)
>  - java.util.concurrent.ThreadPoolExecutor.runWorker(java.util.concurrent.ThreadPoolExecutor$Worker) @bci=95, line=1145 (Interpreted frame)
>  - java.util.concurrent.ThreadPoolExecutor$Worker.run() @bci=5, line=615 (Interpreted frame)
>  - java.lang.Thread.run() @bci=11, line=744 (Interpreted frame)
> {code}
> Plan 
> {code}
> STAGE DEPENDENCIES:
>   Stage-1 is a root stage
>   Stage-0 depends on stages: Stage-1
> STAGE PLANS:
>   Stage: Stage-1
>     Tez
>       Edges:
>         Map 1 <- Map 10 (BROADCAST_EDGE), Map 9 (BROADCAST_EDGE)
>         Map 11 <- Map 4 (BROADCAST_EDGE), Map 7 (BROADCAST_EDGE)
>         Map 14 <- Map 22 (BROADCAST_EDGE), Map 8 (BROADCAST_EDGE)
>         Map 16 <- Map 25 (BROADCAST_EDGE), Map 6 (BROADCAST_EDGE)
>         Map 18 <- Map 24 (BROADCAST_EDGE), Map 5 (BROADCAST_EDGE)
>         Map 20 <- Map 23 (BROADCAST_EDGE), Map 3 (BROADCAST_EDGE)
>         Reducer 12 <- Map 11 (SIMPLE_EDGE), Reducer 15 (BROADCAST_EDGE), Reducer 17 (BROADCAST_EDGE), Reducer 19 (BROADCAST_EDGE), Reducer 2 (BROADCAST_EDGE), Reducer 21 (BROADCAST_EDGE)
>         Reducer 13 <- Reducer 12 (SIMPLE_EDGE)
>         Reducer 15 <- Map 14 (SIMPLE_EDGE)
>         Reducer 17 <- Map 16 (SIMPLE_EDGE)
>         Reducer 19 <- Map 18 (SIMPLE_EDGE)
>         Reducer 2 <- Map 1 (SIMPLE_EDGE)
>         Reducer 21 <- Map 20 (SIMPLE_EDGE)
>       DagName: mmokhtar_20141026170505_6a5353ab-9f03-4d19-8ed3-ef15fbb35294:1
>       Vertices:
>         Map 1 
>             Map Operator Tree:
>                 TableScan
>                   alias: store_sales
>                   filterExpr: ss_addr_sk is not null (type: boolean)
>                   Statistics: Num rows: 550076554 Data size: 47370018896 Basic stats: COMPLETE Column stats: COMPLETE
>                   Filter Operator
>                     predicate: ss_addr_sk is not null (type: boolean)
>                     Statistics: Num rows: 537118738 Data size: 6344204488 Basic stats: COMPLETE Column stats: COMPLETE
>                     Select Operator
>                       expressions: ss_addr_sk (type: int), ss_ext_sales_price (type: float), ss_sold_date_sk (type: int)
>                       outputColumnNames: _col0, _col1, _col2
>                       Statistics: Num rows: 537118738 Data size: 6344204488 Basic stats: COMPLETE Column stats: COMPLETE
>                       Map Join Operator
>                         condition map:
>                              Inner Join 0 to 1
>                         condition expressions:
>                           0 {_col0} {_col1}
>                           1 
>                         keys:
>                           0 _col2 (type: int)
>                           1 _col0 (type: int)
>                         outputColumnNames: _col0, _col1
>                         input vertices:
>                           1 Map 9
>                         Statistics: Num rows: 600476053 Data size: 4803808424 Basic stats: COMPLETE Column stats: COMPLETE
>                         Map Join Operator
>                           condition map:
>                                Inner Join 0 to 1
>                           condition expressions:
>                             0 {_col1}
>                             1 {_col1}
>                           keys:
>                             0 _col0 (type: int)
>                             1 _col0 (type: int)
>                           outputColumnNames: _col1, _col7
>                           input vertices:
>                             1 Map 10
>                           Statistics: Num rows: 600476032 Data size: 58846651136 Basic stats: COMPLETE Column stats: COMPLETE
>                           Select Operator
>                             expressions: _col7 (type: string), 1 (type: int), 1998 (type: int), _col1 (type: float)
>                             outputColumnNames: _col0, _col1, _col2, _col3
>                             Statistics: Num rows: 600476032 Data size: 58846651136 Basic stats: COMPLETE Column stats: COMPLETE
>                             Group By Operator
>                               aggregations: sum(_col3)
>                               keys: _col0 (type: string), _col1 (type: int), _col2 (type: int)
>                               mode: hash
>                               outputColumnNames: _col0, _col1, _col2, _col3
>                               Statistics: Num rows: 589 Data size: 4712 Basic stats: COMPLETE Column stats: COMPLETE
>                               Reduce Output Operator
>                                 key expressions: _col0 (type: string), _col1 (type: int), _col2 (type: int)
>                                 sort order: +++
>                                 Map-reduce partition columns: _col0 (type: string), _col1 (type: int), _col2 (type: int)
>                                 Statistics: Num rows: 589 Data size: 4712 Basic stats: COMPLETE Column stats: COMPLETE
>                                 value expressions: _col3 (type: double)
>             Execution mode: vectorized
>         Map 10 
>             Map Operator Tree:
>                 TableScan
>                   alias: customer_address
>                   filterExpr: (ca_address_sk is not null and ca_county is not null) (type: boolean)
>                   Statistics: Num rows: 800000 Data size: 811903688 Basic stats: COMPLETE Column stats: COMPLETE
>                   Filter Operator
>                     predicate: (ca_address_sk is not null and ca_county is not null) (type: boolean)
>                     Statistics: Num rows: 800000 Data size: 81600000 Basic stats: COMPLETE Column stats: COMPLETE
>                     Select Operator
>                       expressions: ca_address_sk (type: int), ca_county (type: string)
>                       outputColumnNames: _col0, _col1
>                       Statistics: Num rows: 800000 Data size: 81600000 Basic stats: COMPLETE Column stats: COMPLETE
>                       Reduce Output Operator
>                         key expressions: _col0 (type: int)
>                         sort order: +
>                         Map-reduce partition columns: _col0 (type: int)
>                         Statistics: Num rows: 800000 Data size: 81600000 Basic stats: COMPLETE Column stats: COMPLETE
>                         value expressions: _col1 (type: string)
>             Execution mode: vectorized
>         Map 11 
>             Map Operator Tree:
>                 TableScan
>                   alias: store_sales
>                   filterExpr: ss_addr_sk is not null (type: boolean)
>                   Statistics: Num rows: 550076554 Data size: 47370018896 Basic stats: COMPLETE Column stats: COMPLETE
>                   Filter Operator
>                     predicate: ss_addr_sk is not null (type: boolean)
>                     Statistics: Num rows: 537118738 Data size: 6344204488 Basic stats: COMPLETE Column stats: COMPLETE
>                     Select Operator
>                       expressions: ss_addr_sk (type: int), ss_ext_sales_price (type: float), ss_sold_date_sk (type: int)
>                       outputColumnNames: _col0, _col1, _col2
>                       Statistics: Num rows: 537118738 Data size: 6344204488 Basic stats: COMPLETE Column stats: COMPLETE
>                       Map Join Operator
>                         condition map:
>                              Inner Join 0 to 1
>                         condition expressions:
>                           0 {_col0} {_col1}
>                           1 
>                         keys:
>                           0 _col2 (type: int)
>                           1 _col0 (type: int)
>                         outputColumnNames: _col0, _col1
>                         input vertices:
>                           1 Map 4
>                         Statistics: Num rows: 600476032 Data size: 4803808256 Basic stats: COMPLETE Column stats: COMPLETE
>                         Map Join Operator
>                           condition map:
>                                Inner Join 0 to 1
>                           condition expressions:
>                             0 {_col1}
>                             1 {_col1}
>                           keys:
>                             0 _col0 (type: int)
>                             1 _col0 (type: int)
>                           outputColumnNames: _col1, _col7
>                           input vertices:
>                             1 Map 7
>                           Statistics: Num rows: 600476032 Data size: 58846651136 Basic stats: COMPLETE Column stats: COMPLETE
>                           Select Operator
>                             expressions: _col7 (type: string), 2 (type: int), 1998 (type: int), _col1 (type: float)
>                             outputColumnNames: _col0, _col1, _col2, _col3
>                             Statistics: Num rows: 600476032 Data size: 58846651136 Basic stats: COMPLETE Column stats: COMPLETE
>                             Group By Operator
>                               aggregations: sum(_col3)
>                               keys: _col0 (type: string), _col1 (type: int), _col2 (type: int)
>                               mode: hash
>                               outputColumnNames: _col0, _col1, _col2, _col3
>                               Statistics: Num rows: 589 Data size: 4712 Basic stats: COMPLETE Column stats: COMPLETE
>                               Reduce Output Operator
>                                 key expressions: _col0 (type: string), _col1 (type: int), _col2 (type: int)
>                                 sort order: +++
>                                 Map-reduce partition columns: _col0 (type: string), _col1 (type: int), _col2 (type: int)
>                                 Statistics: Num rows: 589 Data size: 4712 Basic stats: COMPLETE Column stats: COMPLETE
>                                 value expressions: _col3 (type: double)
>             Execution mode: vectorized
>         Map 14 
>             Map Operator Tree:
>                 TableScan
>                   alias: web_sales
>                   filterExpr: ws_bill_addr_sk is not null (type: boolean)
>                   Statistics: Num rows: 143966864 Data size: 19001610332 Basic stats: COMPLETE Column stats: COMPLETE
>                   Filter Operator
>                     predicate: ws_bill_addr_sk is not null (type: boolean)
>                     Statistics: Num rows: 143949044 Data size: 1727244892 Basic stats: COMPLETE Column stats: COMPLETE
>                     Select Operator
>                       expressions: ws_bill_addr_sk (type: int), ws_ext_sales_price (type: float), ws_sold_date_sk (type: int)
>                       outputColumnNames: _col0, _col1, _col2
>                       Statistics: Num rows: 143949044 Data size: 1727244892 Basic stats: COMPLETE Column stats: COMPLETE
>                       Map Join Operator
>                         condition map:
>                              Inner Join 0 to 1
>                         condition expressions:
>                           0 {_col0} {_col1}
>                           1 
>                         keys:
>                           0 _col2 (type: int)
>                           1 _col0 (type: int)
>                         outputColumnNames: _col0, _col1
>                         input vertices:
>                           1 Map 8
>                         Statistics: Num rows: 600476032 Data size: 4803808256 Basic stats: COMPLETE Column stats: COMPLETE
>                         Select Operator
>                           expressions: _col0 (type: int), _col1 (type: float), 1998 (type: int), 1 (type: int)
>                           outputColumnNames: _col0, _col1, _col4, _col5
>                           Statistics: Num rows: 600476032 Data size: 9607616512 Basic stats: COMPLETE Column stats: COMPLETE
>                           Map Join Operator
>                             condition map:
>                                  Inner Join 0 to 1
>                             condition expressions:
>                               0 {_col1}
>                               1 {_col1} {_col4} {_col5}
>                             keys:
>                               0 _col0 (type: int)
>                               1 _col0 (type: int)
>                             outputColumnNames: _col1, _col3, _col6, _col7
>                             input vertices:
>                               0 Map 22
>                             Statistics: Num rows: 600476032 Data size: 7205712384 Basic stats: COMPLETE Column stats: COMPLETE
>                             Select Operator
>                               expressions: _col1 (type: string), _col7 (type: int), _col6 (type: int), _col3 (type: float)
>                               outputColumnNames: _col0, _col1, _col2, _col3
>                               Statistics: Num rows: 600476032 Data size: 7205712384 Basic stats: COMPLETE Column stats: COMPLETE
>                               Group By Operator
>                                 aggregations: sum(_col3)
>                                 keys: _col0 (type: string), _col1 (type: int), _col2 (type: int)
>                                 mode: hash
>                                 outputColumnNames: _col0, _col1, _col2, _col3
>                                 Statistics: Num rows: 73 Data size: 584 Basic stats: COMPLETE Column stats: COMPLETE
>                                 Reduce Output Operator
>                                   key expressions: _col0 (type: string), _col1 (type: int), _col2 (type: int)
>                                   sort order: +++
>                                   Map-reduce partition columns: _col0 (type: string), _col1 (type: int), _col2 (type: int)
>                                   Statistics: Num rows: 73 Data size: 584 Basic stats: COMPLETE Column stats: COMPLETE
>                                   value expressions: _col3 (type: double)
>             Execution mode: vectorized
>         Map 16 
>             Map Operator Tree:
>                 TableScan
>                   alias: web_sales
>                   filterExpr: ws_bill_addr_sk is not null (type: boolean)
>                   Statistics: Num rows: 143966864 Data size: 19001610332 Basic stats: COMPLETE Column stats: COMPLETE
>                   Filter Operator
>                     predicate: ws_bill_addr_sk is not null (type: boolean)
>                     Statistics: Num rows: 143949044 Data size: 1727244892 Basic stats: COMPLETE Column stats: COMPLETE
>                     Select Operator
>                       expressions: ws_bill_addr_sk (type: int), ws_ext_sales_price (type: float), ws_sold_date_sk (type: int)
>                       outputColumnNames: _col0, _col1, _col2
>                       Statistics: Num rows: 143949044 Data size: 1727244892 Basic stats: COMPLETE Column stats: COMPLETE
>                       Map Join Operator
>                         condition map:
>                              Inner Join 0 to 1
>                         condition expressions:
>                           0 {_col0} {_col1}
>                           1 
>                         keys:
>                           0 _col2 (type: int)
>                           1 _col0 (type: int)
>                         outputColumnNames: _col0, _col1
>                         input vertices:
>                           1 Map 25
>                         Statistics: Num rows: 600476032 Data size: 4803808256 Basic stats: COMPLETE Column stats: COMPLETE
>                         Select Operator
>                           expressions: _col0 (type: int), _col1 (type: float), 1998 (type: int), 3 (type: int)
>                           outputColumnNames: _col0, _col1, _col4, _col5
>                           Statistics: Num rows: 600476032 Data size: 9607616512 Basic stats: COMPLETE Column stats: COMPLETE
>                           Map Join Operator
>                             condition map:
>                                  Inner Join 0 to 1
>                             condition expressions:
>                               0 {_col1}
>                               1 {_col1} {_col4} {_col5}
>                             keys:
>                               0 _col0 (type: int)
>                               1 _col0 (type: int)
>                             outputColumnNames: _col1, _col3, _col6, _col7
>                             input vertices:
>                               0 Map 6
>                             Statistics: Num rows: 600476032 Data size: 7205712384 Basic stats: COMPLETE Column stats: COMPLETE
>                             Select Operator
>                               expressions: _col1 (type: string), _col7 (type: int), _col6 (type: int), _col3 (type: float)
>                               outputColumnNames: _col0, _col1, _col2, _col3
>                               Statistics: Num rows: 600476032 Data size: 7205712384 Basic stats: COMPLETE Column stats: COMPLETE
>                               Group By Operator
>                                 aggregations: sum(_col3)
>                                 keys: _col0 (type: string), _col1 (type: int), _col2 (type: int)
>                                 mode: hash
>                                 outputColumnNames: _col0, _col1, _col2, _col3
>                                 Statistics: Num rows: 73 Data size: 584 Basic stats: COMPLETE Column stats: COMPLETE
>                                 Reduce Output Operator
>                                   key expressions: _col0 (type: string), _col1 (type: int), _col2 (type: int)
>                                   sort order: +++
>                                   Map-reduce partition columns: _col0 (type: string), _col1 (type: int), _col2 (type: int)
>                                   Statistics: Num rows: 73 Data size: 584 Basic stats: COMPLETE Column stats: COMPLETE
>                                   value expressions: _col3 (type: double)
>             Execution mode: vectorized
>         Map 18 
>             Map Operator Tree:
>                 TableScan
>                   alias: store_sales
>                   filterExpr: ss_addr_sk is not null (type: boolean)
>                   Statistics: Num rows: 550076554 Data size: 47370018896 Basic stats: COMPLETE Column stats: COMPLETE
>                   Filter Operator
>                     predicate: ss_addr_sk is not null (type: boolean)
>                     Statistics: Num rows: 537118738 Data size: 6344204488 Basic stats: COMPLETE Column stats: COMPLETE
>                     Select Operator
>                       expressions: ss_addr_sk (type: int), ss_ext_sales_price (type: float), ss_sold_date_sk (type: int)
>                       outputColumnNames: _col0, _col1, _col2
>                       Statistics: Num rows: 537118738 Data size: 6344204488 Basic stats: COMPLETE Column stats: COMPLETE
>                       Map Join Operator
>                         condition map:
>                              Inner Join 0 to 1
>                         condition expressions:
>                           0 {_col0} {_col1}
>                           1 
>                         keys:
>                           0 _col2 (type: int)
>                           1 _col0 (type: int)
>                         outputColumnNames: _col0, _col1
>                         input vertices:
>                           1 Map 5
>                         Statistics: Num rows: 600476032 Data size: 4803808256 Basic stats: COMPLETE Column stats: COMPLETE
>                         Map Join Operator
>                           condition map:
>                                Inner Join 0 to 1
>                           condition expressions:
>                             0 {_col1}
>                             1 {_col1}
>                           keys:
>                             0 _col0 (type: int)
>                             1 _col0 (type: int)
>                           outputColumnNames: _col1, _col7
>                           input vertices:
>                             1 Map 24
>                           Statistics: Num rows: 600476032 Data size: 58846651136 Basic stats: COMPLETE Column stats: COMPLETE
>                           Select Operator
>                             expressions: _col7 (type: string), 3 (type: int), 1998 (type: int), _col1 (type: float)
>                             outputColumnNames: _col0, _col1, _col2, _col3
>                             Statistics: Num rows: 600476032 Data size: 58846651136 Basic stats: COMPLETE Column stats: COMPLETE
>                             Group By Operator
>                               aggregations: sum(_col3)
>                               keys: _col0 (type: string), _col1 (type: int), _col2 (type: int)
>                               mode: hash
>                               outputColumnNames: _col0, _col1, _col2, _col3
>                               Statistics: Num rows: 589 Data size: 4712 Basic stats: COMPLETE Column stats: COMPLETE
>                               Reduce Output Operator
>                                 key expressions: _col0 (type: string), _col1 (type: int), _col2 (type: int)
>                                 sort order: +++
>                                 Map-reduce partition columns: _col0 (type: string), _col1 (type: int), _col2 (type: int)
>                                 Statistics: Num rows: 589 Data size: 4712 Basic stats: COMPLETE Column stats: COMPLETE
>                                 value expressions: _col3 (type: double)
>             Execution mode: vectorized
>         Map 20 
>             Map Operator Tree:
>                 TableScan
>                   alias: web_sales
>                   filterExpr: ws_bill_addr_sk is not null (type: boolean)
>                   Statistics: Num rows: 143966864 Data size: 19001610332 Basic stats: COMPLETE Column stats: COMPLETE
>                   Filter Operator
>                     predicate: ws_bill_addr_sk is not null (type: boolean)
>                     Statistics: Num rows: 143949044 Data size: 1727244892 Basic stats: COMPLETE Column stats: COMPLETE
>                     Select Operator
>                       expressions: ws_bill_addr_sk (type: int), ws_ext_sales_price (type: float), ws_sold_date_sk (type: int)
>                       outputColumnNames: _col0, _col1, _col2
>                       Statistics: Num rows: 143949044 Data size: 1727244892 Basic stats: COMPLETE Column stats: COMPLETE
>                       Map Join Operator
>                         condition map:
>                              Inner Join 0 to 1
>                         condition expressions:
>                           0 {_col0} {_col1}
>                           1 
>                         keys:
>                           0 _col2 (type: int)
>                           1 _col0 (type: int)
>                         outputColumnNames: _col0, _col1
>                         input vertices:
>                           1 Map 3
>                         Statistics: Num rows: 600476032 Data size: 4803808256 Basic stats: COMPLETE Column stats: COMPLETE
>                         Select Operator
>                           expressions: _col0 (type: int), _col1 (type: float), 1998 (type: int), 2 (type: int)
>                           outputColumnNames: _col0, _col1, _col4, _col5
>                           Statistics: Num rows: 600476032 Data size: 9607616512 Basic stats: COMPLETE Column stats: COMPLETE
>                           Map Join Operator
>                             condition map:
>                                  Inner Join 0 to 1
>                             condition expressions:
>                               0 {_col1}
>                               1 {_col1} {_col4} {_col5}
>                             keys:
>                               0 _col0 (type: int)
>                               1 _col0 (type: int)
>                             outputColumnNames: _col1, _col3, _col6, _col7
>                             input vertices:
>                               0 Map 23
>                             Statistics: Num rows: 600476032 Data size: 7205712384 Basic stats: COMPLETE Column stats: COMPLETE
>                             Select Operator
>                               expressions: _col1 (type: string), _col7 (type: int), _col6 (type: int), _col3 (type: float)
>                               outputColumnNames: _col0, _col1, _col2, _col3
>                               Statistics: Num rows: 600476032 Data size: 7205712384 Basic stats: COMPLETE Column stats: COMPLETE
>                               Group By Operator
>                                 aggregations: sum(_col3)
>                                 keys: _col0 (type: string), _col1 (type: int), _col2 (type: int)
>                                 mode: hash
>                                 outputColumnNames: _col0, _col1, _col2, _col3
>                                 Statistics: Num rows: 73 Data size: 584 Basic stats: COMPLETE Column stats: COMPLETE
>                                 Reduce Output Operator
>                                   key expressions: _col0 (type: string), _col1 (type: int), _col2 (type: int)
>                                   sort order: +++
>                                   Map-reduce partition columns: _col0 (type: string), _col1 (type: int), _col2 (type: int)
>                                   Statistics: Num rows: 73 Data size: 584 Basic stats: COMPLETE Column stats: COMPLETE
>                                   value expressions: _col3 (type: double)
>             Execution mode: vectorized
>         Map 22 
>             Map Operator Tree:
>                 TableScan
>                   alias: customer_address
>                   filterExpr: (ca_address_sk is not null and ca_county is not null) (type: boolean)
>                   Statistics: Num rows: 800000 Data size: 811903688 Basic stats: COMPLETE Column stats: COMPLETE
>                   Filter Operator
>                     predicate: (ca_address_sk is not null and ca_county is not null) (type: boolean)
>                     Statistics: Num rows: 800000 Data size: 81600000 Basic stats: COMPLETE Column stats: COMPLETE
>                     Select Operator
>                       expressions: ca_address_sk (type: int), ca_county (type: string)
>                       outputColumnNames: _col0, _col1
>                       Statistics: Num rows: 800000 Data size: 81600000 Basic stats: COMPLETE Column stats: COMPLETE
>                       Reduce Output Operator
>                         key expressions: _col0 (type: int)
>                         sort order: +
>                         Map-reduce partition columns: _col0 (type: int)
>                         Statistics: Num rows: 800000 Data size: 81600000 Basic stats: COMPLETE Column stats: COMPLETE
>                         value expressions: _col1 (type: string)
>             Execution mode: vectorized
>         Map 23 
>             Map Operator Tree:
>                 TableScan
>                   alias: customer_address
>                   filterExpr: (ca_address_sk is not null and ca_county is not null) (type: boolean)
>                   Statistics: Num rows: 800000 Data size: 811903688 Basic stats: COMPLETE Column stats: COMPLETE
>                   Filter Operator
>                     predicate: (ca_address_sk is not null and ca_county is not null) (type: boolean)
>                     Statistics: Num rows: 800000 Data size: 81600000 Basic stats: COMPLETE Column stats: COMPLETE
>                     Select Operator
>                       expressions: ca_address_sk (type: int), ca_county (type: string)
>                       outputColumnNames: _col0, _col1
>                       Statistics: Num rows: 800000 Data size: 81600000 Basic stats: COMPLETE Column stats: COMPLETE
>                       Reduce Output Operator
>                         key expressions: _col0 (type: int)
>                         sort order: +
>                         Map-reduce partition columns: _col0 (type: int)
>                         Statistics: Num rows: 800000 Data size: 81600000 Basic stats: COMPLETE Column stats: COMPLETE
>                         value expressions: _col1 (type: string)
>             Execution mode: vectorized
>         Map 24 
>             Map Operator Tree:
>                 TableScan
>                   alias: customer_address
>                   filterExpr: (ca_address_sk is not null and ca_county is not null) (type: boolean)
>                   Statistics: Num rows: 800000 Data size: 811903688 Basic stats: COMPLETE Column stats: COMPLETE
>                   Filter Operator
>                     predicate: (ca_address_sk is not null and ca_county is not null) (type: boolean)
>                     Statistics: Num rows: 800000 Data size: 81600000 Basic stats: COMPLETE Column stats: COMPLETE
>                     Select Operator
>                       expressions: ca_address_sk (type: int), ca_county (type: string)
>                       outputColumnNames: _col0, _col1
>                       Statistics: Num rows: 800000 Data size: 81600000 Basic stats: COMPLETE Column stats: COMPLETE
>                       Reduce Output Operator
>                         key expressions: _col0 (type: int)
>                         sort order: +
>                         Map-reduce partition columns: _col0 (type: int)
>                         Statistics: Num rows: 800000 Data size: 81600000 Basic stats: COMPLETE Column stats: COMPLETE
>                         value expressions: _col1 (type: string)
>             Execution mode: vectorized
>         Map 25 
>             Map Operator Tree:
>                 TableScan
>                   alias: date_dim
>                   filterExpr: (((d_qoy = 3) and (d_year = 1998)) and d_date_sk is not null) (type: boolean)
>                   Statistics: Num rows: 73049 Data size: 81741831 Basic stats: COMPLETE Column stats: COMPLETE
>                   Filter Operator
>                     predicate: (((d_qoy = 3) and (d_year = 1998)) and d_date_sk is not null) (type: boolean)
>                     Statistics: Num rows: 635 Data size: 7620 Basic stats: COMPLETE Column stats: COMPLETE
>                     Select Operator
>                       expressions: d_date_sk (type: int)
>                       outputColumnNames: _col0
>                       Statistics: Num rows: 635 Data size: 2540 Basic stats: COMPLETE Column stats: COMPLETE
>                       Reduce Output Operator
>                         key expressions: _col0 (type: int)
>                         sort order: +
>                         Map-reduce partition columns: _col0 (type: int)
>                         Statistics: Num rows: 635 Data size: 2540 Basic stats: COMPLETE Column stats: COMPLETE
>                       Select Operator
>                         expressions: _col0 (type: int)
>                         outputColumnNames: _col0
>                         Statistics: Num rows: 635 Data size: 0 Basic stats: PARTIAL Column stats: COMPLETE
>                         Group By Operator
>                           keys: _col0 (type: int)
>                           mode: hash
>                           outputColumnNames: _col0
>                           Statistics: Num rows: 635 Data size: 0 Basic stats: PARTIAL Column stats: COMPLETE
>                           Dynamic Partitioning Event Operator
>                             Target Input: web_sales
>                             Partition key expr: ws_sold_date_sk
>                             Statistics: Num rows: 635 Data size: 0 Basic stats: PARTIAL Column stats: COMPLETE
>                             Target column: ws_sold_date_sk
>                             Target Vertex: Map 16
>             Execution mode: vectorized
>         Map 3 
>             Map Operator Tree:
>                 TableScan
>                   alias: date_dim
>                   filterExpr: (((d_qoy = 2) and (d_year = 1998)) and d_date_sk is not null) (type: boolean)
>                   Statistics: Num rows: 73049 Data size: 81741831 Basic stats: COMPLETE Column stats: COMPLETE
>                   Filter Operator
>                     predicate: (((d_qoy = 2) and (d_year = 1998)) and d_date_sk is not null) (type: boolean)
>                     Statistics: Num rows: 635 Data size: 7620 Basic stats: COMPLETE Column stats: COMPLETE
>                     Select Operator
>                       expressions: d_date_sk (type: int)
>                       outputColumnNames: _col0
>                       Statistics: Num rows: 635 Data size: 2540 Basic stats: COMPLETE Column stats: COMPLETE
>                       Reduce Output Operator
>                         key expressions: _col0 (type: int)
>                         sort order: +
>                         Map-reduce partition columns: _col0 (type: int)
>                         Statistics: Num rows: 635 Data size: 2540 Basic stats: COMPLETE Column stats: COMPLETE
>                       Select Operator
>                         expressions: _col0 (type: int)
>                         outputColumnNames: _col0
>                         Statistics: Num rows: 635 Data size: 0 Basic stats: PARTIAL Column stats: COMPLETE
>                         Group By Operator
>                           keys: _col0 (type: int)
>                           mode: hash
>                           outputColumnNames: _col0
>                           Statistics: Num rows: 635 Data size: 0 Basic stats: PARTIAL Column stats: COMPLETE
>                           Dynamic Partitioning Event Operator
>                             Target Input: web_sales
>                             Partition key expr: ws_sold_date_sk
>                             Statistics: Num rows: 635 Data size: 0 Basic stats: PARTIAL Column stats: COMPLETE
>                             Target column: ws_sold_date_sk
>                             Target Vertex: Map 20
>             Execution mode: vectorized
>         Map 4 
>             Map Operator Tree:
>                 TableScan
>                   alias: date_dim
>                   filterExpr: (((d_qoy = 2) and (d_year = 1998)) and d_date_sk is not null) (type: boolean)
>                   Statistics: Num rows: 73049 Data size: 81741831 Basic stats: COMPLETE Column stats: COMPLETE
>                   Filter Operator
>                     predicate: (((d_qoy = 2) and (d_year = 1998)) and d_date_sk is not null) (type: boolean)
>                     Statistics: Num rows: 635 Data size: 7620 Basic stats: COMPLETE Column stats: COMPLETE
>                     Select Operator
>                       expressions: d_date_sk (type: int)
>                       outputColumnNames: _col0
>                       Statistics: Num rows: 635 Data size: 2540 Basic stats: COMPLETE Column stats: COMPLETE
>                       Reduce Output Operator
>                         key expressions: _col0 (type: int)
>                         sort order: +
>                         Map-reduce partition columns: _col0 (type: int)
>                         Statistics: Num rows: 635 Data size: 2540 Basic stats: COMPLETE Column stats: COMPLETE
>                       Select Operator
>                         expressions: _col0 (type: int)
>                         outputColumnNames: _col0
>                         Statistics: Num rows: 635 Data size: 0 Basic stats: PARTIAL Column stats: COMPLETE
>                         Group By Operator
>                           keys: _col0 (type: int)
>                           mode: hash
>                           outputColumnNames: _col0
>                           Statistics: Num rows: 635 Data size: 0 Basic stats: PARTIAL Column stats: COMPLETE
>                           Dynamic Partitioning Event Operator
>                             Target Input: store_sales
>                             Partition key expr: ss_sold_date_sk
>                             Statistics: Num rows: 635 Data size: 0 Basic stats: PARTIAL Column stats: COMPLETE
>                             Target column: ss_sold_date_sk
>                             Target Vertex: Map 11
>             Execution mode: vectorized
>         Map 5 
>             Map Operator Tree:
>                 TableScan
>                   alias: date_dim
>                   filterExpr: (((d_qoy = 3) and (d_year = 1998)) and d_date_sk is not null) (type: boolean)
>                   Statistics: Num rows: 73049 Data size: 81741831 Basic stats: COMPLETE Column stats: COMPLETE
>                   Filter Operator
>                     predicate: (((d_qoy = 3) and (d_year = 1998)) and d_date_sk is not null) (type: boolean)
>                     Statistics: Num rows: 635 Data size: 7620 Basic stats: COMPLETE Column stats: COMPLETE
>                     Select Operator
>                       expressions: d_date_sk (type: int)
>                       outputColumnNames: _col0
>                       Statistics: Num rows: 635 Data size: 2540 Basic stats: COMPLETE Column stats: COMPLETE
>                       Reduce Output Operator
>                         key expressions: _col0 (type: int)
>                         sort order: +
>                         Map-reduce partition columns: _col0 (type: int)
>                         Statistics: Num rows: 635 Data size: 2540 Basic stats: COMPLETE Column stats: COMPLETE
>                       Select Operator
>                         expressions: _col0 (type: int)
>                         outputColumnNames: _col0
>                         Statistics: Num rows: 635 Data size: 0 Basic stats: PARTIAL Column stats: COMPLETE
>                         Group By Operator
>                           keys: _col0 (type: int)
>                           mode: hash
>                           outputColumnNames: _col0
>                           Statistics: Num rows: 635 Data size: 0 Basic stats: PARTIAL Column stats: COMPLETE
>                           Dynamic Partitioning Event Operator
>                             Target Input: store_sales
>                             Partition key expr: ss_sold_date_sk
>                             Statistics: Num rows: 635 Data size: 0 Basic stats: PARTIAL Column stats: COMPLETE
>                             Target column: ss_sold_date_sk
>                             Target Vertex: Map 18
>             Execution mode: vectorized
>         Map 6 
>             Map Operator Tree:
>                 TableScan
>                   alias: customer_address
>                   filterExpr: (ca_address_sk is not null and ca_county is not null) (type: boolean)
>                   Statistics: Num rows: 800000 Data size: 811903688 Basic stats: COMPLETE Column stats: COMPLETE
>                   Filter Operator
>                     predicate: (ca_address_sk is not null and ca_county is not null) (type: boolean)
>                     Statistics: Num rows: 800000 Data size: 81600000 Basic stats: COMPLETE Column stats: COMPLETE
>                     Select Operator
>                       expressions: ca_address_sk (type: int), ca_county (type: string)
>                       outputColumnNames: _col0, _col1
>                       Statistics: Num rows: 800000 Data size: 81600000 Basic stats: COMPLETE Column stats: COMPLETE
>                       Reduce Output Operator
>                         key expressions: _col0 (type: int)
>                         sort order: +
>                         Map-reduce partition columns: _col0 (type: int)
>                         Statistics: Num rows: 800000 Data size: 81600000 Basic stats: COMPLETE Column stats: COMPLETE
>                         value expressions: _col1 (type: string)
>             Execution mode: vectorized
>         Map 7 
>             Map Operator Tree:
>                 TableScan
>                   alias: customer_address
>                   filterExpr: (ca_address_sk is not null and ca_county is not null) (type: boolean)
>                   Statistics: Num rows: 800000 Data size: 811903688 Basic stats: COMPLETE Column stats: COMPLETE
>                   Filter Operator
>                     predicate: (ca_address_sk is not null and ca_county is not null) (type: boolean)
>                     Statistics: Num rows: 800000 Data size: 81600000 Basic stats: COMPLETE Column stats: COMPLETE
>                     Select Operator
>                       expressions: ca_address_sk (type: int), ca_county (type: string)
>                       outputColumnNames: _col0, _col1
>                       Statistics: Num rows: 800000 Data size: 81600000 Basic stats: COMPLETE Column stats: COMPLETE
>                       Reduce Output Operator
>                         key expressions: _col0 (type: int)
>                         sort order: +
>                         Map-reduce partition columns: _col0 (type: int)
>                         Statistics: Num rows: 800000 Data size: 81600000 Basic stats: COMPLETE Column stats: COMPLETE
>                         value expressions: _col1 (type: string)
>             Execution mode: vectorized
>         Map 8 
>             Map Operator Tree:
>                 TableScan
>                   alias: date_dim
>                   filterExpr: (((d_qoy = 1) and (d_year = 1998)) and d_date_sk is not null) (type: boolean)
>                   Statistics: Num rows: 73049 Data size: 81741831 Basic stats: COMPLETE Column stats: COMPLETE
>                   Filter Operator
>                     predicate: (((d_qoy = 1) and (d_year = 1998)) and d_date_sk is not null) (type: boolean)
>                     Statistics: Num rows: 635 Data size: 7620 Basic stats: COMPLETE Column stats: COMPLETE
>                     Select Operator
>                       expressions: d_date_sk (type: int)
>                       outputColumnNames: _col0
>                       Statistics: Num rows: 635 Data size: 2540 Basic stats: COMPLETE Column stats: COMPLETE
>                       Reduce Output Operator
>                         key expressions: _col0 (type: int)
>                         sort order: +
>                         Map-reduce partition columns: _col0 (type: int)
>                         Statistics: Num rows: 635 Data size: 2540 Basic stats: COMPLETE Column stats: COMPLETE
>                       Select Operator
>                         expressions: _col0 (type: int)
>                         outputColumnNames: _col0
>                         Statistics: Num rows: 635 Data size: 0 Basic stats: PARTIAL Column stats: COMPLETE
>                         Group By Operator
>                           keys: _col0 (type: int)
>                           mode: hash
>                           outputColumnNames: _col0
>                           Statistics: Num rows: 635 Data size: 0 Basic stats: PARTIAL Column stats: COMPLETE
>                           Dynamic Partitioning Event Operator
>                             Target Input: web_sales
>                             Partition key expr: ws_sold_date_sk
>                             Statistics: Num rows: 635 Data size: 0 Basic stats: PARTIAL Column stats: COMPLETE
>                             Target column: ws_sold_date_sk
>                             Target Vertex: Map 14
>             Execution mode: vectorized
>         Map 9 
>             Map Operator Tree:
>                 TableScan
>                   alias: date_dim
>                   filterExpr: (((d_qoy = 1) and (d_year = 1998)) and d_date_sk is not null) (type: boolean)
>                   Statistics: Num rows: 73049 Data size: 81741831 Basic stats: COMPLETE Column stats: COMPLETE
>                   Filter Operator
>                     predicate: (((d_qoy = 1) and (d_year = 1998)) and d_date_sk is not null) (type: boolean)
>                     Statistics: Num rows: 635 Data size: 7620 Basic stats: COMPLETE Column stats: COMPLETE
>                     Select Operator
>                       expressions: d_date_sk (type: int)
>                       outputColumnNames: _col0
>                       Statistics: Num rows: 635 Data size: 2540 Basic stats: COMPLETE Column stats: COMPLETE
>                       Reduce Output Operator
>                         key expressions: _col0 (type: int)
>                         sort order: +
>                         Map-reduce partition columns: _col0 (type: int)
>                         Statistics: Num rows: 635 Data size: 2540 Basic stats: COMPLETE Column stats: COMPLETE
>                       Select Operator
>                         expressions: _col0 (type: int)
>                         outputColumnNames: _col0
>                         Statistics: Num rows: 635 Data size: 0 Basic stats: PARTIAL Column stats: COMPLETE
>                         Group By Operator
>                           keys: _col0 (type: int)
>                           mode: hash
>                           outputColumnNames: _col0
>                           Statistics: Num rows: 635 Data size: 0 Basic stats: PARTIAL Column stats: COMPLETE
>                           Dynamic Partitioning Event Operator
>                             Target Input: store_sales
>                             Partition key expr: ss_sold_date_sk
>                             Statistics: Num rows: 635 Data size: 0 Basic stats: PARTIAL Column stats: COMPLETE
>                             Target column: ss_sold_date_sk
>                             Target Vertex: Map 1
>             Execution mode: vectorized
>         Reducer 12 
>             Reduce Operator Tree:
>               Group By Operator
>                 aggregations: sum(VALUE._col0)
>                 keys: KEY._col0 (type: string), KEY._col1 (type: int), KEY._col2 (type: int)
>                 mode: mergepartial
>                 outputColumnNames: _col0, _col1, _col2, _col3
>                 Statistics: Num rows: 589 Data size: 9424 Basic stats: COMPLETE Column stats: COMPLETE
>                 Select Operator
>                   expressions: _col0 (type: string), _col3 (type: double)
>                   outputColumnNames: _col0, _col1
>                   Statistics: Num rows: 589 Data size: 9424 Basic stats: COMPLETE Column stats: COMPLETE
>                   Map Join Operator
>                     condition map:
>                          Inner Join 0 to 1
>                     condition expressions:
>                       0 {_col0} {_col1}
>                       1 {_col1}
>                     keys:
>                       0 _col0 (type: string)
>                       1 _col0 (type: string)
>                     outputColumnNames: _col0, _col1, _col3
>                     input vertices:
>                       1 Reducer 19
>                     Statistics: Num rows: 600476032 Data size: 9607616512 Basic stats: COMPLETE Column stats: COMPLETE
>                     Select Operator
>                       expressions: _col0 (type: string), _col1 (type: double), _col3 (type: double)
>                       outputColumnNames: _col0, _col1, _col3
>                       Statistics: Num rows: 600476032 Data size: 9607616512 Basic stats: COMPLETE Column stats: COMPLETE
>                       Map Join Operator
>                         condition map:
>                              Inner Join 0 to 1
>                              Inner Join 0 to 2
>                              Inner Join 2 to 3
>                         condition expressions:
>                           0 {_col0} {_col1}
>                           1 {_col1}
>                           2 {_col0} {_col1} {_col2}
>                           3 {_col1} {_col3}
>                         keys:
>                           0 _col0 (type: string)
>                           1 _col0 (type: string)
>                           2 _col0 (type: string)
>                           3 _col0 (type: string)
>                         outputColumnNames: _col0, _col1, _col3, _col4, _col5, _col6, _col8, _col10
>                         input vertices:
>                           0 Reducer 15
>                           1 Reducer 21
>                           2 Reducer 2
>                         Statistics: Num rows: 600476032 Data size: 19215233024 Basic stats: COMPLETE Column stats: COMPLETE
>                         Filter Operator
>                           predicate: (CASE WHEN ((_col1 > 0.0)) THEN ((_col3 / _col1)) ELSE (null) END > CASE WHEN ((_col6 > 0.0)) THEN ((_col8 / _col6)) ELSE (null) END) (type: boolean)
>                           Statistics: Num rows: 200158677 Data size: 6405077664 Basic stats: COMPLETE Column stats: COMPLETE
>                           Select Operator
>                             expressions: _col4 (type: string), _col5 (type: int), _col3 (type: double), _col6 (type: double), _col8 (type: double), _col10 (type: double), _col0 (type: string), _col1 (type: double)
>                             outputColumnNames: _col0, _col1, _col10, _col2, _col4, _col6, _col7, _col8
>                             Statistics: Num rows: 200158677 Data size: 6405077664 Basic stats: COMPLETE Column stats: COMPLETE
>                             Map Join Operator
>                               condition map:
>                                    Inner Join 0 to 1
>                               condition expressions:
>                                 0 {_col1}
>                                 1 {_col0} {_col1} {_col2} {_col4} {_col6} {_col8} {_col10}
>                               keys:
>                                 0 _col0 (type: string)
>                                 1 _col7 (type: string)
>                               outputColumnNames: _col1, _col2, _col3, _col4, _col6, _col8, _col10, _col12
>                               input vertices:
>                                 0 Reducer 17
>                               Statistics: Num rows: 600476032 Data size: 19215233024 Basic stats: COMPLETE Column stats: COMPLETE
>                               Filter Operator
>                                 predicate: (CASE WHEN ((_col12 > 0.0)) THEN ((_col1 / _col12)) ELSE (null) END > CASE WHEN ((_col6 > 0.0)) THEN ((_col8 / _col6)) ELSE (null) END) (type: boolean)
>                                 Statistics: Num rows: 200158677 Data size: 6405077664 Basic stats: COMPLETE Column stats: COMPLETE
>                                 Select Operator
>                                   expressions: _col2 (type: string), _col3 (type: int), (_col12 / _col10) (type: double), (_col6 / _col4) (type: double), (_col1 / _col12) (type: double), (_col8 / _col6) (type: double)
>                                   outputColumnNames: _col0, _col1, _col2, _col3, _col4, _col5
>                                   Statistics: Num rows: 200158677 Data size: 8006347080 Basic stats: COMPLETE Column stats: COMPLETE
>                                   Reduce Output Operator
>                                     key expressions: _col2 (type: double)
>                                     sort order: +
>                                     Statistics: Num rows: 200158677 Data size: 8006347080 Basic stats: COMPLETE Column stats: COMPLETE
>                                     value expressions: _col0 (type: string), _col1 (type: int), _col3 (type: double), _col4 (type: double), _col5 (type: double)
>         Reducer 13 
>             Reduce Operator Tree:
>               Select Operator
>                 expressions: VALUE._col0 (type: string), VALUE._col1 (type: int), KEY.reducesinkkey0 (type: double), VALUE._col2 (type: double), VALUE._col3 (type: double), VALUE._col4 (type: double)
>                 outputColumnNames: _col0, _col1, _col2, _col3, _col4, _col5
>                 Statistics: Num rows: 200158677 Data size: 8006347080 Basic stats: COMPLETE Column stats: COMPLETE
>                 File Output Operator
>                   compressed: false
>                   Statistics: Num rows: 200158677 Data size: 8006347080 Basic stats: COMPLETE Column stats: COMPLETE
>                   table:
>                       input format: org.apache.hadoop.mapred.TextInputFormat
>                       output format: org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat
>                       serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe
>             Execution mode: vectorized
>         Reducer 15 
>             Reduce Operator Tree:
>               Group By Operator
>                 aggregations: sum(VALUE._col0)
>                 keys: KEY._col0 (type: string), KEY._col1 (type: int), KEY._col2 (type: int)
>                 mode: mergepartial
>                 outputColumnNames: _col0, _col1, _col2, _col3
>                 Statistics: Num rows: 73 Data size: 1168 Basic stats: COMPLETE Column stats: COMPLETE
>                 Select Operator
>                   expressions: _col0 (type: string), _col3 (type: double)
>                   outputColumnNames: _col0, _col1
>                   Statistics: Num rows: 73 Data size: 1168 Basic stats: COMPLETE Column stats: COMPLETE
>                   Reduce Output Operator
>                     key expressions: _col0 (type: string)
>                     sort order: +
>                     Map-reduce partition columns: _col0 (type: string)
>                     Statistics: Num rows: 73 Data size: 1168 Basic stats: COMPLETE Column stats: COMPLETE
>                     value expressions: _col1 (type: double)
>             Execution mode: vectorized
>         Reducer 17 
>             Reduce Operator Tree:
>               Group By Operator
>                 aggregations: sum(VALUE._col0)
>                 keys: KEY._col0 (type: string), KEY._col1 (type: int), KEY._col2 (type: int)
>                 mode: mergepartial
>                 outputColumnNames: _col0, _col1, _col2, _col3
>                 Statistics: Num rows: 73 Data size: 1168 Basic stats: COMPLETE Column stats: COMPLETE
>                 Select Operator
>                   expressions: _col0 (type: string), _col3 (type: double)
>                   outputColumnNames: _col0, _col1
>                   Statistics: Num rows: 73 Data size: 1168 Basic stats: COMPLETE Column stats: COMPLETE
>                   Reduce Output Operator
>                     key expressions: _col0 (type: string)
>                     sort order: +
>                     Map-reduce partition columns: _col0 (type: string)
>                     Statistics: Num rows: 73 Data size: 1168 Basic stats: COMPLETE Column stats: COMPLETE
>                     value expressions: _col1 (type: double)
>             Execution mode: vectorized
>         Reducer 19 
>             Reduce Operator Tree:
>               Group By Operator
>                 aggregations: sum(VALUE._col0)
>                 keys: KEY._col0 (type: string), KEY._col1 (type: int), KEY._col2 (type: int)
>                 mode: mergepartial
>                 outputColumnNames: _col0, _col1, _col2, _col3
>                 Statistics: Num rows: 589 Data size: 9424 Basic stats: COMPLETE Column stats: COMPLETE
>                 Select Operator
>                   expressions: _col0 (type: string), _col3 (type: double)
>                   outputColumnNames: _col0, _col1
>                   Statistics: Num rows: 589 Data size: 9424 Basic stats: COMPLETE Column stats: COMPLETE
>                   Reduce Output Operator
>                     key expressions: _col0 (type: string)
>                     sort order: +
>                     Map-reduce partition columns: _col0 (type: string)
>                     Statistics: Num rows: 589 Data size: 9424 Basic stats: COMPLETE Column stats: COMPLETE
>                     value expressions: _col1 (type: double)
>             Execution mode: vectorized
>         Reducer 2 
>             Reduce Operator Tree:
>               Group By Operator
>                 aggregations: sum(VALUE._col0)
>                 keys: KEY._col0 (type: string), KEY._col1 (type: int), KEY._col2 (type: int)
>                 mode: mergepartial
>                 outputColumnNames: _col0, _col1, _col2, _col3
>                 Statistics: Num rows: 589 Data size: 9424 Basic stats: COMPLETE Column stats: COMPLETE
>                 Select Operator
>                   expressions: _col0 (type: string), _col2 (type: int), _col3 (type: double)
>                   outputColumnNames: _col0, _col1, _col2
>                   Statistics: Num rows: 589 Data size: 9424 Basic stats: COMPLETE Column stats: COMPLETE
>                   Reduce Output Operator
>                     key expressions: _col0 (type: string)
>                     sort order: +
>                     Map-reduce partition columns: _col0 (type: string)
>                     Statistics: Num rows: 589 Data size: 9424 Basic stats: COMPLETE Column stats: COMPLETE
>                     value expressions: _col1 (type: int), _col2 (type: double)
>             Execution mode: vectorized
>         Reducer 21 
>             Reduce Operator Tree:
>               Group By Operator
>                 aggregations: sum(VALUE._col0)
>                 keys: KEY._col0 (type: string), KEY._col1 (type: int), KEY._col2 (type: int)
>                 mode: mergepartial
>                 outputColumnNames: _col0, _col1, _col2, _col3
>                 Statistics: Num rows: 73 Data size: 1168 Basic stats: COMPLETE Column stats: COMPLETE
>                 Select Operator
>                   expressions: _col0 (type: string), _col3 (type: double)
>                   outputColumnNames: _col0, _col1
>                   Statistics: Num rows: 73 Data size: 1168 Basic stats: COMPLETE Column stats: COMPLETE
>                   Reduce Output Operator
>                     key expressions: _col0 (type: string)
>                     sort order: +
>                     Map-reduce partition columns: _col0 (type: string)
>                     Statistics: Num rows: 73 Data size: 1168 Basic stats: COMPLETE Column stats: COMPLETE
>                     value expressions: _col1 (type: double)
>             Execution mode: vectorized
>   Stage: Stage-0
>     Fetch Operator
>       limit: -1
>       Processor Tree:
>         ListSink
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)