Posted to commits@hive.apache.org by kg...@apache.org on 2017/12/20 10:40:23 UTC
[22/37] hive git commit: HIVE-18149: Stats: rownum estimation from datasize underestimates in most cases (Zoltan Haindrich, reviewed by Ashutosh Chauhan)
http://git-wip-us.apache.org/repos/asf/hive/blob/e26b9325/ql/src/test/results/clientpositive/spark/auto_sortmerge_join_5.q.out
----------------------------------------------------------------------
diff --git a/ql/src/test/results/clientpositive/spark/auto_sortmerge_join_5.q.out b/ql/src/test/results/clientpositive/spark/auto_sortmerge_join_5.q.out
index 54f11c4..8e28cd1 100644
--- a/ql/src/test/results/clientpositive/spark/auto_sortmerge_join_5.q.out
+++ b/ql/src/test/results/clientpositive/spark/auto_sortmerge_join_5.q.out
@@ -81,16 +81,16 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: b
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: key is not null (type: boolean)
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
Select Operator
expressions: key (type: string)
outputColumnNames: _col0
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
Sorted Merge Bucket Map Join Operator
condition map:
Inner Join 0 to 1
@@ -98,7 +98,7 @@ STAGE PLANS:
0 _col0 (type: string)
1 _col0 (type: string)
Position of Big Table: 1
- Statistics: Num rows: 1 Data size: 248 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 2486 Basic stats: COMPLETE Column stats: NONE
BucketMapJoin: true
Group By Operator
aggregations: count()
@@ -232,16 +232,16 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: a
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: key is not null (type: boolean)
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
Select Operator
expressions: key (type: string)
outputColumnNames: _col0
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
Sorted Merge Bucket Map Join Operator
condition map:
Inner Join 0 to 1
@@ -249,7 +249,7 @@ STAGE PLANS:
0 _col0 (type: string)
1 _col0 (type: string)
Position of Big Table: 0
- Statistics: Num rows: 1 Data size: 3025 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 30250 Basic stats: COMPLETE Column stats: NONE
BucketMapJoin: true
Group By Operator
aggregations: count()
@@ -382,16 +382,16 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: b
- Statistics: Num rows: 1 Data size: 226 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 2260 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: key is not null (type: boolean)
- Statistics: Num rows: 1 Data size: 226 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 2260 Basic stats: COMPLETE Column stats: NONE
Select Operator
expressions: key (type: string)
outputColumnNames: _col0
- Statistics: Num rows: 1 Data size: 226 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 2260 Basic stats: COMPLETE Column stats: NONE
Spark HashTable Sink Operator
keys:
0 _col0 (type: string)
@@ -468,16 +468,16 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: a
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: key is not null (type: boolean)
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
Select Operator
expressions: key (type: string)
outputColumnNames: _col0
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
Map Join Operator
condition map:
Inner Join 0 to 1
@@ -487,7 +487,7 @@ STAGE PLANS:
input vertices:
1 Map 3
Position of Big Table: 0
- Statistics: Num rows: 1 Data size: 3025 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 30250 Basic stats: COMPLETE Column stats: NONE
BucketMapJoin: true
Group By Operator
aggregations: count()
http://git-wip-us.apache.org/repos/asf/hive/blob/e26b9325/ql/src/test/results/clientpositive/spark/bucket_map_join_1.q.out
----------------------------------------------------------------------
diff --git a/ql/src/test/results/clientpositive/spark/bucket_map_join_1.q.out b/ql/src/test/results/clientpositive/spark/bucket_map_join_1.q.out
index b57ba19..02dae94 100644
--- a/ql/src/test/results/clientpositive/spark/bucket_map_join_1.q.out
+++ b/ql/src/test/results/clientpositive/spark/bucket_map_join_1.q.out
@@ -62,12 +62,12 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: b
- Statistics: Num rows: 1 Data size: 21 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 210 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: (key is not null and value is not null) (type: boolean)
- Statistics: Num rows: 1 Data size: 21 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 210 Basic stats: COMPLETE Column stats: NONE
Spark HashTable Sink Operator
keys:
0 key (type: string), value (type: string)
@@ -139,12 +139,12 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: a
- Statistics: Num rows: 1 Data size: 20 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 200 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: (key is not null and value is not null) (type: boolean)
- Statistics: Num rows: 1 Data size: 20 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 200 Basic stats: COMPLETE Column stats: NONE
Map Join Operator
condition map:
Inner Join 0 to 1
@@ -154,7 +154,7 @@ STAGE PLANS:
input vertices:
1 Map 3
Position of Big Table: 0
- Statistics: Num rows: 1 Data size: 22 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 220 Basic stats: COMPLETE Column stats: NONE
Group By Operator
aggregations: count()
mode: hash
http://git-wip-us.apache.org/repos/asf/hive/blob/e26b9325/ql/src/test/results/clientpositive/spark/bucket_map_join_2.q.out
----------------------------------------------------------------------
diff --git a/ql/src/test/results/clientpositive/spark/bucket_map_join_2.q.out b/ql/src/test/results/clientpositive/spark/bucket_map_join_2.q.out
index 4b8f985..4380869 100644
--- a/ql/src/test/results/clientpositive/spark/bucket_map_join_2.q.out
+++ b/ql/src/test/results/clientpositive/spark/bucket_map_join_2.q.out
@@ -62,12 +62,12 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: b
- Statistics: Num rows: 1 Data size: 21 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 210 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: (key is not null and value is not null) (type: boolean)
- Statistics: Num rows: 1 Data size: 21 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 210 Basic stats: COMPLETE Column stats: NONE
Spark HashTable Sink Operator
keys:
0 key (type: string), value (type: string)
@@ -139,12 +139,12 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: a
- Statistics: Num rows: 1 Data size: 20 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 200 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: (key is not null and value is not null) (type: boolean)
- Statistics: Num rows: 1 Data size: 20 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 200 Basic stats: COMPLETE Column stats: NONE
Map Join Operator
condition map:
Inner Join 0 to 1
@@ -154,7 +154,7 @@ STAGE PLANS:
input vertices:
1 Map 3
Position of Big Table: 0
- Statistics: Num rows: 1 Data size: 22 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 220 Basic stats: COMPLETE Column stats: NONE
Group By Operator
aggregations: count()
mode: hash
http://git-wip-us.apache.org/repos/asf/hive/blob/e26b9325/ql/src/test/results/clientpositive/spark/bucketmapjoin1.q.out
----------------------------------------------------------------------
diff --git a/ql/src/test/results/clientpositive/spark/bucketmapjoin1.q.out b/ql/src/test/results/clientpositive/spark/bucketmapjoin1.q.out
index d6e45d5..bec0451 100644
--- a/ql/src/test/results/clientpositive/spark/bucketmapjoin1.q.out
+++ b/ql/src/test/results/clientpositive/spark/bucketmapjoin1.q.out
@@ -390,22 +390,22 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: a
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: key is not null (type: boolean)
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
Select Operator
expressions: key (type: int), value (type: string)
outputColumnNames: _col0, _col1
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
Reduce Output Operator
key expressions: _col0 (type: int)
null sort order: a
sort order: +
Map-reduce partition columns: _col0 (type: int)
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
tag: 0
value expressions: _col1 (type: string)
auto parallelism: false
@@ -746,22 +746,22 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: a
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: key is not null (type: boolean)
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
Select Operator
expressions: key (type: int), value (type: string)
outputColumnNames: _col0, _col1
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
Reduce Output Operator
key expressions: _col0 (type: int)
null sort order: a
sort order: +
Map-reduce partition columns: _col0 (type: int)
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
tag: 0
value expressions: _col1 (type: string)
auto parallelism: false
http://git-wip-us.apache.org/repos/asf/hive/blob/e26b9325/ql/src/test/results/clientpositive/spark/bucketmapjoin4.q.out
----------------------------------------------------------------------
diff --git a/ql/src/test/results/clientpositive/spark/bucketmapjoin4.q.out b/ql/src/test/results/clientpositive/spark/bucketmapjoin4.q.out
index 0c6c2c7..2b384b7 100644
--- a/ql/src/test/results/clientpositive/spark/bucketmapjoin4.q.out
+++ b/ql/src/test/results/clientpositive/spark/bucketmapjoin4.q.out
@@ -140,22 +140,22 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: a
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: key is not null (type: boolean)
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
Select Operator
expressions: key (type: int), value (type: string)
outputColumnNames: _col0, _col1
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
Reduce Output Operator
key expressions: _col0 (type: int)
null sort order: a
sort order: +
Map-reduce partition columns: _col0 (type: int)
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
tag: 0
value expressions: _col1 (type: string)
auto parallelism: false
@@ -214,22 +214,22 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: b
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: key is not null (type: boolean)
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
Select Operator
expressions: key (type: int), value (type: string)
outputColumnNames: _col0, _col1
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
Reduce Output Operator
key expressions: _col0 (type: int)
null sort order: a
sort order: +
Map-reduce partition columns: _col0 (type: int)
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
tag: 1
value expressions: _col1 (type: string)
auto parallelism: false
@@ -294,17 +294,17 @@ STAGE PLANS:
0 _col0 (type: int)
1 _col0 (type: int)
outputColumnNames: _col0, _col1, _col3
- Statistics: Num rows: 1 Data size: 3025 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 30250 Basic stats: COMPLETE Column stats: NONE
Select Operator
expressions: UDFToString(_col0) (type: string), _col1 (type: string), _col3 (type: string)
outputColumnNames: _col0, _col1, _col2
- Statistics: Num rows: 1 Data size: 3025 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 30250 Basic stats: COMPLETE Column stats: NONE
File Output Operator
compressed: false
GlobalTableId: 1
#### A masked pattern was here ####
NumFilesPerFileSink: 1
- Statistics: Num rows: 1 Data size: 3025 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 30250 Basic stats: COMPLETE Column stats: NONE
#### A masked pattern was here ####
table:
input format: org.apache.hadoop.mapred.TextInputFormat
@@ -486,22 +486,22 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: a
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: key is not null (type: boolean)
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
Select Operator
expressions: key (type: int), value (type: string)
outputColumnNames: _col0, _col1
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
Reduce Output Operator
key expressions: _col0 (type: int)
null sort order: a
sort order: +
Map-reduce partition columns: _col0 (type: int)
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
tag: 0
value expressions: _col1 (type: string)
auto parallelism: false
@@ -560,22 +560,22 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: b
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: key is not null (type: boolean)
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
Select Operator
expressions: key (type: int), value (type: string)
outputColumnNames: _col0, _col1
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
Reduce Output Operator
key expressions: _col0 (type: int)
null sort order: a
sort order: +
Map-reduce partition columns: _col0 (type: int)
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
tag: 1
value expressions: _col1 (type: string)
auto parallelism: false
@@ -640,17 +640,17 @@ STAGE PLANS:
0 _col0 (type: int)
1 _col0 (type: int)
outputColumnNames: _col0, _col1, _col3
- Statistics: Num rows: 1 Data size: 3025 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 30250 Basic stats: COMPLETE Column stats: NONE
Select Operator
expressions: UDFToString(_col0) (type: string), _col1 (type: string), _col3 (type: string)
outputColumnNames: _col0, _col1, _col2
- Statistics: Num rows: 1 Data size: 3025 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 30250 Basic stats: COMPLETE Column stats: NONE
File Output Operator
compressed: false
GlobalTableId: 1
#### A masked pattern was here ####
NumFilesPerFileSink: 1
- Statistics: Num rows: 1 Data size: 3025 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 30250 Basic stats: COMPLETE Column stats: NONE
#### A masked pattern was here ####
table:
input format: org.apache.hadoop.mapred.TextInputFormat
http://git-wip-us.apache.org/repos/asf/hive/blob/e26b9325/ql/src/test/results/clientpositive/spark/bucketmapjoin5.q.out
----------------------------------------------------------------------
diff --git a/ql/src/test/results/clientpositive/spark/bucketmapjoin5.q.out b/ql/src/test/results/clientpositive/spark/bucketmapjoin5.q.out
index f7344de..93843ad 100644
--- a/ql/src/test/results/clientpositive/spark/bucketmapjoin5.q.out
+++ b/ql/src/test/results/clientpositive/spark/bucketmapjoin5.q.out
@@ -189,12 +189,12 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: a
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: key is not null (type: boolean)
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
Spark HashTable Sink Operator
keys:
0 key (type: int)
@@ -596,12 +596,12 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: a
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: key is not null (type: boolean)
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
Spark HashTable Sink Operator
keys:
0 key (type: int)
http://git-wip-us.apache.org/repos/asf/hive/blob/e26b9325/ql/src/test/results/clientpositive/spark/bucketmapjoin_negative.q.out
----------------------------------------------------------------------
diff --git a/ql/src/test/results/clientpositive/spark/bucketmapjoin_negative.q.out b/ql/src/test/results/clientpositive/spark/bucketmapjoin_negative.q.out
index dfba4ef..abf3c91 100644
--- a/ql/src/test/results/clientpositive/spark/bucketmapjoin_negative.q.out
+++ b/ql/src/test/results/clientpositive/spark/bucketmapjoin_negative.q.out
@@ -165,12 +165,12 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: a
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: key is not null (type: boolean)
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
Map Join Operator
condition map:
Inner Join 0 to 1
http://git-wip-us.apache.org/repos/asf/hive/blob/e26b9325/ql/src/test/results/clientpositive/spark/bucketmapjoin_negative2.q.out
----------------------------------------------------------------------
diff --git a/ql/src/test/results/clientpositive/spark/bucketmapjoin_negative2.q.out b/ql/src/test/results/clientpositive/spark/bucketmapjoin_negative2.q.out
index 0504a43..9fd2c72 100644
--- a/ql/src/test/results/clientpositive/spark/bucketmapjoin_negative2.q.out
+++ b/ql/src/test/results/clientpositive/spark/bucketmapjoin_negative2.q.out
@@ -228,12 +228,12 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: a
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: key is not null (type: boolean)
- Statistics: Num rows: 1 Data size: 2750 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 27500 Basic stats: COMPLETE Column stats: NONE
Map Join Operator
condition map:
Inner Join 0 to 1
http://git-wip-us.apache.org/repos/asf/hive/blob/e26b9325/ql/src/test/results/clientpositive/spark/bucketmapjoin_negative3.q.out
----------------------------------------------------------------------
diff --git a/ql/src/test/results/clientpositive/spark/bucketmapjoin_negative3.q.out b/ql/src/test/results/clientpositive/spark/bucketmapjoin_negative3.q.out
index 2fa0214..c1341d9 100644
--- a/ql/src/test/results/clientpositive/spark/bucketmapjoin_negative3.q.out
+++ b/ql/src/test/results/clientpositive/spark/bucketmapjoin_negative3.q.out
@@ -160,12 +160,12 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: r
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: (key is not null and value is not null) (type: boolean)
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
Spark HashTable Sink Operator
keys:
0 key (type: string), value (type: string)
@@ -240,12 +240,12 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: l
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: (key is not null and value is not null) (type: boolean)
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
Map Join Operator
condition map:
Inner Join 0 to 1
@@ -256,18 +256,18 @@ STAGE PLANS:
input vertices:
1 Map 2
Position of Big Table: 0
- Statistics: Num rows: 1 Data size: 4620 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 46200 Basic stats: COMPLETE Column stats: NONE
BucketMapJoin: true
Select Operator
expressions: _col0 (type: string), _col1 (type: string), _col5 (type: string), _col6 (type: string)
outputColumnNames: _col0, _col1, _col2, _col3
- Statistics: Num rows: 1 Data size: 4620 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 46200 Basic stats: COMPLETE Column stats: NONE
File Output Operator
compressed: false
GlobalTableId: 0
#### A masked pattern was here ####
NumFilesPerFileSink: 1
- Statistics: Num rows: 1 Data size: 4620 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 46200 Basic stats: COMPLETE Column stats: NONE
#### A masked pattern was here ####
table:
input format: org.apache.hadoop.mapred.SequenceFileInputFormat
@@ -369,12 +369,12 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: r
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: (key is not null and value is not null) (type: boolean)
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
Spark HashTable Sink Operator
keys:
0 key (type: string), value (type: string)
@@ -449,12 +449,12 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: l
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: (key is not null and value is not null) (type: boolean)
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
Map Join Operator
condition map:
Inner Join 0 to 1
@@ -465,18 +465,18 @@ STAGE PLANS:
input vertices:
1 Map 2
Position of Big Table: 0
- Statistics: Num rows: 1 Data size: 4620 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 46200 Basic stats: COMPLETE Column stats: NONE
BucketMapJoin: true
Select Operator
expressions: _col0 (type: string), _col1 (type: string), _col5 (type: string), _col6 (type: string)
outputColumnNames: _col0, _col1, _col2, _col3
- Statistics: Num rows: 1 Data size: 4620 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 46200 Basic stats: COMPLETE Column stats: NONE
File Output Operator
compressed: false
GlobalTableId: 0
#### A masked pattern was here ####
NumFilesPerFileSink: 1
- Statistics: Num rows: 1 Data size: 4620 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 46200 Basic stats: COMPLETE Column stats: NONE
#### A masked pattern was here ####
table:
input format: org.apache.hadoop.mapred.SequenceFileInputFormat
@@ -578,12 +578,12 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: r
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: UDFToDouble(key) is not null (type: boolean)
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
Spark HashTable Sink Operator
keys:
0 (key + key) (type: double)
@@ -653,12 +653,12 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: l
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: (key + key) is not null (type: boolean)
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
Map Join Operator
condition map:
Inner Join 0 to 1
@@ -669,17 +669,17 @@ STAGE PLANS:
input vertices:
1 Map 2
Position of Big Table: 0
- Statistics: Num rows: 1 Data size: 4620 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 46200 Basic stats: COMPLETE Column stats: NONE
Select Operator
expressions: _col0 (type: string), _col1 (type: string), _col5 (type: string), _col6 (type: string)
outputColumnNames: _col0, _col1, _col2, _col3
- Statistics: Num rows: 1 Data size: 4620 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 46200 Basic stats: COMPLETE Column stats: NONE
File Output Operator
compressed: false
GlobalTableId: 0
#### A masked pattern was here ####
NumFilesPerFileSink: 1
- Statistics: Num rows: 1 Data size: 4620 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 46200 Basic stats: COMPLETE Column stats: NONE
#### A masked pattern was here ####
table:
input format: org.apache.hadoop.mapred.SequenceFileInputFormat
@@ -776,12 +776,12 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: r
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: (key is not null and value is not null) (type: boolean)
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
Spark HashTable Sink Operator
keys:
0 key (type: string), value (type: string)
@@ -851,12 +851,12 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: l
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: (key is not null and value is not null) (type: boolean)
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
Map Join Operator
condition map:
Inner Join 0 to 1
@@ -867,17 +867,17 @@ STAGE PLANS:
input vertices:
1 Map 2
Position of Big Table: 0
- Statistics: Num rows: 1 Data size: 4620 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 46200 Basic stats: COMPLETE Column stats: NONE
Select Operator
expressions: _col0 (type: string), _col1 (type: string), _col5 (type: string), _col6 (type: string)
outputColumnNames: _col0, _col1, _col2, _col3
- Statistics: Num rows: 1 Data size: 4620 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 46200 Basic stats: COMPLETE Column stats: NONE
File Output Operator
compressed: false
GlobalTableId: 0
#### A masked pattern was here ####
NumFilesPerFileSink: 1
- Statistics: Num rows: 1 Data size: 4620 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 46200 Basic stats: COMPLETE Column stats: NONE
#### A masked pattern was here ####
table:
input format: org.apache.hadoop.mapred.SequenceFileInputFormat
@@ -974,12 +974,12 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: r
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: (key is not null and value is not null) (type: boolean)
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
Spark HashTable Sink Operator
keys:
0 key (type: string), value (type: string)
@@ -1049,12 +1049,12 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: l
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: (key is not null and value is not null) (type: boolean)
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
Map Join Operator
condition map:
Inner Join 0 to 1
@@ -1065,17 +1065,17 @@ STAGE PLANS:
input vertices:
1 Map 2
Position of Big Table: 0
- Statistics: Num rows: 1 Data size: 4620 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 46200 Basic stats: COMPLETE Column stats: NONE
Select Operator
expressions: _col0 (type: string), _col1 (type: string), _col5 (type: string), _col6 (type: string)
outputColumnNames: _col0, _col1, _col2, _col3
- Statistics: Num rows: 1 Data size: 4620 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 46200 Basic stats: COMPLETE Column stats: NONE
File Output Operator
compressed: false
GlobalTableId: 0
#### A masked pattern was here ####
NumFilesPerFileSink: 1
- Statistics: Num rows: 1 Data size: 4620 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 46200 Basic stats: COMPLETE Column stats: NONE
#### A masked pattern was here ####
table:
input format: org.apache.hadoop.mapred.SequenceFileInputFormat
@@ -1172,12 +1172,12 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: r
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: (key is not null and value is not null) (type: boolean)
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
Spark HashTable Sink Operator
keys:
0 key (type: string), value (type: string)
@@ -1247,12 +1247,12 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: l
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: (key is not null and value is not null) (type: boolean)
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
Map Join Operator
condition map:
Inner Join 0 to 1
@@ -1263,17 +1263,17 @@ STAGE PLANS:
input vertices:
1 Map 2
Position of Big Table: 0
- Statistics: Num rows: 1 Data size: 4620 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 46200 Basic stats: COMPLETE Column stats: NONE
Select Operator
expressions: _col0 (type: string), _col1 (type: string), _col5 (type: string), _col6 (type: string)
outputColumnNames: _col0, _col1, _col2, _col3
- Statistics: Num rows: 1 Data size: 4620 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 46200 Basic stats: COMPLETE Column stats: NONE
File Output Operator
compressed: false
GlobalTableId: 0
#### A masked pattern was here ####
NumFilesPerFileSink: 1
- Statistics: Num rows: 1 Data size: 4620 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 46200 Basic stats: COMPLETE Column stats: NONE
#### A masked pattern was here ####
table:
input format: org.apache.hadoop.mapred.SequenceFileInputFormat
@@ -1370,12 +1370,12 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: r
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: (key is not null and value is not null) (type: boolean)
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
Spark HashTable Sink Operator
keys:
0 key (type: string), value (type: string)
@@ -1445,12 +1445,12 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: l
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: (key is not null and value is not null) (type: boolean)
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
Map Join Operator
condition map:
Inner Join 0 to 1
@@ -1461,17 +1461,17 @@ STAGE PLANS:
input vertices:
1 Map 2
Position of Big Table: 0
- Statistics: Num rows: 1 Data size: 4620 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 46200 Basic stats: COMPLETE Column stats: NONE
Select Operator
expressions: _col0 (type: string), _col1 (type: string), _col5 (type: string), _col6 (type: string)
outputColumnNames: _col0, _col1, _col2, _col3
- Statistics: Num rows: 1 Data size: 4620 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 46200 Basic stats: COMPLETE Column stats: NONE
File Output Operator
compressed: false
GlobalTableId: 0
#### A masked pattern was here ####
NumFilesPerFileSink: 1
- Statistics: Num rows: 1 Data size: 4620 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 46200 Basic stats: COMPLETE Column stats: NONE
#### A masked pattern was here ####
table:
input format: org.apache.hadoop.mapred.SequenceFileInputFormat
@@ -1568,12 +1568,12 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: r
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: (key is not null and value is not null) (type: boolean)
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
Spark HashTable Sink Operator
keys:
0 key (type: string), value (type: string)
@@ -1643,12 +1643,12 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: l
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: (key is not null and value is not null) (type: boolean)
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
Map Join Operator
condition map:
Inner Join 0 to 1
@@ -1659,17 +1659,17 @@ STAGE PLANS:
input vertices:
1 Map 2
Position of Big Table: 0
- Statistics: Num rows: 1 Data size: 4620 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 46200 Basic stats: COMPLETE Column stats: NONE
Select Operator
expressions: _col0 (type: string), _col1 (type: string), _col5 (type: string), _col6 (type: string)
outputColumnNames: _col0, _col1, _col2, _col3
- Statistics: Num rows: 1 Data size: 4620 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 46200 Basic stats: COMPLETE Column stats: NONE
File Output Operator
compressed: false
GlobalTableId: 0
#### A masked pattern was here ####
NumFilesPerFileSink: 1
- Statistics: Num rows: 1 Data size: 4620 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 46200 Basic stats: COMPLETE Column stats: NONE
#### A masked pattern was here ####
table:
input format: org.apache.hadoop.mapred.SequenceFileInputFormat
@@ -1766,12 +1766,12 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: r
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: (key is not null and value is not null) (type: boolean)
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
Spark HashTable Sink Operator
keys:
0 key (type: string), value (type: string)
@@ -1841,12 +1841,12 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: l
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
GatherStats: false
Filter Operator
isSamplingPred: false
predicate: (key is not null and value is not null) (type: boolean)
- Statistics: Num rows: 1 Data size: 4200 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 42000 Basic stats: COMPLETE Column stats: NONE
Map Join Operator
condition map:
Inner Join 0 to 1
@@ -1857,17 +1857,17 @@ STAGE PLANS:
input vertices:
1 Map 2
Position of Big Table: 0
- Statistics: Num rows: 1 Data size: 4620 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 46200 Basic stats: COMPLETE Column stats: NONE
Select Operator
expressions: _col0 (type: string), _col1 (type: string), _col5 (type: string), _col6 (type: string)
outputColumnNames: _col0, _col1, _col2, _col3
- Statistics: Num rows: 1 Data size: 4620 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 46200 Basic stats: COMPLETE Column stats: NONE
File Output Operator
compressed: false
GlobalTableId: 0
#### A masked pattern was here ####
NumFilesPerFileSink: 1
- Statistics: Num rows: 1 Data size: 4620 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 46200 Basic stats: COMPLETE Column stats: NONE
#### A masked pattern was here ####
table:
input format: org.apache.hadoop.mapred.SequenceFileInputFormat
http://git-wip-us.apache.org/repos/asf/hive/blob/e26b9325/ql/src/test/results/clientpositive/spark/column_access_stats.q.out
----------------------------------------------------------------------
diff --git a/ql/src/test/results/clientpositive/spark/column_access_stats.q.out b/ql/src/test/results/clientpositive/spark/column_access_stats.q.out
index 0fdef11..48574b9 100644
--- a/ql/src/test/results/clientpositive/spark/column_access_stats.q.out
+++ b/ql/src/test/results/clientpositive/spark/column_access_stats.q.out
@@ -181,14 +181,14 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: t1
- Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 300 Basic stats: COMPLETE Column stats: NONE
Select Operator
expressions: key (type: string)
outputColumnNames: _col0
- Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 300 Basic stats: COMPLETE Column stats: NONE
File Output Operator
compressed: false
- Statistics: Num rows: 2 Data size: 60 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 2 Data size: 600 Basic stats: COMPLETE Column stats: NONE
table:
input format: org.apache.hadoop.mapred.SequenceFileInputFormat
output format: org.apache.hadoop.hive.ql.io.HiveSequenceFileOutputFormat
@@ -197,14 +197,14 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: t1
- Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 300 Basic stats: COMPLETE Column stats: NONE
Select Operator
expressions: val (type: string)
outputColumnNames: _col0
- Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 300 Basic stats: COMPLETE Column stats: NONE
File Output Operator
compressed: false
- Statistics: Num rows: 2 Data size: 60 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 2 Data size: 600 Basic stats: COMPLETE Column stats: NONE
table:
input format: org.apache.hadoop.mapred.SequenceFileInputFormat
output format: org.apache.hadoop.hive.ql.io.HiveSequenceFileOutputFormat
@@ -259,14 +259,14 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: t1
- Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 300 Basic stats: COMPLETE Column stats: NONE
Select Operator
expressions: key (type: string)
outputColumnNames: _col0
- Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 300 Basic stats: COMPLETE Column stats: NONE
File Output Operator
compressed: false
- Statistics: Num rows: 2 Data size: 60 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 2 Data size: 600 Basic stats: COMPLETE Column stats: NONE
table:
input format: org.apache.hadoop.mapred.SequenceFileInputFormat
output format: org.apache.hadoop.hive.ql.io.HiveSequenceFileOutputFormat
@@ -275,14 +275,14 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: t1
- Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 300 Basic stats: COMPLETE Column stats: NONE
Select Operator
expressions: key (type: string)
outputColumnNames: _col0
- Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 300 Basic stats: COMPLETE Column stats: NONE
File Output Operator
compressed: false
- Statistics: Num rows: 2 Data size: 60 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 2 Data size: 600 Basic stats: COMPLETE Column stats: NONE
table:
input format: org.apache.hadoop.mapred.SequenceFileInputFormat
output format: org.apache.hadoop.hive.ql.io.HiveSequenceFileOutputFormat
@@ -366,19 +366,19 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: t1
- Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 300 Basic stats: COMPLETE Column stats: NONE
Filter Operator
predicate: key is not null (type: boolean)
- Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 300 Basic stats: COMPLETE Column stats: NONE
Select Operator
expressions: key (type: string)
outputColumnNames: _col0
- Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 300 Basic stats: COMPLETE Column stats: NONE
Reduce Output Operator
key expressions: _col0 (type: string)
sort order: +
Map-reduce partition columns: _col0 (type: string)
- Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 300 Basic stats: COMPLETE Column stats: NONE
Map 3
Map Operator Tree:
TableScan
@@ -491,19 +491,19 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: t1
- Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 300 Basic stats: COMPLETE Column stats: NONE
Filter Operator
predicate: ((UDFToDouble(val) = 3.0) and key is not null) (type: boolean)
- Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 300 Basic stats: COMPLETE Column stats: NONE
Select Operator
expressions: key (type: string), val (type: string)
outputColumnNames: _col0, _col1
- Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 300 Basic stats: COMPLETE Column stats: NONE
Reduce Output Operator
key expressions: _col0 (type: string)
sort order: +
Map-reduce partition columns: _col0 (type: string)
- Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 300 Basic stats: COMPLETE Column stats: NONE
value expressions: _col1 (type: string)
Map 3
Map Operator Tree:
@@ -587,19 +587,19 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: t1
- Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 300 Basic stats: COMPLETE Column stats: NONE
Filter Operator
predicate: ((UDFToDouble(key) = 5.0) and val is not null) (type: boolean)
- Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 300 Basic stats: COMPLETE Column stats: NONE
Select Operator
expressions: val (type: string)
outputColumnNames: _col0
- Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 300 Basic stats: COMPLETE Column stats: NONE
Reduce Output Operator
key expressions: _col0 (type: string)
sort order: +
Map-reduce partition columns: _col0 (type: string)
- Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 300 Basic stats: COMPLETE Column stats: NONE
Map 3
Map Operator Tree:
TableScan
@@ -712,19 +712,19 @@ STAGE PLANS:
Map Operator Tree:
TableScan
alias: t1
- Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 300 Basic stats: COMPLETE Column stats: NONE
Filter Operator
predicate: key is not null (type: boolean)
- Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 300 Basic stats: COMPLETE Column stats: NONE
Select Operator
expressions: key (type: string)
outputColumnNames: _col0
- Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 300 Basic stats: COMPLETE Column stats: NONE
Reduce Output Operator
key expressions: _col0 (type: string)
sort order: +
Map-reduce partition columns: _col0 (type: string)
- Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE
+ Statistics: Num rows: 1 Data size: 300 Basic stats: COMPLETE Column stats: NONE
Map 5
Map Operator Tree:
TableScan