You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/03/30 08:58:00 UTC
[jira] [Updated] (HIVE-26088) a left join b left join c when b is null ,then b join c is null , in this case when set hive.auto.convert.join=true ; map join is not used ,because nullpointexecption
[ https://issues.apache.org/jira/browse/HIVE-26088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
ASF GitHub Bot updated HIVE-26088:
----------------------------------
Labels: pull-request-available (was: )
> a left join b left join c when b is null ,then b join c is null , in this case when set hive.auto.convert.join=true ; map join is not used ,because nullpointexecption
> ----------------------------------------------------------------------------------------------------------------------------------------------------------------------
>
> Key: HIVE-26088
> URL: https://issues.apache.org/jira/browse/HIVE-26088
> Project: Hive
> Issue Type: Bug
> Affects Versions: 3.1.2
> Reporter: lkl
> Priority: Major
> Labels: pull-request-available
> Time Spent: 10m
> Remaining Estimate: 0h
>
> create database pr;
> use pr;
>
> create table t1(id int,name string) PARTITIONED BY (cls string) stored as orc;
> create table t2(id int,name string) PARTITIONED BY (cls string) stored as orc;
> create table t3(id int,name string) PARTITIONED BY (cls string) stored as orc;
> insert into table t1 partition(cls=1) values(1,"zzz");
> insert into table t3 partition(cls=1) values(1,"zzz");
>
> sql:
> select y1.id,y1.name,nvl(y2.id,"其他") as y2id ,nvl(y2.name,"其他") as y2name from
> (select t1.id,t1.name from t1) y1
> left join
> (
> select a.id,a.name ,b.id as bid,b.name as bname from
> (select t2.id,t2.name from t2) a
> left join
> (select t3.id,t3.name from t3) b
> on a.id = b.id
> )y2
> on y1.id=y2.id;
>
> log msg:
> No Stats for mapjoin@t1, Columns: name, id
> No Stats for mapjoin@t3, Columns: id
> Query ID = hadoop_20220330164204_2a822fd5-7630-43f9-929a-b95086df04bf
> Total jobs = 3
> 2022-03-30 16:42:11 Starting to launch local task to process map join; maximum memory = 3817865216
> 2022-03-30 16:42:12 Uploaded 1 File to: file:/data/emr/hive/tmp/23201dda-9c15-4d6e-9904-12ae28d4dccb/hive_2022-03-30_16-42-04_837_944561186926371473-2/-local-10007/HashTable-Stage-6/MapJoin-mapfile30--.hashtable (278 bytes)
> Execution completed successfully
> MapredLocal task succeeded
> Launching Job 1 out of 3
> Number of reduce tasks is set to 0 since there's no reduce operator
> Starting Job = job_1646126523394_0063, Tracking URL = http://172.21.64.46:5004/proxy/application_1646126523394_0063/
> Kill Command = /usr/local/service/hadoop/bin/mapred job -kill job_1646126523394_0063
> Hadoop job information for Stage-6: number of mappers: 0; number of reducers: 0
> 2022-03-30 16:42:18,781 Stage-6 map = 0%, reduce = 0%
> Ended Job = job_1646126523394_0063
> Stage-7 is filtered out by condition resolver.
> Stage-1 is selected by condition resolver.
> Launching Job 2 out of 3
> Number of reduce tasks not specified. Estimated from input data size: 1
> In order to change the average load for a reducer (in bytes):
> set hive.exec.reducers.bytes.per.reducer=<number>
> In order to limit the maximum number of reducers:
> set hive.exec.reducers.max=<number>
> In order to set a constant number of reducers:
> set mapreduce.job.reduces=<number>
> Starting Job = job_1646126523394_0064, Tracking URL = http://172.21.64.46:5004/proxy/application_1646126523394_0064/
> Kill Command = /usr/local/service/hadoop/bin/mapred job -kill job_1646126523394_0064
> Hadoop job information for Stage-1: number of mappers: 2; number of reducers: 1
> 2022-03-30 16:42:26,673 Stage-1 map = 0%, reduce = 0%
> 2022-03-30 16:42:30,768 Stage-1 map = 50%, reduce = 0%, Cumulative CPU 1.39 sec
> 2022-03-30 16:42:31,792 Stage-1 map = 100%, reduce = 0%, Cumulative CPU 2.92 sec
> 2022-03-30 16:42:36,886 Stage-1 map = 100%, reduce = 100%, Cumulative CPU 4.83 sec
> MapReduce Total cumulative CPU time: 4 seconds 830 msec
> Ended Job = job_1646126523394_0064
> MapReduce Jobs Launched:
> Stage-Stage-6: HDFS Read: 0 HDFS Write: 0 SUCCESS
> Stage-Stage-1: Map: 2 Reduce: 1 Cumulative CPU: 4.83 sec HDFS Read: 15362 HDFS Write: 131 SUCCESS
> Total MapReduce CPU Time Spent: 4 seconds 830 msec
> OK
> 1 zzz 其他 其他
> Time taken: 33.102 seconds, Fetched: 1 row(s)
>
>
>
--
This message was sent by Atlassian Jira
(v8.20.1#820001)