You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@hive.apache.org by Xiaoyong Zhu <xi...@microsoft.com> on 2014/12/16 07:55:03 UTC

relationship between a hive query and a tez dag

Hi experts

I know that Hive could be run on Tez in a DAG "format" - if I have a complex Hive script which is consisted of several Hive queries, will it be translated to 1 Tez DAG? Or several Tez DAGs?

Thanks!

Xiaoyong


RE: relationship between a hive query and a tez dag

Posted by Xiaoyong Zhu <xi...@microsoft.com>.
Sorry to update this again - but why don't we do a cross query optimization and make the query into 1 DAG (if all the queries in a certain script are linked with each other).. this seems a more optimized way..

Xiaoyong

From: Xiaoyong Zhu [mailto:xiaoyzhu@microsoft.com]
Sent: Thursday, December 18, 2014 8:08 AM
To: Alan Gates
Cc: user@hive.apache.org
Subject: RE: relationship between a hive query and a tez dag

I see, thanks for the explaination!

Xiaoyong

From: Alan Gates [mailto:gates@hortonworks.com]
Sent: Thursday, December 18, 2014 8:05 AM
To: Xiaoyong Zhu
Cc: user@hive.apache.org<ma...@hive.apache.org>
Subject: Re: relationship between a hive query and a tez dag

DDL operations are done from the client, not as part of a Tez DAG.  There still is no linkage between the DAGs even if they access the same tables or somehow depend on each other.  Hive manages making sure queries are executed in the order given.  Tez just executed each query.

Alan.

[cid:image001.jpg@01D01B99.9B4DD560]
Xiaoyong Zhu<ma...@microsoft.com>
December 16, 2014 at 15:41
Thanks Alan! Then what if different hive queries have some links (e.g. query 1 creates table abc and query 2 reads data from abc) will this be in 1 DAG or 2 DAGs? If 2, then is there any link/relatioinship between the 2 DAGs?

Xiaoyong

From: Alan Gates [mailto:gates@hortonworks.com]
Sent: Wednesday, December 17, 2014 5:27 AM
To: user@hive.apache.org<ma...@hive.apache.org>
Subject: Re: relationship between a hive query and a tez dag

Several.  Each Hive query becomes one Tez DAG.

Alan.



--
Sent with Postbox<http://www.getpostbox.com>

CONFIDENTIALITY NOTICE
NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
[cid:image002.jpg@01D01B99.9B4DD560]
Alan Gates<ma...@hortonworks.com>
December 16, 2014 at 13:26
Several.  Each Hive query becomes one Tez DAG.

Alan.

--
Sent with Postbox<http://www.getpostbox.com>

CONFIDENTIALITY NOTICE
NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.

RE: relationship between a hive query and a tez dag

Posted by Xiaoyong Zhu <xi...@microsoft.com>.
I see, thanks for the explaination!

Xiaoyong

From: Alan Gates [mailto:gates@hortonworks.com]
Sent: Thursday, December 18, 2014 8:05 AM
To: Xiaoyong Zhu
Cc: user@hive.apache.org
Subject: Re: relationship between a hive query and a tez dag

DDL operations are done from the client, not as part of a Tez DAG.  There still is no linkage between the DAGs even if they access the same tables or somehow depend on each other.  Hive manages making sure queries are executed in the order given.  Tez just executed each query.

Alan.


[cid:image001.jpg@01D01A99.C2B8BDF0]
Xiaoyong Zhu<ma...@microsoft.com>
December 16, 2014 at 15:41
Thanks Alan! Then what if different hive queries have some links (e.g. query 1 creates table abc and query 2 reads data from abc) will this be in 1 DAG or 2 DAGs? If 2, then is there any link/relatioinship between the 2 DAGs?

Xiaoyong

From: Alan Gates [mailto:gates@hortonworks.com]
Sent: Wednesday, December 17, 2014 5:27 AM
To: user@hive.apache.org<ma...@hive.apache.org>
Subject: Re: relationship between a hive query and a tez dag

Several.  Each Hive query becomes one Tez DAG.

Alan.




--
Sent with Postbox<http://www.getpostbox.com>

CONFIDENTIALITY NOTICE
NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
[cid:image002.jpg@01D01A99.C2B8BDF0]
Alan Gates<ma...@hortonworks.com>
December 16, 2014 at 13:26
Several.  Each Hive query becomes one Tez DAG.

Alan.


--
Sent with Postbox<http://www.getpostbox.com>

CONFIDENTIALITY NOTICE
NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.

Re: relationship between a hive query and a tez dag

Posted by Alan Gates <ga...@hortonworks.com>.
Yes, that is correct.

Alan.

> Xiaoyong Zhu <ma...@microsoft.com>
> January 21, 2015 at 17:51
>
> Thanks Alan and just for double check: if I have the following script:
>
> 1)Create table tbl1, tbl2;
>
> 2)Select count(*) from tbl1 group by A;
>
> 3)Select count(*) from tbl2 groub by B;
>
> 4)Create table tb3;
>
> And this script will be executed exactly in 1 >2 > 3 > 4 order even 2 
> and 3 do not have relationship with each other and 4 does not have 
> relation with all query 1,2,3?
>
> Xiaoyong
>
> *From:*Alan Gates [mailto:gates@hortonworks.com]
> *Sent:* Thursday, December 18, 2014 8:05 AM
> *To:* Xiaoyong Zhu
> *Cc:* user@hive.apache.org
> *Subject:* Re: relationship between a hive query and a tez dag
>
> DDL operations are done from the client, not as part of a Tez DAG.  
> There still is no linkage between the DAGs even if they access the 
> same tables or somehow depend on each other.  Hive manages making sure 
> queries are executed in the order given.  Tez just executed each query.
>
> Alan.
>
>
> -- 
>
> Sent with Postbox <http://www.getpostbox.com>
>
>
> CONFIDENTIALITY NOTICE
> NOTICE: This message is intended for the use of the individual or 
> entity to which it is addressed and may contain information that is 
> confidential, privileged and exempt from disclosure under applicable 
> law. If the reader of this message is not the intended recipient, you 
> are hereby notified that any printing, copying, dissemination, 
> distribution, disclosure or forwarding of this communication is 
> strictly prohibited. If you have received this communication in error, 
> please contact the sender immediately and delete it from your system. 
> Thank You.
>
> Alan Gates <ma...@hortonworks.com>
> December 17, 2014 at 16:04
> DDL operations are done from the client, not as part of a Tez DAG.  
> There still is no linkage between the DAGs even if they access the 
> same tables or somehow depend on each other.  Hive manages making sure 
> queries are executed in the order given.  Tez just executed each query.
>
> Alan.
>
>
> Xiaoyong Zhu <ma...@microsoft.com>
> December 16, 2014 at 15:41
>
> Thanks Alan! Then what if different hive queries have some links (e.g. 
> query 1 creates table abc and query 2 reads data from abc) will this 
> be in 1 DAG or 2 DAGs? If 2, then is there any link/relatioinship 
> between the 2 DAGs?
>
> Xiaoyong
>
> *From:*Alan Gates [mailto:gates@hortonworks.com]
> *Sent:* Wednesday, December 17, 2014 5:27 AM
> *To:* user@hive.apache.org
> *Subject:* Re: relationship between a hive query and a tez dag
>
> Several.  Each Hive query becomes one Tez DAG.
>
> Alan.
>
>
> -- 
>
> Sent with Postbox <http://www.getpostbox.com>
>
>
> CONFIDENTIALITY NOTICE
> NOTICE: This message is intended for the use of the individual or 
> entity to which it is addressed and may contain information that is 
> confidential, privileged and exempt from disclosure under applicable 
> law. If the reader of this message is not the intended recipient, you 
> are hereby notified that any printing, copying, dissemination, 
> distribution, disclosure or forwarding of this communication is 
> strictly prohibited. If you have received this communication in error, 
> please contact the sender immediately and delete it from your system. 
> Thank You.
>
> Alan Gates <ma...@hortonworks.com>
> December 16, 2014 at 13:26
> Several.  Each Hive query becomes one Tez DAG.
>
> Alan.
>
>
> Xiaoyong Zhu <ma...@microsoft.com>
> December 15, 2014 at 22:55
>
> Hi experts
>
> I know that Hive could be run on Tez in a DAG "format" -- if I have a 
> complex Hive script which is consisted of several Hive queries, will 
> it be translated to 1 Tez DAG? Or several Tez DAGs?
>
> Thanks!
>
> Xiaoyong
>

-- 
CONFIDENTIALITY NOTICE
NOTICE: This message is intended for the use of the individual or entity to 
which it is addressed and may contain information that is confidential, 
privileged and exempt from disclosure under applicable law. If the reader 
of this message is not the intended recipient, you are hereby notified that 
any printing, copying, dissemination, distribution, disclosure or 
forwarding of this communication is strictly prohibited. If you have 
received this communication in error, please contact the sender immediately 
and delete it from your system. Thank You.

RE: relationship between a hive query and a tez dag

Posted by Xiaoyong Zhu <xi...@microsoft.com>.
Thanks Alan and just for double check: if I have the following script:

1)      Create table tbl1, tbl2;

2)      Select count(*) from tbl1 group by A;

3)      Select count(*) from tbl2 groub by B;

4)      Create table tb3;

And this script will be executed exactly in 1 >2 > 3 > 4 order even 2 and 3 do not have relationship with each other and 4 does not have relation with all query 1,2,3?

Xiaoyong

From: Alan Gates [mailto:gates@hortonworks.com]
Sent: Thursday, December 18, 2014 8:05 AM
To: Xiaoyong Zhu
Cc: user@hive.apache.org
Subject: Re: relationship between a hive query and a tez dag

DDL operations are done from the client, not as part of a Tez DAG.  There still is no linkage between the DAGs even if they access the same tables or somehow depend on each other.  Hive manages making sure queries are executed in the order given.  Tez just executed each query.

Alan.


[cid:image001.jpg@01D03628.5360D0F0]
Xiaoyong Zhu<ma...@microsoft.com>
December 16, 2014 at 15:41
Thanks Alan! Then what if different hive queries have some links (e.g. query 1 creates table abc and query 2 reads data from abc) will this be in 1 DAG or 2 DAGs? If 2, then is there any link/relatioinship between the 2 DAGs?

Xiaoyong

From: Alan Gates [mailto:gates@hortonworks.com]
Sent: Wednesday, December 17, 2014 5:27 AM
To: user@hive.apache.org<ma...@hive.apache.org>
Subject: Re: relationship between a hive query and a tez dag

Several.  Each Hive query becomes one Tez DAG.

Alan.




--
Sent with Postbox<http://www.getpostbox.com>

CONFIDENTIALITY NOTICE
NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
[cid:image002.jpg@01D03628.5360D0F0]
Alan Gates<ma...@hortonworks.com>
December 16, 2014 at 13:26
Several.  Each Hive query becomes one Tez DAG.

Alan.


--
Sent with Postbox<http://www.getpostbox.com>

CONFIDENTIALITY NOTICE
NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.

Re: relationship between a hive query and a tez dag

Posted by Alan Gates <ga...@hortonworks.com>.
DDL operations are done from the client, not as part of a Tez DAG.  
There still is no linkage between the DAGs even if they access the same 
tables or somehow depend on each other.  Hive manages making sure 
queries are executed in the order given.  Tez just executed each query.

Alan.

> Xiaoyong Zhu <ma...@microsoft.com>
> December 16, 2014 at 15:41
>
> Thanks Alan! Then what if different hive queries have some links (e.g. 
> query 1 creates table abc and query 2 reads data from abc) will this 
> be in 1 DAG or 2 DAGs? If 2, then is there any link/relatioinship 
> between the 2 DAGs?
>
> Xiaoyong
>
> *From:*Alan Gates [mailto:gates@hortonworks.com]
> *Sent:* Wednesday, December 17, 2014 5:27 AM
> *To:* user@hive.apache.org
> *Subject:* Re: relationship between a hive query and a tez dag
>
> Several.  Each Hive query becomes one Tez DAG.
>
> Alan.
>
>
> -- 
>
> Sent with Postbox <http://www.getpostbox.com>
>
>
> CONFIDENTIALITY NOTICE
> NOTICE: This message is intended for the use of the individual or 
> entity to which it is addressed and may contain information that is 
> confidential, privileged and exempt from disclosure under applicable 
> law. If the reader of this message is not the intended recipient, you 
> are hereby notified that any printing, copying, dissemination, 
> distribution, disclosure or forwarding of this communication is 
> strictly prohibited. If you have received this communication in error, 
> please contact the sender immediately and delete it from your system. 
> Thank You.
>
> Alan Gates <ma...@hortonworks.com>
> December 16, 2014 at 13:26
> Several.  Each Hive query becomes one Tez DAG.
>
> Alan.
>
>

-- 
Sent with Postbox <http://www.getpostbox.com>

-- 
CONFIDENTIALITY NOTICE
NOTICE: This message is intended for the use of the individual or entity to 
which it is addressed and may contain information that is confidential, 
privileged and exempt from disclosure under applicable law. If the reader 
of this message is not the intended recipient, you are hereby notified that 
any printing, copying, dissemination, distribution, disclosure or 
forwarding of this communication is strictly prohibited. If you have 
received this communication in error, please contact the sender immediately 
and delete it from your system. Thank You.

RE: relationship between a hive query and a tez dag

Posted by Xiaoyong Zhu <xi...@microsoft.com>.
Thanks Alan! Then what if different hive queries have some links (e.g. query 1 creates table abc and query 2 reads data from abc) will this be in 1 DAG or 2 DAGs? If 2, then is there any link/relatioinship between the 2 DAGs?

Xiaoyong

From: Alan Gates [mailto:gates@hortonworks.com]
Sent: Wednesday, December 17, 2014 5:27 AM
To: user@hive.apache.org
Subject: Re: relationship between a hive query and a tez dag

Several.  Each Hive query becomes one Tez DAG.

Alan.


[cid:image001.jpg@01D019CC.EE7BFBF0]
Xiaoyong Zhu<ma...@microsoft.com>
December 15, 2014 at 22:55
Hi experts

I know that Hive could be run on Tez in a DAG "format" - if I have a complex Hive script which is consisted of several Hive queries, will it be translated to 1 Tez DAG? Or several Tez DAGs?

Thanks!

Xiaoyong


--
Sent with Postbox<http://www.getpostbox.com>

CONFIDENTIALITY NOTICE
NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.

Re: relationship between a hive query and a tez dag

Posted by Alan Gates <ga...@hortonworks.com>.
Several.  Each Hive query becomes one Tez DAG.

Alan.

> Xiaoyong Zhu <ma...@microsoft.com>
> December 15, 2014 at 22:55
>
> Hi experts
>
> I know that Hive could be run on Tez in a DAG "format" -- if I have a 
> complex Hive script which is consisted of several Hive queries, will 
> it be translated to 1 Tez DAG? Or several Tez DAGs?
>
> Thanks!
>
> Xiaoyong
>

-- 
Sent with Postbox <http://www.getpostbox.com>

-- 
CONFIDENTIALITY NOTICE
NOTICE: This message is intended for the use of the individual or entity to 
which it is addressed and may contain information that is confidential, 
privileged and exempt from disclosure under applicable law. If the reader 
of this message is not the intended recipient, you are hereby notified that 
any printing, copying, dissemination, distribution, disclosure or 
forwarding of this communication is strictly prohibited. If you have 
received this communication in error, please contact the sender immediately 
and delete it from your system. Thank You.