You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Pengcheng Xiong (JIRA)" <ji...@apache.org> on 2018/11/05 18:27:00 UTC

[jira] [Commented] (HIVE-20867) Rewrite INTERSECT into LEFT SEMI JOIN instead of UNION + Group by

    [ https://issues.apache.org/jira/browse/HIVE-20867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16675550#comment-16675550 ] 

Pengcheng Xiong commented on HIVE-20867:
----------------------------------------

I have some questions about this jira. Could you share your design document on this? I assumed that we compared several candidates when we made the decision, and lefts semi join was one of them. We chose union-based one because a) a similar approach can be applied to except(all) as well, thus we have better code reuse. b) when we have more then 2 branchesĀ as the inputs of intersect, we assume that in the future those branches can be executed in parallel. Comparing with left-semi join one, we need to do the join one by one.

> Rewrite INTERSECT into LEFT SEMI JOIN instead of UNION + Group by
> -----------------------------------------------------------------
>
>                 Key: HIVE-20867
>                 URL: https://issues.apache.org/jira/browse/HIVE-20867
>             Project: Hive
>          Issue Type: Improvement
>          Components: Query Planning
>    Affects Versions: 4.0.0
>            Reporter: Vineet Garg
>            Assignee: Vineet Garg
>            Priority: Major
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)