Posted to dev@drill.apache.org by "Boaz Ben-Zvi (JIRA)" <ji...@apache.org> on 2018/10/17 01:38:00 UTC

[jira] [Created] (DRILL-6799) Enhance the Hash-Join Operator to perform Anti-Semi-Join

Boaz Ben-Zvi created DRILL-6799:
-----------------------------------

             Summary: Enhance the Hash-Join Operator to perform Anti-Semi-Join
                 Key: DRILL-6799
                 URL: https://issues.apache.org/jira/browse/DRILL-6799
             Project: Apache Drill
          Issue Type: Improvement
          Components: Execution - Relational Operators, Query Planning & Optimization
    Affects Versions: 1.14.0
            Reporter: Boaz Ben-Zvi
            Assignee: Boaz Ben-Zvi
             Fix For: 1.16.0


Similar to the handling of Semi-Join (see DRILL-6735), the Anti-Semi-Join can be enhanced by having the Hash-Join operator perform it directly, which eliminates the extra DISTINCT (i.e. Hash-Aggr) operator.

Example (note the NOT IN):
select c.c_first_name, c.c_last_name from dfs.`/data/json/s1/customer` c where c.c_customer_sk NOT IN (select s.ss_customer_sk from dfs.`/data/json/s1/store_sales` s) limit 4;
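
To illustrate why the separate DISTINCT step becomes unnecessary, here is a minimal Python sketch of a hash-based anti-semi-join. It is not Drill's implementation: the function name, the row layout, and the sample data are hypothetical, and the NULL handling follows standard SQL NOT IN semantics, which this issue does not discuss. The build side is loaded into a set, which deduplicates keys as a side effect, and each probe row is emitted only when its key finds no match.

```python
def hash_anti_semi_join(probe_rows, build_keys, key):
    """Return the probe rows whose `key` matches no build-side key.

    Hypothetical sketch of NOT IN evaluated inside a hash join; this is
    not Drill's actual operator code.
    """
    build_set = set()
    build_has_null = False
    for k in build_keys:
        if k is None:
            build_has_null = True
        else:
            # The set keeps each key once, so duplicates on the build side
            # need no separate DISTINCT (Hash-Aggr) pass.
            build_set.add(k)

    # Empty subquery: NOT IN is true for every probe row, NULL keys included.
    if not build_set and not build_has_null:
        return list(probe_rows)
    # A NULL on the build side makes NOT IN false or unknown for every row.
    if build_has_null:
        return []
    # Otherwise keep rows with a non-NULL key that is absent from the build side.
    return [
        row for row in probe_rows
        if row[key] is not None and row[key] not in build_set
    ]


# Made-up rows standing in for the customer and store_sales tables.
customers = [
    {"c_customer_sk": 1, "c_first_name": "Ann", "c_last_name": "Lee"},
    {"c_customer_sk": 2, "c_first_name": "Bob", "c_last_name": "Ray"},
    {"c_customer_sk": 3, "c_first_name": "Cy", "c_last_name": "Orr"},
    {"c_customer_sk": None, "c_first_name": "Di", "c_last_name": "Fox"},
]
sales_keys = [2, 2, 5]  # ss_customer_sk values, with a duplicate

# The slice plays the role of "limit 4" in the query above.
result = hash_anti_semi_join(customers, sales_keys, "c_customer_sk")[:4]
names = [(r["c_first_name"], r["c_last_name"]) for r in result]
```

In this sketch the duplicate key 2 costs nothing extra, because the probe only asks whether a key is present and never multiplies rows per match.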



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)