You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Ayushi Agarwal (Jira)" <ji...@apache.org> on 2022/08/22 08:14:00 UTC

[jira] [Created] (SPARK-40177) Simplify join condition of form (a==b) || (a==null&&b==null) to a<=>b

Ayushi Agarwal created SPARK-40177:
--------------------------------------

             Summary: Simplify join condition of form (a==b) || (a==null&&b==null) to a<=>b
                 Key: SPARK-40177
                 URL: https://issues.apache.org/jira/browse/SPARK-40177
             Project: Spark
          Issue Type: Task
          Components: SQL
    Affects Versions: 3.3.0, 3.2.0
            Reporter: Ayushi Agarwal
             Fix For: 3.3.1


If the join condition is like key1==key2 || (key1==null && key2==null), join is executed as Broadcast Nested Loop Join as this condition doesn't satisfy equi join condition. BNLJ takes more time as compared to Sort merge or broadcast join. This condition can be converted to key1<=>key2 to make the join execute as Broadcast or sort merge join.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org