Posted to issues@spark.apache.org by "XiDuo You (Jira)" <ji...@apache.org> on 2022/04/13 07:06:00 UTC
[jira] [Created] (SPARK-38886) Remove outer join if aggregate functions are duplicate agnostic on streamed side
XiDuo You created SPARK-38886:
---------------------------------
Summary: Remove outer join if aggregate functions are duplicate agnostic on streamed side
Key: SPARK-38886
URL: https://issues.apache.org/jira/browse/SPARK-38886
Project: Spark
Issue Type: Improvement
Components: SQL
Affects Versions: 3.4.0
Reporter: XiDuo You
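If the child of an aggregate is an outer join, all of the aggregate's references come from the streamed side, and all of its aggregate functions are duplicate agnostic, we can remove the outer join. An outer join keeps every row of the streamed side and can only repeat those rows, so an aggregate that ignores duplicates returns the same result with or without the join.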
For example:
{code:sql}
SELECT t1.c1, max(t1.c2) FROM t1 LEFT JOIN t2 ON t1.c1 = t2.c1 GROUP BY t1.c1
==>
SELECT t1.c1, max(t1.c2) FROM t1 GROUP BY t1.c1
{code}
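The equivalence above can be checked on a small data set. The following is only a sketch: it uses Python's built-in sqlite3 module as a stand-in SQL engine rather than Spark, the table contents are made up for illustration, and it takes "streamed side" to mean the left (row-preserving) side of the LEFT JOIN. It compares max, which ignores duplicates, with sum, which does not.

```python
import sqlite3

# In-memory database with hypothetical sample rows (not from the ticket).
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE t1 (c1 INTEGER, c2 INTEGER);
CREATE TABLE t2 (c1 INTEGER);
INSERT INTO t1 VALUES (1, 10), (1, 20), (2, 30), (3, 40);
-- c1 = 1 appears twice in t2, so the join duplicates matching t1 rows;
-- c1 = 3 has no match, so the outer join keeps that t1 row with NULLs.
INSERT INTO t2 VALUES (1), (1), (2);
""")


def rows(sql):
    """Run a query and return all result rows as a list of tuples."""
    return conn.execute(sql).fetchall()


JOINED = "FROM t1 LEFT JOIN t2 ON t1.c1 = t2.c1 GROUP BY t1.c1 ORDER BY t1.c1"
PLAIN = "FROM t1 GROUP BY t1.c1 ORDER BY t1.c1"

# max is duplicate agnostic: repeated rows do not change the result.
max_with_join = rows("SELECT t1.c1, max(t1.c2) " + JOINED)
max_without_join = rows("SELECT t1.c1, max(t1.c2) " + PLAIN)

# sum is not duplicate agnostic: repeated rows are counted again.
sum_with_join = rows("SELECT t1.c1, sum(t1.c2) " + JOINED)
sum_without_join = rows("SELECT t1.c1, sum(t1.c2) " + PLAIN)

print("max equal with and without join:", max_with_join == max_without_join)
print("sum equal with and without join:", sum_with_join == sum_without_join)
```

On this data the two max queries agree, while the two sum queries differ for c1 = 1, which is why the rewrite is restricted to duplicate-agnostic aggregate functions.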
--
This message was sent by Atlassian Jira
(v8.20.1#820001)