You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "jiaan.geng (Jira)" <ji...@apache.org> on 2023/04/04 06:07:00 UTC

[jira] [Commented] (SPARK-43025) Eliminate Union if filters on the same child plan

    [ https://issues.apache.org/jira/browse/SPARK-43025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17708229#comment-17708229 ] 

jiaan.geng commented on SPARK-43025:
------------------------------------

I'm working on.

> Eliminate Union if filters on the same child plan
> -------------------------------------------------
>
>                 Key: SPARK-43025
>                 URL: https://issues.apache.org/jira/browse/SPARK-43025
>             Project: Spark
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 3.5.0
>            Reporter: jiaan.geng
>            Priority: Major
>
> There are a lot of SQL in user scenarios. Take an example,
> SELECT
>   ss_item_sk,
>   ss_ticket_number,
>   ss_customer_sk
> FROM store_sales
>   LEFT OUTER JOIN store_returns
>     ON (sr_item_sk = ss_item_sk AND sr_ticket_number = ss_ticket_number)
> WHERE sr_return_amt > 10000
> UNION ALL
> SELECT
>   ss_item_sk,
>   ss_ticket_number,
>   ss_customer_sk
> FROM store_sales
>   LEFT OUTER JOIN store_returns
>     ON (sr_item_sk = ss_item_sk AND sr_ticket_number = ss_ticket_number)
> WHERE sr_return_amt < 1000
> We can simplify this SQL as
> SELECT
>   ss_item_sk,
>   ss_ticket_number,
>   ss_customer_sk
> FROM store_sales
>   LEFT OUTER JOIN store_returns
>     ON (sr_item_sk = ss_item_sk AND sr_ticket_number = ss_ticket_number)
> WHERE sr_return_amt > 10000 or sr_return_amt < 1000



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org