You are viewing a plain text version of this content. The canonical link for it is here.
Posted to reviews@spark.apache.org by GitBox <gi...@apache.org> on 2021/02/17 03:38:45 UTC

[GitHub] [spark] wangyum edited a comment on pull request #29726: [SPARK-32855][SQL] Improve DPP for some join type do not support broadcast filtering side

wangyum edited a comment on pull request #29726:
URL: https://github.com/apache/spark/pull/29726#issuecomment-780266596


   ```scala
   sql("set spark.sql.adaptive.enabled=false")
   sql("CREATE TABLE t1 using parquet partitioned by (b) AS SELECT id AS a, id % 1000 AS b FROM range(200000000L) distribute by b")
   sql("CREATE TABLE t2 using parquet AS SELECT id as c,id AS d FROM range(2000)")
   sql("SELECT count(*) FROM t2 left join t1 on t1.b=t2.c where t2.d < 10").show
   ```
   Before this pr | After this pr
   -- | --
   ![image](https://user-images.githubusercontent.com/5399861/108152681-8d5e8a80-7114-11eb-9e9d-ca6bb5bb3e29.png)  | ![image](https://user-images.githubusercontent.com/5399861/108150840-59816600-7110-11eb-8b94-c2ad740d125d.png)
   
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org