You are viewing a plain text version of this content. The canonical link for it is here.
Posted to reviews@spark.apache.org by GitBox <gi...@apache.org> on 2022/11/30 15:12:54 UTC

[GitHub] [spark] kmozaid commented on pull request #31573: [SPARK-34444][SQL] Pushdown scalar-subquery filter to FileSourceScan

kmozaid commented on PR #31573:
URL: https://github.com/apache/spark/pull/31573#issuecomment-1332320279

   @SaurabhChawla100 Hi Saurabh, I didn't get your statement `If Scalar Subquery completes first, than only scan of t1 starts before this change and after this PR also , Than push down of scalar subquery will always be faster`
   
   The query `SELECT * FROM t1 WHERE b = (select max(d) from t2);` given in this example completes very fast and still it is not being pushed down. 
   
   Suggestion - Can we provide hint to planner to execute subquery first and push down subquery filter?
   
   cc @wangyum Could you please reopen this PR?
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org