You are viewing a plain text version of this content. The canonical link for it is here.
Posted to reviews@spark.apache.org by GitBox <gi...@apache.org> on 2019/04/15 16:36:43 UTC

[GitHub] [spark] cloud-fan opened a new pull request #24376: [SPARK-27225][SQL] fix the join strategy for broadcast nested loop join

cloud-fan opened a new pull request #24376: [SPARK-27225][SQL] fix the join strategy for broadcast nested loop join
URL: https://github.com/apache/spark/pull/24376
 
 
   ## What changes were proposed in this pull request?
   
   Fix a mistake in https://github.com/apache/spark/pull/24164
   
   We should prefer broadcast nested loop join, if the data is small enough. This was the behavior before #24164 (see the [code](https://github.com/apache/spark/pull/24164/files#diff-7253a38df7e111ecf6b1ef71feba383bL293)), we should not change it.
   
   This PR also fixes a related bug that, broadcast hint should be respected for  broadcast nested loop join.
   
   ## How was this patch tested?
   
   new tests

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org