You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2022/04/06 16:16:31 UTC

[GitHub] [arrow-datafusion] tustvold opened a new issue, #2173: HashJoinExec Evaluates in `ExecutionPlan::execute`

tustvold opened a new issue, #2173:
URL: https://github.com/apache/arrow-datafusion/issues/2173

   **Describe the bug**
   
   Similar to #1939, `HashJoinExec` drives its first child to completion within `HashJoinExec::execute`, see [here](https://github.com/apache/arrow-datafusion/blob/master/datafusion/core/src/physical_plan/hash_join.rs#L316), instead of returning a stream that performs this operation lazily.
   
   This effectively stalls out constructing the rest of the physical plan until the child has been fully evaluated, and prevents result streaming from working correctly. It is also highly probable that one could construct a diamond shaped plan that would block indefinitely as a result.
   
   **To Reproduce**
   
   Run query incorporating a join
   
   **Expected behavior**
   
   `ExecutionPlan::execute` should return a stream of results, but should not block on those results being available
   
   **Additional context**
   
   I am currently experimenting with custom scheduling of physical plans, and this requires the construction of the physical plan to be correctly decoupled from its evaluation
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow-datafusion] alamb closed issue #2173: HashJoinExec Evaluates in `ExecutionPlan::execute`

Posted by GitBox <gi...@apache.org>.
alamb closed issue #2173: HashJoinExec Evaluates in `ExecutionPlan::execute`
URL: https://github.com/apache/arrow-datafusion/issues/2173


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org