You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by "Dandandan (via GitHub)" <gi...@apache.org> on 2023/04/14 12:33:01 UTC

[GitHub] [arrow-datafusion] Dandandan commented on issue #5999: Improve DataFusion scalability as more cores are added

Dandandan commented on issue #5999:
URL: https://github.com/apache/arrow-datafusion/issues/5999#issuecomment-1508438210

   Suggestion I posted in slack:
   
   I wonder whether there might be some regressions here wrt scalability with the nr of cores - https://github.com/apache/arrow-datafusion/pull/4219 comes to mind which makes building the hashmap for the build-side done in a single thread for smaller tables
   might be checked by changing datafusion.optimizer.hash_join_single_partition_threshold


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org