Posted to github@arrow.apache.org by "comphead (via GitHub)" <gi...@apache.org> on 2023/05/08 17:57:17 UTC

[GitHub] [arrow-datafusion] comphead commented on issue #6287: [DISCUSS] Set DataFusion settings for maximum "out of the box" performance

comphead commented on issue #6287:
URL: https://github.com/apache/arrow-datafusion/issues/6287#issuecomment-1538796242

   Another example I have found: increasing the batch size resolves some performance problems caused by excessive collection allocations or loops.
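
A minimal sketch of why a larger batch size can reduce that overhead, assuming an operator that allocates one scratch collection per batch. The function `sum_in_batches` and its scratch buffer are hypothetical illustrations, not DataFusion code:

```rust
/// Processes `values` in chunks of `batch_size` and returns
/// (sum of doubled values, number of scratch allocations).
/// Hypothetical stand-in for an operator with per-batch overhead.
fn sum_in_batches(values: &[u64], batch_size: usize) -> (u64, usize) {
    assert!(batch_size > 0, "batch_size must be non-zero");
    let mut total = 0u64;
    let mut scratch_allocations = 0usize;
    for batch in values.chunks(batch_size) {
        // One scratch Vec is allocated per batch, so the allocation
        // count scales with the number of batches, not the number of rows.
        let scratch: Vec<u64> = batch.iter().map(|v| v * 2).collect();
        scratch_allocations += 1;
        total += scratch.iter().sum::<u64>();
    }
    (total, scratch_allocations)
}
```

With ten input rows, a batch size of 3 performs four scratch allocations while a batch size of 10 performs one, and the result is the same in both cases.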
   
   I suspect there is also a problem when code is triggered for empty batches: it spins the CPU but does no actual work. I think I have noticed this behavior a couple of times, but I need a good reproducible test to prove it.
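
If that suspicion about empty batches holds, one possible mitigation is an early skip before any per-batch work. This is only a sketch under that assumption; `process_batches` is a hypothetical name, and batches are modeled as plain vectors rather than Arrow record batches:

```rust
/// Sums all values across `batches` and returns
/// (total, number of batches that were actually processed).
fn process_batches(batches: &[Vec<i64>]) -> (i64, usize) {
    let mut total = 0i64;
    let mut processed = 0usize;
    for batch in batches {
        // Skip empty batches up front so no per-batch setup runs for them.
        if batch.is_empty() {
            continue;
        }
        processed += 1;
        total += batch.iter().sum::<i64>();
    }
    (total, processed)
}
```

Comparing the number of batches processed against the number received could also serve as a starting point for a reproducer.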
