Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2020/09/15 17:05:32 UTC

[GitHub] [arrow] nealrichardson commented on pull request #8188: ARROW-9924: [C++][Dataset] Enable per-column parallelism for single ParquetFileFragment scans

nealrichardson commented on pull request #8188:
URL: https://github.com/apache/arrow/pull/8188#issuecomment-692848891


   Can you run a benchmark or ad-hoc performance check, including on the file/shape in question here, to see the effects? I see two changes (the batch size and the single-file behavior), so we should be clear about the effect of each.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org