You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2022/04/24 13:09:44 UTC

[GitHub] [arrow-rs] tustvold commented on issue #1605: Push-Based Parquet Reader

tustvold commented on issue #1605:
URL: https://github.com/apache/arrow-rs/issues/1605#issuecomment-1107838816

   Adding some free functions to assist migration makes sense to me. It should be pretty much a drop-in replacement.
   
   I am somewhat apprehensive about providing an async version, as the whole intent is to let users handle what level of buffering/pre-fetch makes sense for their use case, but I guess with sufficient disclaimers...
   
   As for SerializedRowGroupReader the challenge is the RowGroupReader trait exposes APIs for column projection, etc... which then gets rather confusing if you have similar concepts on the Scan. I'd rather a clean break, than trying to shoehorn an existing API.
   
   W.r.t to users constructing per-row group, entirely up to their use-case, if they want to they can, if they don't want to, they don't need to


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org