You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@iceberg.apache.org by "singhpk234 (via GitHub)" <gi...@apache.org> on 2023/04/04 19:58:49 UTC

[GitHub] [iceberg] singhpk234 commented on pull request #7279: [Parquet] Eagerly fetch row groups when reading parquet

singhpk234 commented on PR #7279:
URL: https://github.com/apache/iceberg/pull/7279#issuecomment-1496519281

   one more idea on this context and at a higher level, is that can we prefetch next task and keep it in memory and expose an iterator around it, when we are reading a task groups to be specific here 
   
   https://github.com/apache/iceberg/blob/49e930877a16bce2df51d6e51b737d2969208644/spark/v3.3/spark/src/main/java/org/apache/iceberg/spark/source/BaseReader.java#L132-L157
   
   
   
   so that we don't have to wait to read next task and directly ask iterator on it. 
   
   cc @jackye1995 @rdblue  please let me know you thoughts on this as well. 
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org