You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@iceberg.apache.org by GitBox <gi...@apache.org> on 2021/02/03 09:53:30 UTC

[GitHub] [iceberg] vvellanki commented on pull request #2182: Support for PartitionStatsFile in each snapshot

vvellanki commented on pull request #2182:
URL: https://github.com/apache/iceberg/pull/2182#issuecomment-772380537


   @rdblue The use case for tracking per-partition stats is mainly for planning purposes - estimating the number of rows (for join planning, and join ordering) and the number of files to read based on the dataset that needs to be read.
   
   As of now, the submitted change rewrites only the required files. Are suggesting the following approach:
   - Commit the transaction by including placeholders for the partition stats files
   - Asynchronously write the partition stats files


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org