You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@iceberg.apache.org by GitBox <gi...@apache.org> on 2021/03/05 07:13:23 UTC

[GitHub] [iceberg] openinx opened a new issue #2298: Consider both delete file size and data file size when planing tasks

openinx opened a new issue #2298:
URL: https://github.com/apache/iceberg/issues/2298


   Currently, TableScanUtil#splitFiles and TableScanUtil#planTasks only consider data file size,  that's reasonable when we plan tasks for data in iceberg table format v1.  While as we will produces lots of delete files after enabled the row-level delete,  the split policy and plan task policy need to consider both data file size and delete file size, so that the data size of each task would be balanced.
    


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org