You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@iceberg.apache.org by GitBox <gi...@apache.org> on 2022/10/25 02:03:27 UTC

[GitHub] [iceberg] ajantha-bhat commented on issue #6042: Add delete file information to partitions table

ajantha-bhat commented on issue #6042:
URL: https://github.com/apache/iceberg/issues/6042#issuecomment-1289876866

   @szehon-ho : 
   Say for `partition-a` I have `record_count`=6 and `file_count`=2. [3 records in each file]
   Now, I do position delete which marks 3 records in file1 as deleted for `partition-a`. 
   
   So, are you suggesting instead of decrementing `record_count` to 3 and incrementing `file_count` to 3,
   increment `pos_delete_record_count`=3, `pos_delete_file_count`=1 ?
   
   In this case, the user has to manually calculate that now instead of 6 there are 3 total records and instead of 2 files, now need to scan 3 files.
   
   Sounds good to me. It avoids confusion. 
   
   What about the global deletes?  (row level delete for unpartitioned data). Does it come under the partition value `null`?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org