You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@iceberg.apache.org by "szehon-ho (via GitHub)" <gi...@apache.org> on 2023/04/29 06:26:31 UTC

[GitHub] [iceberg] szehon-ho commented on issue #7463: Spark: inconsistency in rewrite data and summary

szehon-ho commented on issue #7463:
URL: https://github.com/apache/iceberg/issues/7463#issuecomment-1528684496

   Yea, my writeup (linked by @singhpk234 ) explains the problem with not cleaning up dangling deletes.  (If that is problem you refer to).  Actually I have been working on #7389 to solve it.  Maybe its also possible to improve rewrite_data_files to do this automatically as well after this change. 
   
   > Also I am not still sure, why the doc mentions .files table to show current data_files but displays delete files as well, imho we should fix the doc or the behaviour
   > 
   > https://iceberg.apache.org/docs/latest/spark-queries/#files
   
   Totally agree, I forgot to document those tables when I added them.
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org