You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@iceberg.apache.org by GitBox <gi...@apache.org> on 2022/06/13 17:28:06 UTC

[GitHub] [iceberg] kbendick commented on issue #5023: Failed to delete data from iceberg data table

kbendick commented on issue #5023:
URL: https://github.com/apache/iceberg/issues/5023#issuecomment-1154190065

   First, you don’t need to include both `iceberg-core` and `iceberg-spark-runtime-3.2_2.12`. For Iceberg, the “Spark runtime” jar is the only dependency required (though you nigh need some non-Iceberg dependencies, such as AWS s3 Java SDK v2 to use S3FileIO). But `iceberg-core` is already included in the Spark runtime jar.
   
   Second, can you please provide the full output of `desc extended <tbl>`? I’m wondering if one of the columns has maybe been dropped, and so the query filter is being simplified to false because a non-null value is being provided.
   
   In any case, the `desc extended <tbl>` output, as well as the full logs from the deletion, would be beneficial.
   
   It’s also possible that one of the columns you marked is a partition column or something and that value isn’t present. Just a couple guesses, but the full logs as well as the `desc extended <tbl>` output would help a lot in debugging.
   
   Additionally, querying the current snapshot with that same filter to see if any data is returned (eg `select count(*) from tbl where <that deletion where clause>`) would also be beneficial in debugging.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org