
Posted to issues@iceberg.apache.org by GitBox <gi...@apache.org> on 2020/10/28 23:17:31 UTC

[GitHub] [iceberg] rdblue commented on issue #1637: Spark Reads of Duplicate Datafiles

rdblue commented on issue #1637:
URL: https://github.com/apache/iceberg/issues/1637#issuecomment-718262368


   It should be safe to update the map to one that allows replacement, but keep in mind that committing the same files twice will lead to duplicate data, not just a problem when reading. So you may want to keep the existing behavior.
   
   Can you avoid committing duplicates to your table?
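
   To illustrate the trade-off above, here is a minimal sketch in Python. It is not Iceberg's code: the helper names (`build_strict`, `build_replacing`), the file paths, and the row counts are all hypothetical. It assumes the reader builds a lookup keyed by file path and scans every committed entry once.

```python
from typing import Dict, Iterable, Tuple


def build_strict(entries: Iterable[Tuple[str, str]]) -> Dict[str, str]:
    """Build a lookup that rejects a repeated key (analogous to the existing behavior)."""
    result: Dict[str, str] = {}
    for path, value in entries:
        if path in result:
            raise ValueError(f"duplicate key: {path}")
        result[path] = value
    return result


def build_replacing(entries: Iterable[Tuple[str, str]]) -> Dict[str, str]:
    """Build a lookup where a repeated key silently replaces the earlier value."""
    result: Dict[str, str] = {}
    for path, value in entries:
        result[path] = value
    return result


# Hypothetical table state: the same data file was committed twice.
committed_files = ["data/a.parquet", "data/b.parquet", "data/a.parquet"]
rows_per_file = {"data/a.parquet": 3, "data/b.parquet": 2}  # made-up row counts

entries = [(path, f"input-file-for-{path}") for path in committed_files]

# The replacing lookup builds without error and holds one entry per distinct path...
lookup = build_replacing(entries)

# ...but every committed entry is still scanned, so the duplicate file is read twice.
rows_read = sum(rows_per_file[path] for path in committed_files)
distinct_rows = sum(rows_per_file[path] for path in set(committed_files))
```

   In this sketch the replacing lookup removes the read error, but `rows_read` is still larger than `distinct_rows`, while `build_strict(entries)` raises on the repeated path and so surfaces the duplicate commit.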

