You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@iceberg.apache.org by GitBox <gi...@apache.org> on 2020/04/06 22:43:40 UTC

[GitHub] [incubator-iceberg] rdblue commented on issue #360: Spec: Add column equality delete files

rdblue commented on issue #360: Spec: Add column equality delete files
URL: https://github.com/apache/incubator-iceberg/issues/360#issuecomment-610075049
 
 
   @openinx, what is the motivating use case for a format like that one?
   
   The use cases that we've been considering for equality deletes mostly have a stable column set that will be used for deletes. For example, a CDC use case typically uses the primary key of the table that is changing. In that case, it's easier if the schema of the delete file is a data file with just the primary key columns. So if I had a `users` table, I might have a delete file with a single column, `user_id` with a field ID that matches the data files.
   
   What are the use cases for a format that allows a dynamic column set for every row? I would want to make sure that it is worth the extra complexity.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org