You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2019/11/27 19:29:03 UTC

[GitHub] [incubator-hudi] bschell commented on issue #1052: [HUDI-326] Add new index to suppport global update/delete

bschell commented on issue #1052: [HUDI-326] Add new index to suppport global update/delete
URL: https://github.com/apache/incubator-hudi/pull/1052#issuecomment-559222366
 
 
   I see, I think this feature could instead possibly be refactored into HoodieBloomIndex instead then as the logic is compatible. My thinking was that a user would use this Index only to scan extra partitions for global update/delete but could then use the normal HoodieBloomIndex by default for the improved performance. But that is more of a view than an index then.
   
   As it stands the current implementation of HoodieGlobalBloomIndex doesn't really guarantee globally unique keys either if an insert contains duplicates. This causes weird behaviors when records with duplicates are updated which is the reason why we created this Index.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services