You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2021/08/06 20:13:44 UTC

[GitHub] [hudi] FeiZou commented on issue #3418: [SUPPORT] Hudi Upsert Very Slow/ Failed With No Space Left on Device

FeiZou commented on issue #3418:
URL: https://github.com/apache/hudi/issues/3418#issuecomment-894495149


   @nsivabalan Sorry I didn't made it clear. The source table we are migrating contains duplicate that's why we want to use `upsert` to remove the duplicate during the migration. And I have another question that should I use partitioned or non-partitioned table in this case? From reading the docs, it's saying that if I want to update records within partitioned table, I need to use `globalBloomIndex` instead `bloomIndex`. Not sure how partitioned table with `globalBloomIndex` will perform comparing with non-partitioned table with `bloomIndex` base the total data I have?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org