Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2020/11/10 19:28:42 UTC

[GitHub] [hudi] kpurella commented on issue #2240: [SUPPORT] Performance Issue : HUDI MOR ,UPSERT Job running forever

kpurella commented on issue #2240:
URL: https://github.com/apache/hudi/issues/2240#issuecomment-724917083


   @bvaradar Thank you for your quick response.
   1) We are not using any ordering, since my key is a composite key (a combination of 4 attributes).
   - Sure, I will give it a try with hoodie.bloom.index.prune.by.ranges=false.
   2) Earlier we were seeing duplicates because my records are stored across partitions (DAILY), so we switched to GLOBAL_BLOOM. I will give SIMPLE and BLOOM a try as well, along with GLOBAL_SIMPLE.
   - Questions
   > How can I control the Hudi output file size?
   > Is it going to improve performance if I use a single partition?
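
   For reference, here is a rough sketch of how the index settings from points 1 and 2 might be assembled as Hudi write options. The helper name `hudi_index_options` and the record key fields (`key_a` ... `key_d`) are placeholders I made up for illustration, and using ComplexKeyGenerator for the composite key is an assumption, not necessarily our exact job config.

```python
# Hypothetical helper: builds a dict of Hudi write options for trying
# different index types. Field names are placeholders, not from the real job.
ALLOWED_INDEX_TYPES = {"BLOOM", "GLOBAL_BLOOM", "SIMPLE", "GLOBAL_SIMPLE"}


def hudi_index_options(index_type="SIMPLE", prune_by_ranges=False):
    """Return Hudi write options for a MOR upsert with the given index type."""
    if index_type not in ALLOWED_INDEX_TYPES:
        raise ValueError(f"unsupported index type: {index_type}")

    opts = {
        "hoodie.datasource.write.operation": "upsert",
        "hoodie.datasource.write.table.type": "MERGE_ON_READ",
        # Composite key of 4 attributes (placeholder column names).
        "hoodie.datasource.write.recordkey.field": "key_a,key_b,key_c,key_d",
        # Assumption: a composite key is handled via ComplexKeyGenerator.
        "hoodie.datasource.write.keygenerator.class":
            "org.apache.hudi.keygen.ComplexKeyGenerator",
        "hoodie.index.type": index_type,
    }

    # Range pruning only applies to the bloom-based indexes.
    if index_type in ("BLOOM", "GLOBAL_BLOOM"):
        opts["hoodie.bloom.index.prune.by.ranges"] = str(prune_by_ranges).lower()

    return opts


# Usage with Spark (not executed here; df and base_path are assumed to exist):
# df.write.format("hudi").options(**hudi_index_options("GLOBAL_BLOOM")) \
#     .mode("append").save(base_path)
```

   Switching between index types is then a one-argument change, which should make it easier to compare run times for each option.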
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org