Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2021/11/01 17:41:03 UTC

[GitHub] [hudi] bkosuru commented on issue #3892: Insert produces 44764 files with ~50MB each

bkosuru commented on issue #3892:
URL: https://github.com/apache/hudi/issues/3892#issuecomment-956441700


   Hi @dongkelun, 
   Thanks for the suggestion. The number of files dropped to 2998 for INSERT. I have a couple of questions.
   
   1) What options control the size and number of parquet files for INSERT?
   2) If INSERT and BULK_INSERT take the same amount of time to write the data, which one gives better query performance? And which is the better choice overall?
   
   Thanks,
   Bindu

