You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "HuangFru (via GitHub)" <gi...@apache.org> on 2023/04/01 03:30:12 UTC

[GitHub] [hudi] HuangFru commented on issue #8332: [SUPPORT] Spark insert overwrite in partition table causes executors OOM.

HuangFru commented on issue #8332:
URL: https://github.com/apache/hudi/issues/8332#issuecomment-1492815564

   > insert overwrite will still deal with small file, you can use `bulk insert` avoid it to speed up, but it only support to `insert into` which is not idempotent operation, you need truncate partitions which you want to insert. Fortunately, the community has related patch support bulk overwrite, #8076 , you can try it.
   
   If the table is empty, will the 'insert into' using 'bulk insert' have better performance than 'insert overwrite'?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org