You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2020/10/20 11:42:53 UTC

[GitHub] [hudi] SteNicholas edited a comment on pull request #2111: [HUDI-1234] Insert new records regardless of small file when using insert operation

SteNicholas edited a comment on pull request #2111:
URL: https://github.com/apache/hudi/pull/2111#issuecomment-712566489


   > @SteNicholas @leesf : Does this essentially mean we no longer support small file handling for "inserts" ?
   > If user doesn't essentially care about duplicates, I agree that we need to have same behavior w/o small file handling. Instead of this approach, can we create a new type of Write Handle which looks like MergeHandle but does not merge but rather appends records and creates a new version of Parquet file. You can then use this Handle instead of UpdateHandle when pure insert operation type is used.
   > 
   > cc @vinothchandar
   
   Yes, user doesn't essentially care about duplicates for small files and the same behavior w/o small file handling makes sense.
   @vinothchandar What do you think about?


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org