Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2022/01/27 05:39:01 UTC

[GitHub] [hudi] stayrascal edited a comment on issue #4030: [SUPPORT] Flink uses updated fields to update data

stayrascal edited a comment on issue #4030:
URL: https://github.com/apache/hudi/issues/4030#issuecomment-1022865829


   Thanks for the reply.
   
   I'm wondering whether we could change the ValueState of `BucketAssignFunction` to store the whole `HoodieRecord` instead of `HoodieRecordGlobalLocation`. Then, when the partition path changes, besides emitting a delete record for the old file (partition), we could also update the partition path of the old record and emit the updated record to the new file (partition).
   
   Later, the incoming record and the updated record (derived from the old record) would be merged via `#preCombine` in `StreamWriteFunction`.
   
   Is there any other impact of using ValueState to store the whole `HoodieRecord` instead of `HoodieRecordGlobalLocation`, apart from the larger state size?
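
   To make the idea concrete, here is a rough sketch of the flow I have in mind. It uses a plain `HashMap` in place of Flink keyed state, and `Rec`, `process`, and the simplified `preCombine` are hypothetical stand-ins, not the real Hudi or Flink classes. It only illustrates the order of emitted records when the partition path changes.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class PartitionChangeSketch {

    /** Hypothetical stand-in for HoodieRecord (not the real class). */
    static final class Rec {
        final String key;
        final String partition;
        final String payload;
        final long orderingVal;
        final boolean delete;

        Rec(String key, String partition, String payload, long orderingVal, boolean delete) {
            this.key = key;
            this.partition = partition;
            this.payload = payload;
            this.orderingVal = orderingVal;
            this.delete = delete;
        }

        @Override
        public String toString() {
            return (delete ? "DELETE " : "UPSERT ") + key + "@" + partition + " payload=" + payload;
        }
    }

    // Stand-in for the keyed ValueState: record key -> last seen full record.
    private final Map<String, Rec> state = new HashMap<>();

    /** Returns the records that would be emitted downstream for one incoming record. */
    List<Rec> process(Rec incoming) {
        List<Rec> out = new ArrayList<>();
        Rec old = state.get(incoming.key);
        if (old != null && !old.partition.equals(incoming.partition)) {
            // 1) Delete the record from the old partition.
            out.add(new Rec(old.key, old.partition, null, old.orderingVal, true));
            // 2) Re-emit the old record under the new partition path so it can be merged later.
            out.add(new Rec(old.key, incoming.partition, old.payload, old.orderingVal, false));
        }
        // 3) Emit the incoming record itself.
        out.add(incoming);
        // Assumption: the state keeps the latest incoming record, not a merged one.
        state.put(incoming.key, incoming);
        return out;
    }

    /** Simplified merge: keep the record with the larger ordering value (ties go to b). */
    static Rec preCombine(Rec a, Rec b) {
        return b.orderingVal >= a.orderingVal ? b : a;
    }

    public static void main(String[] args) {
        PartitionChangeSketch fn = new PartitionChangeSketch();
        fn.process(new Rec("k1", "2022-01-01", "a", 1L, false)).forEach(System.out::println);
        fn.process(new Rec("k1", "2022-01-02", "b", 2L, false)).forEach(System.out::println);
    }
}
```

   In this sketch the state holds one full payload per key rather than just a location, which is the state-size cost mentioned above. Whether the state should keep the raw incoming record or the merged result is left open here.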


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org