Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2022/07/17 15:13:31 UTC

[GitHub] [hudi] YannByron commented on pull request #5885: [RFC-51][HUDI-3478] Hudi CDC

YannByron commented on PR #5885:
URL: https://github.com/apache/hudi/pull/5885#issuecomment-1186545393

   Hey @prasannarajaperumal Thank you very much for reviewing this.
   
   CDC is not a new concept; it is a common one in databases. So I think it's better to distinguish CDC from `Incremental Query`. Some reasons:
   - CDC is better known than incremental query; `Incremental Query` is a term defined by Hudi.
   - Unlike `Incremental Query` and `Snapshot Query`, CDC has its own output format, in which every record has `op`, `ts_ms`, `before` and `after` fields.
   - According to RFC-51, CDC has its own read and write logic. We have to persist some additional information for CDC when data is written to Hudi.
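
   To make the difference in output shape concrete, here is a minimal sketch in plain Python (not Hudi code). The field names `op`, `ts_ms`, `before` and `after` come from the points above; the op codes `"i"`, `"u"`, `"d"`, the helper names, and the simplified incremental view that returns only the latest image of changed rows are assumptions made for illustration.

```python
from typing import Any, Dict, List, Optional

Row = Dict[str, Any]


def cdc_records(before: Dict[str, Row], after: Dict[str, Row], ts_ms: int) -> List[Dict[str, Any]]:
    """Build CDC-style records by diffing two keyed snapshots (hypothetical helper)."""
    records: List[Dict[str, Any]] = []
    for key in sorted(before.keys() | after.keys()):
        old: Optional[Row] = before.get(key)
        new: Optional[Row] = after.get(key)
        if old == new:
            continue  # unchanged rows produce no change record
        if old is None:
            op = "i"  # assumed code for insert
        elif new is None:
            op = "d"  # assumed code for delete
        else:
            op = "u"  # assumed code for update
        records.append({"op": op, "ts_ms": ts_ms, "before": old, "after": new})
    return records


def incremental_rows(before: Dict[str, Row], after: Dict[str, Row]) -> List[Row]:
    """Simplified incremental view: only the latest image of inserted or updated rows."""
    return [after[k] for k in sorted(after) if before.get(k) != after[k]]


old_snapshot = {"1": {"id": 1, "name": "a"}, "2": {"id": 2, "name": "b"}}
new_snapshot = {"1": {"id": 1, "name": "a2"}, "3": {"id": 3, "name": "c"}}

changes = cdc_records(old_snapshot, new_snapshot, ts_ms=1658070811000)
latest = incremental_rows(old_snapshot, new_snapshot)
```

   In this sketch `changes` carries both the old and the new image of each row plus the operation type, including the delete of key `2`, while `latest` only contains the current values of keys `1` and `3`. Producing the `before` image is why extra information has to be available at read time.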
   
   Looking forward to your reply.

