Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2022/04/22 04:03:21 UTC

[GitHub] [hudi] yesemsanthoshkumar commented on issue #1985: [SUPPORT]Error while running deltastreamer on top of backfilled data using Hudi

yesemsanthoshkumar commented on issue #1985:
URL: https://github.com/apache/hudi/issues/1985#issuecomment-1105979967

   @bvaradar In my use case, the initial data is loaded into Hudi using spark.write(). Subsequent data flows through Kafka, and I'd like to ingest it into the same table. The steps I've followed are:
   1. spark.write to Hudi.
   2. Run deltastreamer with a checkpoint in the format `kafkatopic,partition:0`. I've used 0 as the offset because the Kafka topic is newly created and all the data is already in the topic when I run deltastreamer.
   
   Is this the correct way to do it?
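
   To make the checkpoint format in step 2 concrete, here is a minimal sketch of how such a string could be assembled and read back. It assumes the layout is the topic name followed by comma-separated `partition:offset` pairs, which is my reading of the format above. The helper names `build_kafka_checkpoint` and `parse_kafka_checkpoint` are hypothetical and are not part of Hudi.

```python
from typing import Dict, Tuple


def build_kafka_checkpoint(topic: str, offsets: Dict[int, int]) -> str:
    """Build a 'topic,partition:offset,...' string (assumed layout, hypothetical helper)."""
    if not topic or "," in topic or ":" in topic:
        raise ValueError("topic must be non-empty and contain no ',' or ':'")
    if not offsets:
        raise ValueError("at least one partition offset is required")
    for partition, offset in offsets.items():
        if partition < 0 or offset < 0:
            raise ValueError("partitions and offsets must be non-negative")
    # Sort by partition number so the output is deterministic.
    pairs = [f"{partition}:{offset}" for partition, offset in sorted(offsets.items())]
    return ",".join([topic, *pairs])


def parse_kafka_checkpoint(checkpoint: str) -> Tuple[str, Dict[int, int]]:
    """Split a checkpoint string back into (topic, {partition: offset})."""
    topic, *pairs = checkpoint.split(",")
    offsets: Dict[int, int] = {}
    for pair in pairs:
        partition, offset = pair.split(":")
        offsets[int(partition)] = int(offset)
    return topic, offsets


if __name__ == "__main__":
    # Single-partition topic read from the beginning, as in step 2.
    print(build_kafka_checkpoint("kafkatopic", {0: 0}))
```

   For a topic with several partitions, each partition would get its own `partition:offset` pair, so starting every partition at offset 0 means listing all of them.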

