You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2022/09/24 00:04:59 UTC

[GitHub] [hudi] nsivabalan commented on pull request #5633: [HUDI-4123] Fix the exception due to SqlSource return null checkpoint

nsivabalan commented on PR #5633:
URL: https://github.com/apache/hudi/pull/5633#issuecomment-1256807510

   I don't think we have any bugs as such. here is the context.
   SqlSource is different from other sources since there is no concept of checkpoint. 
   
   So, lets say, we define a sql as "select * from tbl1"
   
   We do syncOnce() and ingest into hudi. 
   
   next time again, we are going to invoke the same sql only. which is "select * from tbl1". So, we don't have anything like a checkpoint to resume from previous attempt when we invoked sql. So, just for sqlSource, the checkpoint that gets serialized into commit metadata is always null. So, if we invoke syncOnce() again, hudi tries to fetch the checkpoint from commit metadata which is null and again we just invoke the sql as is. 
   
   your test had some issue. we have to generate new data to sql test. 
   
   I have enhanced the test for sqlSource to test 2 syncOnce(). 
   
   https://github.com/apache/hudi/pull/6781
   
   Let me know if it makes sense. 
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org