Posted to commits@hudi.apache.org by "samirakarki (via GitHub)" <gi...@apache.org> on 2023/03/20 02:48:09 UTC

[GitHub] [hudi] samirakarki commented on issue #8135: [SUPPORT] Data discrepancies on COW upon joining table

samirakarki commented on issue #8135:
URL: https://github.com/apache/hudi/issues/8135#issuecomment-1475531867

   @nsivabalan 
   Both tables are in the same zone, i.e. the landing zone (the zone where data lands from the RDS database).
   
   As I stated above, for our use case, table 1 data didn’t change in the incremental load.
   
   1. The screenshot below shows table 2 after the incremental load:
          Here, for id = 26, the “complete” column changed to False.
   
   ![Screen Shot 2023-03-19 at 7 25 20 PM](https://user-images.githubusercontent.com/84825093/226234423-7067ccae-01a7-4c14-8af9-ca7794f887e4.png)
   
   2. The screenshot below shows the output table from the S3 target bucket (the result of joining tables 1 and 2).
   execution_id is the primary key of table 2 and step_execution_id is the primary key of table 1. In this target table, step_execution_id is the primary key.
   After the incremental load, the six rows with execution_id = 26 still do not show “False” in the completed column, although the completed flag changed to “false” for that id in table 2, as shown in the screenshot above.
   
   
   ![Screen Shot 2023-03-19 at 7 30 01 PM](https://user-images.githubusercontent.com/84825093/226234577-b6f05b57-3336-48f4-ad80-050f3d6a99ca.png)
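
To make the expected result concrete, here is a minimal sketch in plain Python (not Hudi or Spark code, and not our actual job). The row values and the names `table1`, `table2_changes`, and `join_changed` are made up for illustration, and the flag column is assumed to be called `completed`. It only shows which target rows we expect to be rewritten when one table 2 record changes.

```python
# Hypothetical miniature of the two landing-zone tables; all values are made up.
# Table 1: primary key step_execution_id, six steps belonging to execution 26.
table1 = [
    {"step_execution_id": step_id, "execution_id": 26} for step_id in range(1, 7)
]

# Rows of table 2 (primary key execution_id) touched by the incremental load.
table2_changes = [
    {"execution_id": 26, "completed": False},
]


def join_changed(table1_rows, table2_changed_rows):
    """Join every table 1 row to the changed table 2 rows on execution_id."""
    changed = {row["execution_id"]: row for row in table2_changed_rows}
    return [
        {**t1, "completed": changed[t1["execution_id"]]["completed"]}
        for t1 in table1_rows
        if t1["execution_id"] in changed
    ]


# Rows we would expect to be upserted into the target, keyed by step_execution_id.
expected_upserts = join_changed(table1, table2_changes)
```

In this sketch a single changed record in table 2 fans out to six target rows, one per step_execution_id, and each of them should carry the new flag value.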
   
   
   This is one use case we are trying to solve. 
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org