You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "ad1happy2go (via GitHub)" <gi...@apache.org> on 2023/04/03 15:42:06 UTC

[GitHub] [hudi] ad1happy2go commented on issue #8365: [SUPPORT] inconsistent Readoptimized view in merge on read table

ad1happy2go commented on issue #8365:
URL: https://github.com/apache/hudi/issues/8365#issuecomment-1494557334

   @pravin1406 I did some experimentation on the same and here is my findings. 
   
   It handles only updates in the log file and new insert goes to parquet file.
   
   Only updates - can see only log file
   Only inserts - can see only parquet file
   Updates or Inserts - can see both log and parquet file.
   
   So When we read the read optimised view it will read the parquet files. So we can see new inserts without existing rows being updated as they will just be coming from old parquet. 
   
   So This can be reason for data inconsistency issues. Will confirm and if its a valid bug I will raise the JIRA for the same.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org