You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@iceberg.apache.org by GitBox <gi...@apache.org> on 2022/06/30 04:33:07 UTC

[GitHub] [iceberg] Reo-LEI commented on pull request #5061: Flink: upsert table join failed

Reo-LEI commented on PR #5061:
URL: https://github.com/apache/iceberg/pull/5061#issuecomment-1170750467

   ![image](https://user-images.githubusercontent.com/17312872/176584779-2274c7ba-d40e-4b95-aa24-711091b8a3db.png)
   @hililiwei I reproduced this case in my local env, the `id=2` record will be deleted is Flink's behavior, has nothing to do with iceberg. And I think this record should be delete becasue the record `('2','20220503')` does not meet this filter condition `t1.data > t2.data`. So the expected final result should be `('1','20220607'), ('3','20220505')` same as the batch result.    
   <br/>
   
   ![image](https://user-images.githubusercontent.com/17312872/176589188-ed824b7b-b1ee-4dc2-b24c-35c6d7e973da.png)
   > so it is strange to have +I[2, ...] and -D[2, ...] in the join output.
   @rdblue If we adjust the env setting, the +I[2, ...] and -D[2, ...] will disappear occasionally. I think the occurrence of these records is related to the timing of checkpoint and the distribution of data.
   <br/>
   
   > As we apache iceberg format v2 don't support incremental streaming reading, then how could we accomplish this streaming read and streaming join ?
   @openinx Because this test only setting the env as `StreamTableEnvironment`, but there not setting the `ScanContext.isStreaming` as true(that is false by default). Therefor, the iceberg table will be scan by batch but not streaming.
   
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org