You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "sivabalan narayanan (Jira)" <ji...@apache.org> on 2021/10/08 14:33:00 UTC
[jira] [Assigned] (HUDI-2535) Late arriving records and global
index with partition path update set to true
[ https://issues.apache.org/jira/browse/HUDI-2535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
sivabalan narayanan reassigned HUDI-2535:
-----------------------------------------
Assignee: sivabalan narayanan
> Late arriving records and global index with partition path update set to true
> -----------------------------------------------------------------------------
>
> Key: HUDI-2535
> URL: https://issues.apache.org/jira/browse/HUDI-2535
> Project: Apache Hudi
> Issue Type: Improvement
> Components: Writer Core
> Reporter: sivabalan narayanan
> Assignee: sivabalan narayanan
> Priority: Major
> Labels: sev:high, user-support-issues
>
> incase of a global index, we have a config to update partition path. if this is set to true, if there is an incoming record to a newer partition compared to whats in storage, older record will be deleted and new incoming record will be routed to new partition.
> But it could run into issues if new incoming is a late arriving record. Expected behavior is, old record is retained and new one is discarded it it has lower preCombine value. But in this case, we may not honor that.
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)