Posted to commits@hudi.apache.org by "sivabalan narayanan (Jira)" <ji...@apache.org> on 2021/10/04 22:22:00 UTC
[jira] [Comment Edited] (HUDI-1492) Handle DeltaWriteStat correctly for storage schemes that support appends
[ https://issues.apache.org/jira/browse/HUDI-1492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17424212#comment-17424212 ]
sivabalan narayanan edited comment on HUDI-1492 at 10/4/21, 10:21 PM:
----------------------------------------------------------------------
While merging multiple entries in HoodieMetadataPayload, we merge the entries that share a file name and take the larger file length as the final value. I don't see an issue as such here.
Or is this something else? [~nishith29]
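
To make the merge behaviour described above concrete, here is a minimal sketch. It is not the actual HoodieMetadataPayload code: the class FileSizeMergeSketch, the FileEntry type, the method mergeByMaxLength, and the file paths are all hypothetical names used only for illustration. It assumes each write stat contributes one (path, length) pair, and that an append to an existing log file reports the same path again with a larger length.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class FileSizeMergeSketch {

    // Hypothetical stand-in for the (path, file length) pair taken from one write stat.
    static final class FileEntry {
        final String path;
        final long length;

        FileEntry(String path, long length) {
            this.path = path;
            this.length = length;
        }
    }

    // Collapse entries that share a path, keeping the largest length seen for each path.
    static Map<String, Long> mergeByMaxLength(List<FileEntry> entries) {
        Map<String, Long> merged = new HashMap<>();
        for (FileEntry e : entries) {
            merged.merge(e.path, e.length, Long::max);
        }
        return merged;
    }

    public static void main(String[] args) {
        // Two appends to the same (made-up) log file, plus one unrelated file.
        List<FileEntry> entries = Arrays.asList(
                new FileEntry("2021/10/04/.f1.log.1", 100L),
                new FileEntry("2021/10/04/.f1.log.1", 250L),
                new FileEntry("2021/10/04/.f2.log.1", 80L));

        Map<String, Long> merged = mergeByMaxLength(entries);
        System.out.println("distinct files: " + merged.size());
        System.out.println("f1 length: " + merged.get("2021/10/04/.f1.log.1"));
    }
}
```

Under these assumptions, a repeated path never produces a second file entry; it only raises the recorded length, which is why duplicate paths from appends would be harmless after the merge.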
was (Author: shivnarayan):
While merging multiple entries in HoodieMetadataPayload, we merge the entries that share a file name and take the larger file length as the final value. I don't see an issue as such here.
> Handle DeltaWriteStat correctly for storage schemes that support appends
> ------------------------------------------------------------------------
>
> Key: HUDI-1492
> URL: https://issues.apache.org/jira/browse/HUDI-1492
> Project: Apache Hudi
> Issue Type: Sub-task
> Reporter: Vinoth Chandar
> Assignee: sivabalan narayanan
> Priority: Blocker
> Fix For: 0.10.0
>
>
> Current implementation simply uses the
> {code:java}
> String pathWithPartition = hoodieWriteStat.getPath(); {code}
> to write the metadata table. This is problematic if the delta write was merely an append, because it can technically add duplicate file entries to the metadata table
> (not sure if this is a problem per se, but filing a Jira to track and either close or fix).
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)