You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "Vinoth Chandar (Jira)" <ji...@apache.org> on 2021/01/11 06:25:00 UTC

[jira] [Updated] (HUDI-845) Allow parallel writing to Hudi tables using Locking service

     [ https://issues.apache.org/jira/browse/HUDI-845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vinoth Chandar updated HUDI-845:
--------------------------------
    Fix Version/s:     (was: 0.7.0)
                   0.8.0

> Allow parallel writing to Hudi tables using Locking service
> -----------------------------------------------------------
>
>                 Key: HUDI-845
>                 URL: https://issues.apache.org/jira/browse/HUDI-845
>             Project: Apache Hudi
>          Issue Type: Improvement
>          Components: Writer Core
>            Reporter: Vinoth Chandar
>            Assignee: Nishith Agarwal
>            Priority: Blocker
>              Labels: help-requested, pull-request-available
>             Fix For: 0.8.0
>
>
> Things to think about 
>  * Commit time has to be unique across writers 
>  * Parallel writers can finish commits out of order i.e c2 commits before c1.
>  * MOR log blocks fence uncommited data.. 
>  * Cleaner should loudly complain if it cannot finish cleaning up partial writes.  
>  * Move the pending rollback work into cleaner
>  
> P.S: think about what is left for the general thing : log files may have different order, inserts may violate uniqueness constraint



--
This message was sent by Atlassian Jira
(v8.3.4#803005)