You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "Prashant Wason (Jira)" <ji...@apache.org> on 2021/08/07 08:17:00 UTC

[jira] [Updated] (HUDI-1794) Generating a new instant time in HoodieActiveTimeline is not thread safe

     [ https://issues.apache.org/jira/browse/HUDI-1794?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Prashant Wason updated HUDI-1794:
---------------------------------
    Fix Version/s: 0.9.0

> Generating a new instant time in HoodieActiveTimeline is not thread safe
> ------------------------------------------------------------------------
>
>                 Key: HUDI-1794
>                 URL: https://issues.apache.org/jira/browse/HUDI-1794
>             Project: Apache Hudi
>          Issue Type: Bug
>            Reporter: Prashant Wason
>            Assignee: Prashant Wason
>            Priority: Major
>              Labels: pull-request-available, sev:high
>             Fix For: 0.9.0
>
>
> When generating a new instant time in HoodieActiveTimeline, a static instance of SimpleDateFormat is used. This class is not thread safe.
> We have a production usecase where multiple HUDI datasets are processed in parallel in different threads of a ThreadPool. Each of these threads creates its own SparkRDDBackedWriteClient and calls startCommit() which generates a new commit time. Because SimpleDateFormat is not thread safe, we get corrupted instant times in several threads.
> The solution is to use a thread-specific instance of the SimpleDateFormat for generating new instant times. 
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)