You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "Vinoth Chandar (Jira)" <ji...@apache.org> on 2019/10/25 11:23:00 UTC

[jira] [Updated] (HUDI-1) Design and Implement embedded timeline service to cache filesystem view to reduce listStatus calls

     [ https://issues.apache.org/jira/browse/HUDI-1?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vinoth Chandar updated HUDI-1:
------------------------------
    Summary: Design and Implement embedded timeline service to cache filesystem view to reduce listStatus calls  (was: Desgin and Implement embedded timeline service to cache filesystem view to reduce listStatus calls)

> Design and Implement embedded timeline service to cache filesystem view to reduce listStatus calls
> --------------------------------------------------------------------------------------------------
>
>                 Key: HUDI-1
>                 URL: https://issues.apache.org/jira/browse/HUDI-1
>             Project: Apache Hudi (incubating)
>          Issue Type: Improvement
>          Components: Performance, Write Client
>            Reporter: Balaji Varadarajan
>            Assignee: Balaji Varadarajan
>            Priority: Major
>             Fix For: 0.5.0
>
>   Original Estimate: 1,008h
>  Remaining Estimate: 1,008h
>
> Currently, Hudi writers repeatedly list partitions to create file-system views in executors. This task addresses the reductions in listStatus name-node calls in Hudi 2.0 writers by taking advantage of MVCC view of HUDI and caching file-system view and reusing them.
> An embedded file-system view server on driver will be preloaded with the view. It will act as a cache and service File-system view calls from executors.
>  
> [https://github.com/uber/hudi/issues/433]
> https://github.com/uber/hudi/issues/269
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)