You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "Vinoth Chandar (Jira)" <ji...@apache.org> on 2019/10/25 11:23:00 UTC
[jira] [Updated] (HUDI-1) Design and Implement embedded timeline
service to cache filesystem view to reduce listStatus calls
[ https://issues.apache.org/jira/browse/HUDI-1?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Vinoth Chandar updated HUDI-1:
------------------------------
Summary: Design and Implement embedded timeline service to cache filesystem view to reduce listStatus calls (was: Desgin and Implement embedded timeline service to cache filesystem view to reduce listStatus calls)
> Design and Implement embedded timeline service to cache filesystem view to reduce listStatus calls
> --------------------------------------------------------------------------------------------------
>
> Key: HUDI-1
> URL: https://issues.apache.org/jira/browse/HUDI-1
> Project: Apache Hudi (incubating)
> Issue Type: Improvement
> Components: Performance, Write Client
> Reporter: Balaji Varadarajan
> Assignee: Balaji Varadarajan
> Priority: Major
> Fix For: 0.5.0
>
> Original Estimate: 1,008h
> Remaining Estimate: 1,008h
>
> Currently, Hudi writers repeatedly list partitions to create file-system views in executors. This task addresses the reductions in listStatus name-node calls in Hudi 2.0 writers by taking advantage of MVCC view of HUDI and caching file-system view and reusing them.
> An embedded file-system view server on driver will be preloaded with the view. It will act as a cache and service File-system view calls from executors.
>
> [https://github.com/uber/hudi/issues/433]
> https://github.com/uber/hudi/issues/269
>
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)