You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "Sagar Sumit (Jira)" <ji...@apache.org> on 2022/07/26 16:03:00 UTC

[jira] [Updated] (HUDI-4073) Optimize FileStatus construction in file listing from metadata table

     [ https://issues.apache.org/jira/browse/HUDI-4073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sagar Sumit updated HUDI-4073:
------------------------------
    Fix Version/s: 0.13.0
                       (was: 0.12.0)

> Optimize FileStatus construction in file listing from metadata table
> --------------------------------------------------------------------
>
>                 Key: HUDI-4073
>                 URL: https://issues.apache.org/jira/browse/HUDI-4073
>             Project: Apache Hudi
>          Issue Type: Task
>          Components: metadata
>            Reporter: Ethan Guo
>            Assignee: Ethan Guo
>            Priority: Blocker
>             Fix For: 0.13.0
>
>
> In BaseTableMetadata::getAllFilesInPartitions, we construct FileStatus instances for the later use of reading files.  This can be deferred, or the necessary information can be stored in simple POJO instead of FileStatus.  The instantiation of FileStatus instances can also be parallelized.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)