Posted to dev@tez.apache.org by "Syed Shameerur Rahman (Jira)" <ji...@apache.org> on 2023/03/04 15:07:00 UTC

[jira] [Created] (TEZ-4479) Eagerly Init/Load FileSystem In Tez Task Containers

Syed Shameerur Rahman created TEZ-4479:
------------------------------------------

             Summary: Eagerly Init/Load FileSystem In Tez Task Containers
                 Key: TEZ-4479
                 URL: https://issues.apache.org/jira/browse/TEZ-4479
             Project: Apache Tez
          Issue Type: Improvement
            Reporter: Syed Shameerur Rahman
            Assignee: Syed Shameerur Rahman
             Fix For: 0.10.3


Initializing/loading a FileSystem such as S3 can take ~10s - ~20s when it is called for the first time, while the time taken by subsequent calls is negligible. Loading the FileSystem well before it is first used can therefore save some time. This is especially useful for pre-warmed Tez containers, where the Tez task containers come up when the Application Master (AM) is launched rather than on demand, which is the default behavior. It is also useful when Mapper tasks spend considerable time consuming the upstream shuffle data before moving on to FileSystem operations. In all such cases the FileSystem load time can be overlapped with other work instead of being paid on first use.
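
To illustrate the idea, here is a minimal sketch of eager initialization on a background thread. It is not the proposed Tez patch: the class name EagerInit and the Supplier passed to it are hypothetical, and the sketch deliberately avoids any Hadoop dependency so it can run on its own. In a real task container the supplier would presumably wrap the first FileSystem lookup (for example Hadoop's FileSystem.get(uri, conf)), which is the expensive call described above.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.function.Supplier;

/**
 * Hypothetical helper (not part of Tez): starts an expensive, one-time
 * initialization in the background as soon as it is constructed, so the
 * cost overlaps with whatever the caller does before it needs the result.
 */
final class EagerInit<T> {
    private final CompletableFuture<T> future;

    EagerInit(Supplier<T> slowInit) {
        // Daemon thread: a pending warm-up must never keep the JVM alive.
        ExecutorService executor = Executors.newSingleThreadExecutor(r -> {
            Thread t = new Thread(r, "eager-init");
            t.setDaemon(true);
            return t;
        });
        // The supplier stands in for the slow first-time load
        // (assumption: something like a FileSystem lookup in a real task).
        this.future = CompletableFuture.supplyAsync(slowInit, executor);
        // The already-submitted task still runs; no new tasks are accepted.
        executor.shutdown();
    }

    /** Blocks only if the background initialization has not finished yet. */
    T get() {
        return future.join();
    }
}
```

A container would create the EagerInit instance at startup, carry on with its other work (waiting for a task assignment, consuming shuffle data), and call get() only when the FileSystem is actually needed. If the warm-up has already completed, get() returns immediately; if it failed, join() rethrows the failure at that point, so the caller sees the error where it would have seen it without eager loading.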



--
This message was sent by Atlassian Jira
(v8.20.10#820010)