You are viewing a plain text version of this content. The canonical link for it is here.
Posted to hdfs-dev@hadoop.apache.org by "Aravindan Vijayan (Jira)" <ji...@apache.org> on 2019/10/03 20:39:00 UTC

[jira] [Created] (HDDS-2241) Optimize the refresh pipeline logic used by KeyManagerImpl to obtain the pipeline for a key

Aravindan Vijayan created HDDS-2241:
---------------------------------------

             Summary: Optimize the refresh pipeline logic used by KeyManagerImpl to obtain the pipeline for a key
                 Key: HDDS-2241
                 URL: https://issues.apache.org/jira/browse/HDDS-2241
             Project: Hadoop Distributed Data Store
          Issue Type: Bug
          Components: Ozone Manager
            Reporter: Aravindan Vijayan
            Assignee: Aravindan Vijayan


Currently, while looking up a key, the Ozone Manager gets the pipeline location information from SCM through an RPC for every block in the key. For large files > 1GB, we may end up making ~4 RPC calls for this. This can be optimized in a couple of ways

* We can implement a batch getContainerWithPipeline API in SCM using which we can get the pipeline info locations for all the blocks for a file.
* Instead, we can have a method local cache for ContainerID -> Pipeline that we got from SCM so that we don't need to make calls to SCM again for the same pipeline.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-dev-unsubscribe@hadoop.apache.org
For additional commands, e-mail: hdfs-dev-help@hadoop.apache.org