Posted to mapreduce-issues@hadoop.apache.org by "Allen Wittenauer (JIRA)" <ji...@apache.org> on 2015/02/05 02:35:34 UTC

[jira] [Updated] (MAPREDUCE-6059) Speed up history server startup time

     [ https://issues.apache.org/jira/browse/MAPREDUCE-6059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Allen Wittenauer updated MAPREDUCE-6059:
----------------------------------------
    Issue Type: Improvement  (was: Bug)

> Speed up history server startup time
> ------------------------------------
>
>                 Key: MAPREDUCE-6059
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6059
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>    Affects Versions: 2.4.0
>            Reporter: Siqi Li
>            Assignee: Siqi Li
>         Attachments: YARN-2366.v1.patch
>
>
> When the history server starts up, it scans every history directory and puts all history files into a cache, even though this cache only stores the 20K most recent history files. As a result, most of the startup time is wasted loading old history files that are immediately evicted, and the startup time will keep increasing unless the number of history files is trimmed. For example, when the history server started up with 2.5M history files in HDFS, startup took ~5 minutes.
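
The idea in the description above can be illustrated with a minimal sketch. This is not the attached patch and does not use Hadoop's real classes: the class name RecentHistoryScan, the method filesToLoad, and the representation of history directories as nested lists are all hypothetical, and the sketch assumes directories and the files inside them are ordered oldest to newest. It only shows how walking newest-first and stopping at the cache capacity avoids touching old files.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not part of Hadoop or of the attached patch.
public class RecentHistoryScan {

    /**
     * Collects at most cacheSize file names, newest first.
     * Assumption: dirsOldestFirst is ordered oldest to newest, and so are
     * the files inside each directory.
     */
    static List<String> filesToLoad(List<List<String>> dirsOldestFirst, int cacheSize) {
        List<String> loaded = new ArrayList<>();
        // Walk directories from newest to oldest and stop once the cache is full,
        // so older directories are never scanned at all.
        for (int i = dirsOldestFirst.size() - 1; i >= 0 && loaded.size() < cacheSize; i--) {
            List<String> files = dirsOldestFirst.get(i);
            for (int j = files.size() - 1; j >= 0 && loaded.size() < cacheSize; j--) {
                loaded.add(files.get(j));
            }
        }
        return loaded;
    }

    public static void main(String[] args) {
        List<List<String>> dirs = List.of(
                List.of("job_1", "job_2"),   // oldest directory
                List.of("job_3", "job_4"),
                List.of("job_5"));           // newest directory
        // With room for 3 entries, the oldest directory is never read.
        System.out.println(filesToLoad(dirs, 3));
    }
}
```

With a capacity of 20K and 2.5M files on disk, a scan of this shape would read only the newest directories rather than all of them; whether the real fix works this way depends on the patch itself.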



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)