You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-issues@hadoop.apache.org by "Ahmed Hussein (Jira)" <ji...@apache.org> on 2020/01/21 19:10:00 UTC

[jira] [Moved] (MAPREDUCE-7261) Memory efficiency in speculator

     [ https://issues.apache.org/jira/browse/MAPREDUCE-7261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ahmed Hussein moved YARN-9597 to MAPREDUCE-7261:
------------------------------------------------

        Key: MAPREDUCE-7261  (was: YARN-9597)
    Project: Hadoop Map/Reduce  (was: Hadoop YARN)

> Memory efficiency in speculator 
> --------------------------------
>
>                 Key: MAPREDUCE-7261
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-7261
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>            Reporter: Ahmed Hussein
>            Assignee: Ahmed Hussein
>            Priority: Minor
>
> The data structures in speculator and runtime-estimator are bloating. Data elements such as (taskID, TA-ID, task stats, tasks speculated, tasks finished..etc) are added to the concurrent maps but never removed.
> For long running jobs, there are couple of issues:
>  # memory leakage: the speculator memory usage increases over time. 
>  # performance: keeping large structures in the heap affects the performance due to locality and cache misses.
> *Suggested Fixes:*
> - When a TA transitions to {{MoveContainerToSucceededFinishingTransition}}, the TA notifies the speculator. The latter handles the event by cleaning the internal structure accordingly.
> - When a task transitions is failed/killed, the speculator is notified to clean the internal data structure.
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: mapreduce-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-help@hadoop.apache.org