Posted to mapreduce-issues@hadoop.apache.org by "Allen Wittenauer (JIRA)" <ji...@apache.org> on 2014/07/31 01:47:40 UTC

[jira] [Resolved] (MAPREDUCE-2047) reduce overhead of findSpeculativeTask

     [ https://issues.apache.org/jira/browse/MAPREDUCE-2047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Allen Wittenauer resolved MAPREDUCE-2047.
-----------------------------------------

    Resolution: Incomplete

Speculative execution has had a major overhaul since then. Closing this as stale.

> reduce overhead of findSpeculativeTask
> --------------------------------------
>
>                 Key: MAPREDUCE-2047
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2047
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: jobtracker
>         Environment: hadoop-20 with HADOOP-2141
>            Reporter: Joydeep Sen Sarma
>            Assignee: Joydeep Sen Sarma
>         Attachments: 2047.3.patch, mapreduce-2047.1.patch, mapreduce-2047.2.patch
>
>
> We are bottlenecked (in the JT) on the jobtracker lock, and calls to findSpeculativeTask frequently show up as one of the top routines (by time) called while holding this lock.
> This routine calls canBeSpeculated() and hasRunOnMachine() for each task in a candidate job. Both routines are reasonably expensive when invoked repeatedly for thousands of tasks. The top candidates for speculation from a job only need to be refreshed periodically (and not once every heartbeat), and we can avoid most of these invocations this way.
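
The periodic-refresh idea in the description can be sketched roughly as follows. This is an illustration only and is not taken from the attached patches: the class SpeculationCandidateCache, its constructor parameters, and the refresh interval are all hypothetical names, and the expensive per-task scan (the part that would call canBeSpeculated() and hasRunOnMachine()) is passed in as a plain Supplier rather than modelled on real JobTracker code.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.function.LongSupplier;
import java.util.function.Supplier;

/**
 * Hypothetical helper (not a Hadoop class): caches the result of an
 * expensive candidate scan and recomputes it at most once per interval.
 */
final class SpeculationCandidateCache<T> {
    private final Supplier<List<T>> expensiveScan;  // assumed: walks all tasks of a job
    private final LongSupplier clockMillis;         // injectable clock, eases testing
    private final long refreshIntervalMillis;

    private List<T> cached = Collections.emptyList();
    private long lastRefreshMillis;
    private boolean initialized;

    SpeculationCandidateCache(Supplier<List<T>> expensiveScan,
                              LongSupplier clockMillis,
                              long refreshIntervalMillis) {
        this.expensiveScan = expensiveScan;
        this.clockMillis = clockMillis;
        this.refreshIntervalMillis = refreshIntervalMillis;
    }

    /** Returns the cached candidates, rescanning only when the interval has elapsed. */
    synchronized List<T> candidates() {
        long now = clockMillis.getAsLong();
        if (!initialized || now - lastRefreshMillis >= refreshIntervalMillis) {
            // Copy the scan result so later changes by the caller do not leak in.
            cached = Collections.unmodifiableList(new ArrayList<>(expensiveScan.get()));
            lastRefreshMillis = now;
            initialized = true;
        }
        return cached;
    }
}
```

With a cache like this, a heartbeat handler would call candidates() instead of rescanning every task. The cached list can be stale between refreshes, so a caller would presumably still need a cheap per-candidate check (for example, that the task has not finished in the meantime) before launching a speculative attempt.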


