You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-issues@hadoop.apache.org by "Jason Lowe (JIRA)" <ji...@apache.org> on 2014/04/18 15:33:22 UTC

[jira] [Updated] (MAPREDUCE-5848) MapReduce counts forcibly preempted containers as FAILED

     [ https://issues.apache.org/jira/browse/MAPREDUCE-5848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jason Lowe updated MAPREDUCE-5848:
----------------------------------

     Target Version/s: 2.5.0
    Affects Version/s: 2.1.0-beta

Moving this to MAPREDUCE since it's an issue with the MaoReduce AM and not YARN.

Note that the small patch may not be sufficient to completely solve the issue of preempted tasks being interpreted as FAILED rather than KILLED.  Due to races between the AM heartbeat to the RM and the AM receiving task status updates via the task umbilical the AM may still think that a task attempt failed even though it was preempted.  When a task attempt receives SIGTERM from the NM it causes the FileSystem to close via the shutdown hook and often causes exceptions within the task.  Those exceptions are reported as task failure via the task umbilical.  If that status arrives at the AM before the AM receives the preempted container status from the RM then the AM counts it as a task failure rather than a task preempt.

> MapReduce counts forcibly preempted containers as FAILED
> --------------------------------------------------------
>
>                 Key: MAPREDUCE-5848
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5848
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>    Affects Versions: 2.1.0-beta
>            Reporter: Carlo Curino
>            Assignee: Subramaniam Krishnan
>         Attachments: YARN-1958.patch
>
>
> The MapReduce AM is considering a forcibly preempted container as FAILED, while I think it should be considered as KILLED (i.e., not count against the maximum number of failures). 



--
This message was sent by Atlassian JIRA
(v6.2#6252)