Posted to mapreduce-dev@hadoop.apache.org by "Zhizhen Hou (JIRA)" <ji...@apache.org> on 2019/02/28 08:55:00 UTC

[jira] [Created] (MAPREDUCE-7187) RMContainerAllocator.ScheduledRequests#getContainerReqToReplace may not find a task when the priority of container is PRIORITY_MAP

Zhizhen Hou created MAPREDUCE-7187:
--------------------------------------

             Summary: RMContainerAllocator.ScheduledRequests#getContainerReqToReplace may not find a task when the priority of container is PRIORITY_MAP
                 Key: MAPREDUCE-7187
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-7187
             Project: Hadoop Map/Reduce
          Issue Type: Bug
          Components: applicationmaster
    Affects Versions: 2.7.5
            Reporter: Zhizhen Hou


The resource manager may have allocated a map container on a host ("h1", for example) for an application before the MRAppMaster has fetched that container. In the meantime, the MRAppMaster receives a task failure event for a task that ran on h1, and the event causes h1 to be blacklisted. When the MRAppMaster sends its next heartbeat, it receives the container on h1 but cannot assign it, because the container is on a blacklisted host. #getContainerReqToReplace then fails to return another task whose request could be replaced, which may cause a map task to hang forever.
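
The sequence above can be modeled with a small, self-contained Java sketch. This is not the Hadoop source: the class BlacklistRaceSketch, its fields, and its methods are all hypothetical names chosen for illustration. In particular, the idea that the lookup consults a per-host index which can still hold an already-assigned attempt is an assumption about one plausible way the lookup can come up empty; the real RMContainerAllocator may differ.

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.HashMap;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Set;

// Hypothetical, simplified model of the reported scenario; NOT the Hadoop source.
class BlacklistRaceSketch {
    // Pending map requests: attempt id -> preferred host.
    final Map<String, String> pendingMaps = new LinkedHashMap<>();
    // Per-host index of attempt ids. ASSUMPTION: entries can go stale,
    // i.e. stay here after the attempt has left pendingMaps.
    final Map<String, Deque<String>> hostIndex = new HashMap<>();
    final Set<String> blacklistedHosts = new HashSet<>();
    // Asks the simulated resource manager still owes this application.
    int outstandingAsks = 0;

    void request(String attemptId, String host) {
        pendingMaps.put(attemptId, host);
        hostIndex.computeIfAbsent(host, h -> new ArrayDeque<>()).addLast(attemptId);
        outstandingAsks++;
    }

    // The attempt received a container; only pendingMaps is cleaned up (assumed).
    void markAssigned(String attemptId) {
        pendingMaps.remove(attemptId);
    }

    // Returns the attempt id whose request should be re-sent, or null if none is found.
    String findRequestToReplace(String containerHost) {
        Deque<String> attempts = hostIndex.get(containerHost);
        if (attempts != null && !attempts.isEmpty()) {
            String attemptId = attempts.pollLast();
            // A stale id is dropped here and no other candidate is tried.
            return pendingMaps.containsKey(attemptId) ? attemptId : null;
        }
        // No host-local entry at all: fall back to any pending map.
        return pendingMaps.isEmpty() ? null : pendingMaps.keySet().iterator().next();
    }

    // Handles one container delivered by a heartbeat; returns true if it was assigned.
    boolean onContainerAllocated(String containerHost) {
        if (blacklistedHosts.contains(containerHost)) {
            // The container is unusable: try to re-send one pending ask to compensate.
            String replacement = findRequestToReplace(containerHost);
            if (replacement != null) {
                outstandingAsks++;
            }
            return false; // the container is released either way
        }
        if (pendingMaps.isEmpty()) {
            return false;
        }
        markAssigned(pendingMaps.keySet().iterator().next());
        return true;
    }

    static BlacklistRaceSketch runScenario() {
        BlacklistRaceSketch am = new BlacklistRaceSketch();
        am.request("m1", "h1");
        am.request("m2", "h1");
        // The resource manager satisfies both asks with containers on h1.
        am.outstandingAsks -= 2;
        // The first container is fetched and m2 starts on h1; its index entry stays behind.
        am.markAssigned("m2");
        // The second container is allocated but not yet fetched when m2 fails on h1.
        am.blacklistedHosts.add("h1");
        // The next heartbeat delivers the second container, which is on a blacklisted host.
        am.onContainerAllocated("h1");
        return am;
    }

    public static void main(String[] args) {
        BlacklistRaceSketch am = runScenario();
        System.out.println("pending=" + am.pendingMaps.keySet()
            + " outstandingAsks=" + am.outstandingAsks);
    }
}
```

In this model, m1 is still pending after the heartbeat while outstandingAsks is 0, so nothing will ever trigger a new allocation for it. That corresponds to the hang described above, under the stated assumptions.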



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: mapreduce-dev-unsubscribe@hadoop.apache.org
For additional commands, e-mail: mapreduce-dev-help@hadoop.apache.org