You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-dev@hadoop.apache.org by "Rohith (JIRA)" <ji...@apache.org> on 2014/02/05 13:52:09 UTC

[jira] [Created] (YARN-1686) NodeManager.resyncWithRM() does not handle exception which cause NodeManger to Hang.

Rohith created YARN-1686:
----------------------------

             Summary: NodeManager.resyncWithRM() does not handle exception which cause NodeManger to Hang.
                 Key: YARN-1686
                 URL: https://issues.apache.org/jira/browse/YARN-1686
             Project: Hadoop YARN
          Issue Type: Bug
          Components: nodemanager
    Affects Versions: 2.3.0
            Reporter: Rohith
            Assignee: Rohith


During start of NodeManager,if registration with resourcemanager throw exception then nodemager shutdown happens. 

Consider case where NM-1 is registered with RM. RM issued Resync to NM. If any exception thrown in "resyncWithRM" (starts new thread which does not handle exception) during RESYNC evet, then this thread is lost. NodeManger enters hanged state. 



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)