Posted to yarn-issues@hadoop.apache.org by "sandflee (JIRA)" <ji...@apache.org> on 2015/11/02 00:00:28 UTC

[jira] [Commented] (YARN-4277) containers would be leaked if nm crashed and rm failover

    [ https://issues.apache.org/jira/browse/YARN-4277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14984585#comment-14984585 ] 

sandflee commented on YARN-4277:
--------------------------------

Is there any plan to store NM info? [~jlowe] [~djp] [~jianhe], we could store just the NM info, not the containers running on the NM.
Without NM info:
1, containers could be leaked, as in this issue.
2, the AM learns nothing if the NM crashes permanently and the RM fails over.
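
One possible reading of the proposal above, as a toy sketch only. None of these names are YARN classes: NodeStore, ToyRM, and the "RUNNING"/"LOST" states are hypothetical, and a dict stands in for the durable state store. It assumes the RM persists only a per-node state, so that a newly active RM can (1) tell a re-registering NM to clean up containers from a node that was already declared lost, and (2) notice nodes that never come back.

```python
class NodeStore:
    """Hypothetical persistent store; a dict stands in for durable storage."""

    def __init__(self):
        self.state = {}  # node_id -> "RUNNING" or "LOST"

    def mark(self, node_id, state):
        self.state[node_id] = state


class ToyRM:
    """Toy RM that keeps per-node state in a store that survives failover."""

    def __init__(self, store):
        self.store = store
        self.seen = set()  # nodes that registered with this RM instance

    def on_register(self, node_id, recovered):
        """Return the recovered containers the NM should clean up."""
        was_lost = self.store.state.get(node_id) == "LOST"
        self.seen.add(node_id)
        self.store.mark(node_id, "RUNNING")
        # Containers from a node already declared lost were reported
        # complete to the AM, so they are stale.
        return set(recovered) if was_lost else set()

    def on_node_timeout(self, node_id):
        self.store.mark(node_id, "LOST")

    def missing_nodes(self):
        """Stored RUNNING nodes that never registered with this RM instance."""
        running = {n for n, s in self.store.state.items() if s == "RUNNING"}
        return running - self.seen
```

With this shape, a new ToyRM built on the same NodeStore after failover can cover both cases from the list: on_register returns the stale containers for a node stored as "LOST", and missing_nodes lists nodes the new RM would still have to expire and report to the AM.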

> containers would be leaked if nm crashed  and rm failover
> ---------------------------------------------------------
>
>                 Key: YARN-4277
>                 URL: https://issues.apache.org/jira/browse/YARN-4277
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: sandflee
>
> NM restart and RM HA are enabled.
> 1, The NM crashes; after the timeout, the RM sends a container-complete message to the corresponding AM.
> 2, The RM fails over.
> 3, The NM restarts and registers with the RM, recovering the containers running on the NM; these containers are leaked.
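
The three steps above can be walked through with a toy model. This is a sketch under stated assumptions, not YARN code: ToyRM, failover, and simulate_leak are hypothetical names, and the model assumes the newly active RM starts with no memory of which nodes the previous RM had already expired.

```python
class ToyRM:
    """Toy RM: tracks the containers it believes are live on each node."""

    def __init__(self):
        self.live = {}  # node_id -> set of container ids

    def register_node(self, node_id, containers):
        # The RM accepts whatever containers the NM reports at registration.
        self.live[node_id] = set(containers)

    def expire_node(self, node_id):
        # NM timeout: the node's containers are reported complete to the AM.
        return self.live.pop(node_id, set())


def failover(old_rm):
    # Assumption: the new active RM has no per-node memory from the old one.
    return ToyRM()


def simulate_leak():
    am_running = {"c1", "c2"}      # the AM's view of its running containers
    on_node = set(am_running)      # what is actually running on the NM

    rm = ToyRM()
    rm.register_node("nm1", on_node)

    # Step 1: the NM crashes; after the timeout the AM is told the
    # containers completed, so it drops them from its view.
    am_running -= rm.expire_node("nm1")

    # Step 2: the RM fails over.
    rm = failover(rm)

    # Step 3: the NM restarts, recovers its containers, and registers.
    rm.register_node("nm1", on_node)

    # Containers the RM now tracks as live but the AM no longer knows about.
    return rm.live["nm1"] - am_running
```

In this model simulate_leak() returns both containers: the new RM tracks them as live, while the AM has already treated them as completed and will never release them.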



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)