Posted to yarn-issues@hadoop.apache.org by "Junping Du (JIRA)" <ji...@apache.org> on 2015/09/15 15:54:46 UTC

[jira] [Updated] (YARN-3212) RMNode State Transition Update with DECOMMISSIONING state

     [ https://issues.apache.org/jira/browse/YARN-3212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Junping Du updated YARN-3212:
-----------------------------
    Attachment: YARN-3212-v6.patch

Updated the patch (v6) with the following changes to address comments from Wangda and Sunil:
1. Remove an unnecessary debug log.
2. Rename RMNodeEventType.DECOMMISSION_WITH_TIMEOUT to RMNodeEventType.GRACEFUL_DECOMMISSION.
3. Update the transition so that an Unhealthy node moves to Decommissioning when it receives a GRACEFUL_DECOMMISSION event, and keep a node in Decommissioning when it receives a node-unhealthy update.

> RMNode State Transition Update with DECOMMISSIONING state
> ---------------------------------------------------------
>
>                 Key: YARN-3212
>                 URL: https://issues.apache.org/jira/browse/YARN-3212
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: resourcemanager
>            Reporter: Junping Du
>            Assignee: Junping Du
>         Attachments: RMNodeImpl - new.png, YARN-3212-v1.patch, YARN-3212-v2.patch, YARN-3212-v3.patch, YARN-3212-v4.1.patch, YARN-3212-v4.patch, YARN-3212-v5.1.patch, YARN-3212-v5.patch, YARN-3212-v6.patch
>
>
> As proposed in YARN-914, a new “DECOMMISSIONING” state will be added; a node can enter it from the “running” state, triggered by a new “decommissioning” event.
> The new state can transition to “decommissioned” on Resource_Update if there are no running apps on this NM, or on NM reconnect after restart. It also transitions to “decommissioned” when it receives a DECOMMISSIONED event (after a timeout from the CLI).
> In addition, it can go back to “running” if the user decides to cancel the previous decommission by calling recommission on the same node. The reaction to other events is similar to that of the RUNNING state.
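
For readers following the thread, the transitions described in this issue, together with the v6 handling of unhealthy nodes, could be sketched roughly as below. This is an illustrative sketch only and not code from the patch: the class RMNodeTransitionSketch, its State and Event enums, and the next() method are hypothetical names. Only GRACEFUL_DECOMMISSION matches an identifier named in this update; the other event names are stand-ins. The sketch also assumes that a reconnect after restart decommissions the node regardless of running apps, which is one reading of the description.

```java
// Hypothetical sketch of the node state transitions discussed in YARN-3212.
// It does not use or mirror the real RMNodeImpl state machine API.
final class RMNodeTransitionSketch {

    // Node states mentioned in the issue and in the v6 update.
    enum State { RUNNING, UNHEALTHY, DECOMMISSIONING, DECOMMISSIONED }

    // Stand-in event names; only GRACEFUL_DECOMMISSION is named in the update.
    enum Event {
        GRACEFUL_DECOMMISSION,   // start a graceful decommission
        RECOMMISSION,            // user cancels a previous decommission
        DECOMMISSION,            // forced decommission, e.g. after the CLI timeout
        RESOURCE_UPDATE,         // resource update for the node
        RECONNECT_AFTER_RESTART, // NM reconnects after a restart
        UNHEALTHY_UPDATE         // node reports itself unhealthy
    }

    // Returns the next state; any pair not listed leaves the state unchanged.
    static State next(State current, Event event, int runningApps) {
        switch (current) {
            case RUNNING:
            case UNHEALTHY:
                // Both running and unhealthy nodes can begin decommissioning.
                return event == Event.GRACEFUL_DECOMMISSION
                        ? State.DECOMMISSIONING
                        : current;
            case DECOMMISSIONING:
                switch (event) {
                    case RECOMMISSION:
                        return State.RUNNING;
                    case DECOMMISSION:
                    case RECONNECT_AFTER_RESTART:
                        return State.DECOMMISSIONED;
                    case RESOURCE_UPDATE:
                        // Finish only once no apps are running on the node.
                        return runningApps == 0 ? State.DECOMMISSIONED : current;
                    default:
                        // Includes UNHEALTHY_UPDATE: stay in DECOMMISSIONING.
                        return current;
                }
            default:
                return current;
        }
    }
}
```

For example, next(State.DECOMMISSIONING, Event.UNHEALTHY_UPDATE, 2) leaves the node in DECOMMISSIONING, while a RESOURCE_UPDATE with zero running apps moves it to DECOMMISSIONED.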



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)