Posted to yarn-issues@hadoop.apache.org by "Antal Bálint Steinbach (JIRA)" <ji...@apache.org> on 2019/04/04 15:33:01 UTC

[jira] [Commented] (YARN-5464) Server-Side NM Graceful Decommissioning with RM HA

    [ https://issues.apache.org/jira/browse/YARN-5464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16809980#comment-16809980 ] 

Antal Bálint Steinbach commented on YARN-5464:
----------------------------------------------

New patch uploaded.

Test steps:
 # Set up RM HA on a 3-node cluster
 # Set the exclude file (_yarn.resourcemanager.nodes.exclude-path_) to exclude node 3
 # Start a long sleep job
 # Refresh nodes to start decommission: _yarn rmadmin -refreshNodes -g 100 -server_
 # Find active RM (_yarn rmadmin -getAllServiceState_)
 # Kill active RM
 # Check the newly activated RM logs for decommission recovery
 # Wait for node 3 to be decommissioned, either because the timeout expires or because the sleep job finishes
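
Steps 4 and 5 above could be scripted roughly as follows. This is only a sketch: the function names are hypothetical, it assumes _yarn_ is on the PATH of a host with the RM HA client configuration, and it assumes each line printed by _-getAllServiceState_ has the form "<host:port> <state>", which may vary between Hadoop versions.

```shell
#!/usr/bin/env bash

# Hypothetical helper: read -getAllServiceState output on stdin and print
# the address of every RM whose state column is "active".
# Assumes lines look like "<host:port> <state>".
find_active_rm() {
  awk '$2 == "active" { print $1 }'
}

# Step 4: start server-side graceful decommission with a 100 second timeout.
start_graceful_decommission() {
  yarn rmadmin -refreshNodes -g 100 -server
}

# Step 5: query the HA state of all RMs and keep only the active one.
show_active_rm() {
  yarn rmadmin -getAllServiceState | find_active_rm
}
```

Calling start_graceful_decommission and then show_active_rm gives the address of the RM to kill in step 6. Killing the RM and checking the logs are left manual here, since they depend on how the cluster is deployed.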

> Server-Side NM Graceful Decommissioning with RM HA
> --------------------------------------------------
>
>                 Key: YARN-5464
>                 URL: https://issues.apache.org/jira/browse/YARN-5464
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: graceful, yarn
>            Reporter: Robert Kanter
>            Assignee: Antal Bálint Steinbach
>            Priority: Major
>         Attachments: YARN-5464.001.patch, YARN-5464.002.patch, YARN-5464.003.patch, YARN-5464.004.patch, YARN-5464.005.patch, YARN-5464.006.patch, YARN-5464.wip.patch
>
>
> Make sure to remove the note added by YARN-7094 about RM HA failover not working right.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
