Posted to issues@ambari.apache.org by "D M Murali Krishna Reddy (Jira)" <ji...@apache.org> on 2023/06/13 18:40:00 UTC

[jira] [Commented] (AMBARI-25950) Exclude hosts getting erased when RM, NN are restarted

    [ https://issues.apache.org/jira/browse/AMBARI-25950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17732212#comment-17732212 ] 

D M Murali Krishna Reddy commented on AMBARI-25950:
---------------------------------------------------

I have two working approaches:
 # Just before sending the ExecutionCommand to the agents, update the CommandParams entry all_decommissioned_hosts in AgentsCommandPublisher#populateExecutionCommandsClusters, using AmbariCustomCommandExecutionHelper#calculateDecommissionedNodes based on the ExecutionCommand role (RM or NN).
 # Update the CommandParams entry all_decommissioned_hosts in the same way as above, but at these two places:
 ## For custom commands, in AmbariCustomCommandExecutionHelper#addExecutionCommandsToStage, only for valid custom commands (excluding service check and decommission commands).
 ## In AmbariManagementControllerImpl#createHostAction, just before setting the commandParams for the executionCommand.
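Both approaches come down to the same step: fill in all_decommissioned_hosts in the commandParams whenever the command targets RM or NN. Below is a minimal, self-contained sketch of that step, not the actual patch. The class name, the role strings, the comma-separated value format, and the calculateDecommissionedNodes stand-in are all assumptions made for illustration; the real helper's signature is not shown in this thread.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;

final class DecommissionedHostsSketch {
    // Parameter name as given in the issue description.
    static final String ALL_DECOMMISSIONED_HOSTS = "all_decommissioned_hosts";

    // Assumed role names for the masters that rewrite an exclude file on restart.
    static final Set<String> MASTER_ROLES = Set.of("RESOURCEMANAGER", "NAMENODE");

    // Hypothetical stand-in for AmbariCustomCommandExecutionHelper#calculateDecommissionedNodes.
    // Here it only looks up a precomputed list of decommissioned hosts per master role.
    static List<String> calculateDecommissionedNodes(
            String role, Map<String, List<String>> decommissionedByMaster) {
        return decommissionedByMaster.getOrDefault(role, List.of());
    }

    // Returns a copy of commandParams with all_decommissioned_hosts set when the
    // command targets RM or NN; commands for any other role are returned unchanged.
    static Map<String, String> withDecommissionedHosts(
            String role,
            Map<String, String> commandParams,
            Map<String, List<String>> decommissionedByMaster) {
        Map<String, String> updated = new HashMap<>(commandParams);
        if (MASTER_ROLES.contains(role)) {
            List<String> hosts = calculateDecommissionedNodes(role, decommissionedByMaster);
            // Assumed format: a single comma-separated string of host names.
            updated.put(ALL_DECOMMISSIONED_HOSTS, String.join(",", hosts));
        }
        return updated;
    }
}
```

The two approaches differ only in where a call like withDecommissionedHosts would sit: at publish time in the agent command publisher (approach 1), or at command creation time so that the value is stored with the stage (approach 2).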

 

[~brahmareddy], [~vjasani], [~vishalsuvagia], [~eub]
Can you suggest which approach to choose? I am inclined towards approach 2, as the commandParams get added to the stages.

> Exclude hosts getting erased when RM, NN are restarted
> ------------------------------------------------------
>
>                 Key: AMBARI-25950
>                 URL: https://issues.apache.org/jira/browse/AMBARI-25950
>             Project: Ambari
>          Issue Type: Bug
>          Components: ambari-server
>            Reporter: D M Murali Krishna Reddy
>            Assignee: D M Murali Krishna Reddy
>            Priority: Major
>
> After decommissioning a NodeManager or a DataNode, if the ResourceManager or NameNode is restarted, the exclude hosts file gets overwritten with empty contents, causing the NM or DN to be recommissioned.
>  
> During an NN or RM restart, all_decommissioned_hosts is not set, so params_linux.py creates the exclude hosts file with empty content.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
