Posted to issues@ozone.apache.org by "Stephen O'Donnell (Jira)" <ji...@apache.org> on 2021/04/08 15:04:00 UTC

[jira] [Resolved] (HDDS-4989) Decommission CLI should return details of nodes which fail

     [ https://issues.apache.org/jira/browse/HDDS-4989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Stephen O'Donnell resolved HDDS-4989.
-------------------------------------
    Fix Version/s: 1.2.0
       Resolution: Fixed

> Decommission CLI should return details of nodes which fail
> ----------------------------------------------------------
>
>                 Key: HDDS-4989
>                 URL: https://issues.apache.org/jira/browse/HDDS-4989
>             Project: Apache Ozone
>          Issue Type: Improvement
>          Components: SCM, SCM Client
>            Reporter: Stephen O'Donnell
>            Assignee: Stephen O'Donnell
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 1.2.0
>
>
> With the current decommission / recommission / maintenance mode commands, you can pass a list of hosts to perform the operation on. If any of these hosts fail to enter the decommission / maintenance workflow, the command gives no feedback about the error. Some of the hosts can silently fail and the only way to know is to inspect the SCM log.
> The most common cause of failure is an attempt to move a node that is undergoing maintenance into decommission, or vice versa, because that transition is not allowed.
> This change allows details of any failed nodes to be fed back to the client. If the client detects that any of the nodes have failed, details will be written to stderr and the command exit code will be non-zero.
> Note that even though the exit code is non-zero, the command may have partially worked.
> Also note that the errors which are fed back only cover transitioning the node into the admin workflow. The node can still fail later for other reasons, and those failures will not be fed back to the client. This is because the client does not wait for the process to complete, but exits after confirmation that the command has been processed by SCM.
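
The behaviour described in the quoted issue can be illustrated with a small, self-contained sketch. It is not the Ozone implementation: the state names, the NodeError type, and the functions start_decommission and run_cli are all hypothetical and exist only to show the idea. The server-side step collects one error per host that cannot enter the workflow, and the client writes those errors to stderr and returns a non-zero exit code, even though the other hosts were accepted.

```python
import sys
from dataclasses import dataclass
from typing import Dict, List

# Hypothetical operational states; the names are illustrative only.
IN_SERVICE = "IN_SERVICE"
DECOMMISSIONING = "DECOMMISSIONING"
ENTERING_MAINTENANCE = "ENTERING_MAINTENANCE"


@dataclass
class NodeError:
    """One failed host and the reason it could not enter the workflow."""
    host: str
    reason: str


def start_decommission(hosts: List[str],
                       node_states: Dict[str, str]) -> List[NodeError]:
    """Stand-in for the server side: try each host, collect failures."""
    errors: List[NodeError] = []
    for host in hosts:
        state = node_states.get(host)
        if state is None:
            # Assumed extra failure case, not taken from the issue text.
            errors.append(NodeError(host, "host is not known"))
        elif state == ENTERING_MAINTENANCE:
            # The disallowed transition the issue describes.
            errors.append(NodeError(
                host, "cannot move from maintenance to decommission"))
        else:
            node_states[host] = DECOMMISSIONING
    return errors


def run_cli(hosts: List[str], node_states: Dict[str, str], err=None) -> int:
    """Stand-in for the client: report failures, return the exit code."""
    err = sys.stderr if err is None else err
    errors = start_decommission(hosts, node_states)
    for e in errors:
        print(f"Error: {e.host}: {e.reason}", file=err)
    # Non-zero if anything failed, even if other hosts were accepted.
    return 1 if errors else 0
```

With two hosts where one is already entering maintenance, run_cli returns 1 and reports only that host, while the other host is still moved to the decommissioning state. This matches the note that a non-zero exit code does not mean nothing happened.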



--
This message was sent by Atlassian Jira
(v8.3.4#803005)