You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@ozone.apache.org by "Xu Shao Hong (Jira)" <ji...@apache.org> on 2021/11/12 10:42:00 UTC

[jira] [Created] (HDDS-5976) add raftlog IOException notifier for SCM HA

Xu Shao Hong created HDDS-5976:
----------------------------------

             Summary: add raftlog IOException notifier for SCM HA
                 Key: HDDS-5976
                 URL: https://issues.apache.org/jira/browse/HDDS-5976
             Project: Apache Ozone
          Issue Type: Bug
            Reporter: Xu Shao Hong
            Assignee: Xu Shao Hong


Currently, for SCM HA, the raft log  IO exception will lead the leader to get stuck in an indefinite loop of timeout where server.submitClientRequestAsync(raftClientRequest). 

reason:

Once ratis server get the IO exception during appending logs, it will tell the appendlog task to complete exceptionally and notify the statemachine the event ''notifyLogFailed'', and then stops tasks in the following queue. The task is asynchronous and not "get()" by any one, thus the exception will not be known. SCM only gets timeout exception endlessly.

 



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@ozone.apache.org
For additional commands, e-mail: issues-help@ozone.apache.org