Posted to issues@ozone.apache.org by "Aravindan Vijayan (Jira)" <ji...@apache.org> on 2020/12/17 22:04:00 UTC

[jira] [Updated] (HDDS-4610) Prepare operation with one OM down leads to unrecoverable state in the downed OM.

     [ https://issues.apache.org/jira/browse/HDDS-4610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Aravindan Vijayan updated HDDS-4610:
------------------------------------
    Summary: Prepare operation with one OM down leads to unrecoverable state in the downed OM.  (was: OM prepare with one OM down leads to unrecoverable state in the downed OM.)

> Prepare operation with one OM down leads to unrecoverable state in the downed OM.
> ---------------------------------------------------------------------------------
>
>                 Key: HDDS-4610
>                 URL: https://issues.apache.org/jira/browse/HDDS-4610
>             Project: Hadoop Distributed Data Store
>          Issue Type: Sub-task
>          Components: Ozone Manager
>            Reporter: Aravindan Vijayan
>            Assignee: Aravindan Vijayan
>            Priority: Major
>             Fix For: 1.1.0
>
>
> In a 3-node OM HA setup, running prepare while one OM is down leaves the leader and the remaining follower fully prepared (snapshot at the last index, logs purged). When the third node rejoins the quorum, the leader and that node enter an infinite installSnapshot loop, and the system stays in a bad state until a restart.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
