Posted to issues@ozone.apache.org by "Prashant Pogde (Jira)" <ji...@apache.org> on 2021/01/20 18:58:00 UTC

[jira] [Commented] (HDDS-4610) Prepare operation with one OM down leads to unrecoverable state in the downed OM.

    [ https://issues.apache.org/jira/browse/HDDS-4610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17268788#comment-17268788 ] 

Prashant Pogde commented on HDDS-4610:
--------------------------------------

Moving this out of the 1.1.0 release.

> Prepare operation with one OM down leads to unrecoverable state in the downed OM.
> ---------------------------------------------------------------------------------
>
>                 Key: HDDS-4610
>                 URL: https://issues.apache.org/jira/browse/HDDS-4610
>             Project: Hadoop Distributed Data Store
>          Issue Type: Sub-task
>          Components: Ozone Manager
>            Reporter: Aravindan Vijayan
>            Assignee: Aravindan Vijayan
>            Priority: Major
>             Fix For: 1.1.0
>
>
> On a 3-node OM HA setup, running prepare while one OM is down leaves the leader and the remaining follower fully prepared (snapshot taken at the last index and logs purged). When the third node rejoins the quorum, the leader and that node enter an infinite installSnapshot loop, and the system stays in a bad state until it is restarted.


