Posted to issues@nifi.apache.org by "Shawn Weeks (Jira)" <ji...@apache.org> on 2022/01/10 18:50:00 UTC

[jira] [Commented] (NIFI-9559) Zookeeper Client Can't Reconnect - Session timeout has elapsed while SUSPENDED

    [ https://issues.apache.org/jira/browse/NIFI-9559?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17472230#comment-17472230 ] 

Shawn Weeks commented on NIFI-9559:
-----------------------------------

What's interesting is that this only affects nodes that have gotten into this weird Curator state. If I restart Zookeeper, other nodes reconnect just fine. There is something different about how this node lost its connection to Zookeeper. You might have to reproduce this by pulling the plug on Zookeeper, or by blocking connections via a firewall and then restoring them.
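
In case it helps with reproducing, here is a rough sketch of the firewall approach. It is untested against this cluster and makes several assumptions: that it runs as root on the affected NiFi node, that iptables is the firewall in use, that Zookeeper listens on the default client port 2181, and that the block needs to outlast the client's session timeout (the 120 seconds here is a guess, not a measured value). The function and variable names are made up for illustration. It defaults to a dry run that only prints the commands.

```shell
#!/bin/sh
# Sketch only: block outbound traffic to Zookeeper, wait, then restore it.
# Assumptions: iptables is in use and Zookeeper is on the default port 2181.
ZK_PORT="${ZK_PORT:-2181}"
# Assumed to be longer than the Zookeeper session timeout; adjust as needed.
BLOCK_SECONDS="${BLOCK_SECONDS:-120}"
# DRY_RUN=1 prints the commands instead of executing them.
DRY_RUN="${DRY_RUN:-1}"

run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "would run: $*"
    else
        "$@"
    fi
}

# Insert a rule that drops packets sent from this node to the Zookeeper port.
block_zk() {
    run iptables -I OUTPUT -p tcp --dport "$ZK_PORT" -j DROP
}

# Delete the same rule so connections can be established again.
unblock_zk() {
    run iptables -D OUTPUT -p tcp --dport "$ZK_PORT" -j DROP
}

# Block, wait out the session timeout, then restore.
reproduce() {
    block_zk
    run sleep "$BLOCK_SECONDS"
    unblock_zk
}
```

Calling reproduce with the default DRY_RUN=1 only prints the three commands. Setting DRY_RUN=0 and running as root would apply them for real, after which the node's logs can be checked for the SUSPENDED error.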

> Zookeeper Client Can't Reconnect - Session timeout has elapsed while SUSPENDED
> ------------------------------------------------------------------------------
>
>                 Key: NIFI-9559
>                 URL: https://issues.apache.org/jira/browse/NIFI-9559
>             Project: Apache NiFi
>          Issue Type: Bug
>            Reporter: Shawn Weeks
>            Assignee: Nathan Gough
>            Priority: Minor
>         Attachments: nifi_and_zookeeper_logs.txt
>
>
> It's possible this is fixed in 1.15.2, but I don't see any commits that would have resolved it. After losing its connection to Zookeeper, a NiFi node never successfully reconnects to Zookeeper or the cluster. Instead it repeatedly reports errors about there being no Cluster Coordinator and "Session timeout has elapsed while SUSPENDED" until NiFi is restarted.
> The error is the same one described at https://issues.apache.org/jira/browse/CURATOR-405; however, that patch has been in NiFi for several versions now.
> The NiFi version is 1.14.0 and the Zookeeper version is 3.6.3.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)