Posted to dev@ignite.apache.org by "Sergey Chugunov (JIRA)" <ji...@apache.org> on 2019/05/22 13:05:00 UTC
[jira] [Created] (IGNITE-11865) FailureProcessor treats
tcp-comm-worker as blocked when it works on reestablishing connect to
failed client node
Sergey Chugunov created IGNITE-11865:
----------------------------------------
Summary: FailureProcessor treats tcp-comm-worker as blocked when it works on reestablishing connect to failed client node
Key: IGNITE-11865
URL: https://issues.apache.org/jira/browse/IGNITE-11865
Project: Ignite
Issue Type: Bug
Affects Versions: 2.7
Reporter: Sergey Chugunov
Assignee: Sergey Chugunov
Fix For: 2.8
When a client node fails, the tcp-comm-worker thread on the server keeps trying to reestablish the connection to the client until the failed node is removed from the topology (on expiration of clientFailureDetectionTimeout).
Because the tcp-comm-worker thread doesn't update its heartbeat from within its internal loops, FailureProcessor considers it blocked and prints a misleading message to the logs along with a full thread dump.
To avoid polluting the logs with unnecessary messages, we need to teach tcp-comm-worker to update its heartbeat timestamp in FailureProcessor.
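The idea can be illustrated with a minimal, standalone sketch. It does not use Ignite's actual classes: HeartbeatSketch, Worker, isBlocked and reconnect are hypothetical names chosen for illustration, and the injected clock and threshold are assumptions. It only shows why a long retry loop looks blocked to a heartbeat-based watchdog unless the loop refreshes its own timestamp on each iteration.

```java
import java.util.concurrent.atomic.AtomicLong;
import java.util.function.IntPredicate;
import java.util.function.LongSupplier;

// Hypothetical, simplified model; not Ignite's real worker or failure-processor API.
final class HeartbeatSketch {
    /** A worker that records the time of its last sign of life. */
    static final class Worker {
        private final LongSupplier clock;
        private final AtomicLong heartbeatTs;

        Worker(LongSupplier clock) {
            this.clock = clock;
            this.heartbeatTs = new AtomicLong(clock.getAsLong());
        }

        /** Called by the worker itself to signal that it is making progress. */
        void updateHeartbeat() {
            heartbeatTs.set(clock.getAsLong());
        }

        long heartbeatTs() {
            return heartbeatTs.get();
        }
    }

    /** Watchdog check: a worker counts as blocked if its heartbeat is older than the threshold. */
    static boolean isBlocked(Worker w, long now, long thresholdMs) {
        return now - w.heartbeatTs() > thresholdMs;
    }

    /**
     * Retry loop that refreshes the heartbeat before every attempt, so a long
     * series of failed reconnects is not mistaken for a hung thread.
     */
    static boolean reconnect(Worker w, int maxAttempts, IntPredicate tryConnect) {
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            w.updateHeartbeat(); // the step the issue says is missing in the internal loops

            if (tryConnect.test(attempt))
                return true;
        }

        return false;
    }

    public static void main(String[] args) {
        AtomicLong fakeClock = new AtomicLong(0);
        Worker w = new Worker(fakeClock::get);

        // Each failed attempt "takes" one second of fake time.
        boolean connected = reconnect(w, 5, attempt -> {
            fakeClock.addAndGet(1000);
            return false;
        });

        System.out.println("connected=" + connected
            + ", blocked=" + isBlocked(w, fakeClock.get(), 3000));
    }
}
```

In this sketch the total time spent retrying exceeds the watchdog threshold, but the worker is still not reported as blocked, because the gap between two consecutive heartbeats stays below the threshold. A worker that never refreshed its heartbeat during the same loop would be flagged.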
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)