You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@ignite.apache.org by "Pavel Kovalenko (JIRA)" <ji...@apache.org> on 2018/09/07 12:55:00 UTC
[jira] [Created] (IGNITE-9494) Communication error resolver may be
invoked when topology is under construction
Pavel Kovalenko created IGNITE-9494:
---------------------------------------
Summary: Communication error resolver may be invoked when topology is under construction
Key: IGNITE-9494
URL: https://issues.apache.org/jira/browse/IGNITE-9494
Project: Ignite
Issue Type: Bug
Components: cache
Affects Versions: 2.5
Reporter: Pavel Kovalenko
Fix For: 2.7
Zookeeper Discovery.
During massive node start and join to topology there can happen communication error problems which can lead to invoking communication error resolver.
Communication error resolver initiates a peer-to-peer ping process on all alive nodes. Youngest nodes in a cluster may have the not complete picture about alive nodes in a cluster. This can lead to a situation, that youngest node will not ping all available nodes, and the coordinator may decide that those nodes have an unstable network and unexpectedly kill them.
We should throttle communication error resolver in case of massive node join and give them a time to get the complete picture about topology.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)