You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@ignite.apache.org by "Ivan Daschinskiy (JIRA)" <ji...@apache.org> on 2018/04/16 15:44:00 UTC
[jira] [Commented] (IGNITE-7786) Changing baseline topology on
cluster may have error in control.sh utility
[ https://issues.apache.org/jira/browse/IGNITE-7786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16439602#comment-16439602 ]
Ivan Daschinskiy commented on IGNITE-7786:
------------------------------------------
Number of retries and timeout between retries are hardcoded in GridClientAbstractProjection, 3 and 1000ms respectively. This affects therefore all GridClientCompute invocations, not only control.sh. I suggest to introduce new System properties: i.e IGNITE_GRID_CLIENT_COMPUTE_RECONNECT_TIMEOUT and IGNITE_GRID_CLIENT_COMPUTE_NUM_RETRIES.
> Changing baseline topology on cluster may have error in control.sh utility
> --------------------------------------------------------------------------
>
> Key: IGNITE-7786
> URL: https://issues.apache.org/jira/browse/IGNITE-7786
> Project: Ignite
> Issue Type: Bug
> Affects Versions: 2.3
> Reporter: Dmitry Sherstobitov
> Priority: Major
>
> Looks like there is hardcoded timeout for waiting result of change baseline operation
> In cluster there is following behaviour:
> # Set new baseline topology version
> # Utility starts, but then fails by connection error
> # Cluster successfully activated
> {code:java}
> ...Start node...
> ...Waiting for topology snapshot...
> > control_utility.sh --baseline version 9
> Control utility
> 2017 Copyright(C) Apache Software Foundation
> User: test
> --------------------------------------------------------------------------------
> Failed to set baseline with specified topology version.
> Connection to cluster failed.
> Error: Failed to perform request (connection failed): /IP
> ...few milliseconds later...
> > control_utility.sh --baseline version 9
> Control utility
> 2017 Copyright(C) Apache Software Foundation
> User: test
> --------------------------------------------------------------------------------
> Cluster state: active
> Current topology version: 9
> Baseline nodes:
> ConsistentID=node1, STATE=ONLINE
> ConsistentID=node10001, STATE=ONLINE
> ConsistentID=node2, STATE=ONLINE
> ConsistentID=node3, STATE=ONLINE
> ConsistentID=node4, STATE=ONLINE
> --------------------------------------------------------------------------------
> Number of baseline nodes: 5
> Other nodes not found.{code}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)