You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@ignite.apache.org by "Ivan Daschinskiy (JIRA)" <ji...@apache.org> on 2018/04/16 15:44:00 UTC

[jira] [Commented] (IGNITE-7786) Changing baseline topology on cluster may have error in control.sh utility

    [ https://issues.apache.org/jira/browse/IGNITE-7786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16439602#comment-16439602 ] 

Ivan Daschinskiy commented on IGNITE-7786:
------------------------------------------

Number of retries and timeout between retries are hardcoded in GridClientAbstractProjection, 3 and 1000ms respectively. This affects therefore all GridClientCompute invocations, not only control.sh. I suggest to introduce new System properties: i.e IGNITE_GRID_CLIENT_COMPUTE_RECONNECT_TIMEOUT and IGNITE_GRID_CLIENT_COMPUTE_NUM_RETRIES.

> Changing baseline topology on cluster may have error in control.sh utility
> --------------------------------------------------------------------------
>
>                 Key: IGNITE-7786
>                 URL: https://issues.apache.org/jira/browse/IGNITE-7786
>             Project: Ignite
>          Issue Type: Bug
>    Affects Versions: 2.3
>            Reporter: Dmitry Sherstobitov
>            Priority: Major
>
> Looks like there is hardcoded timeout for waiting result of change baseline operation
> In cluster there is following behaviour: 
>  # Set new baseline topology version
>  # Utility starts, but then fails by connection error
>  # Cluster successfully activated
> {code:java}
> ...Start node...
> ...Waiting for topology snapshot...
> > control_utility.sh --baseline version 9
> Control utility 
> 2017 Copyright(C) Apache Software Foundation
> User: test
> --------------------------------------------------------------------------------
> Failed to set baseline with specified topology version.
> Connection to cluster failed.
> Error: Failed to perform request (connection failed): /IP
> ...few milliseconds later...
> > control_utility.sh --baseline version 9
> Control utility 
> 2017 Copyright(C) Apache Software Foundation
> User: test
> --------------------------------------------------------------------------------
> Cluster state: active
> Current topology version: 9
> Baseline nodes:
> ConsistentID=node1, STATE=ONLINE
> ConsistentID=node10001, STATE=ONLINE
> ConsistentID=node2, STATE=ONLINE
> ConsistentID=node3, STATE=ONLINE
> ConsistentID=node4, STATE=ONLINE
> --------------------------------------------------------------------------------
> Number of baseline nodes: 5
> Other nodes not found.{code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)