Posted to issues@ignite.apache.org by "Vladimir Goncharov (Jira)" <ji...@apache.org> on 2021/03/16 11:26:00 UTC

[jira] [Comment Edited] (IGNITE-14224) Extend logging of client failure handling.

    [ https://issues.apache.org/jira/browse/IGNITE-14224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17302265#comment-17302265 ] 

Vladimir Goncharov edited comment on IGNITE-14224 at 3/16/21, 11:25 AM:
------------------------------------------------------------------------

Windows 10 Home/Pro

JDK 1.8.251/241 Oracle/Open

Without the fix: an exception is thrown on spi.openSocket in ServerImpl.sendMessageDirectly.

IgniteSpiException : class org.apache.ignite.spi.IgniteSpiException: Failed to send message to address [addr=/127.0.0.1:47500, msg=TcpDiscoveryJoinRequestMessage [node=TcpDiscoveryNode [id=b712cdf2-5e80-4d05-b45c-a113bdeea34b, consistentId=127.0.0.1:47501, addrs=ArrayList [127.0.0.1], sockAddrs=HashSet [/127.0.0.1:47501], discPort=47501, order=0, intOrder=0, lastExchangeTime=1615819551929, loc=true, ver=2.11.0#20210310-sha1:e90f9666, isClient=false], dataPacket=org.apache.ignite.spi.discovery.tcp.internal.DiscoveryDataPacket@33a630fa, super=TcpDiscoveryAbstractMessage [sndNodeId=null, id=3ec89563871-b712cdf2-5e80-4d05-b45c-a113bdeea34b, verifierNodeId=null, topVer=0, pendingIdx=0, failedNodes=null, isClient=false]]]

The connection count doesn't increment and spi.joinTimeout is zero. This leads to an infinite cycle of cluster join attempts.

With the fix: a timeout exception occurs after opening the socket and sending the message, as expected, and the joining node stops correctly after 3 failed connections.

Got exception while starting (will rollback startup routine).
Command protocol successfully stopped: TCP binary

JDK 1.8.261/271/281 Oracle/Open

Works both with and without the patch.
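
The failure mode described above (a connection counter that never increments, combined with a zero spi.joinTimeout) can be sketched in plain Java. This is an illustration only, not the ServerImpl code: the class JoinRetrySketch, the Connector interface and the tryJoin method are hypothetical names, and treating a zero timeout as "no time limit" is an assumption based on the behaviour reported in this comment.

```java
import java.io.IOException;
import java.util.concurrent.TimeUnit;

final class JoinRetrySketch {
    /** Hypothetical stand-in for one attempt to open a socket and send a join request. */
    interface Connector {
        void connect() throws IOException;
    }

    /**
     * Retries until the connector succeeds, maxAttempts failures have been counted,
     * or joinTimeoutMs has elapsed. A joinTimeoutMs of 0 is assumed to mean "no time limit",
     * so in that case the failure counter is the only thing that bounds the loop.
     */
    static boolean tryJoin(Connector connector, int maxAttempts, long joinTimeoutMs) {
        long startNanos = System.nanoTime();
        int failedAttempts = 0;

        while (true) {
            try {
                connector.connect();
                return true;
            }
            catch (IOException e) {
                // If this increment is skipped and joinTimeoutMs == 0, the loop never ends.
                failedAttempts++;
            }

            if (failedAttempts >= maxAttempts)
                return false;

            long elapsedMs = TimeUnit.NANOSECONDS.toMillis(System.nanoTime() - startNanos);
            if (joinTimeoutMs > 0 && elapsedMs >= joinTimeoutMs)
                return false;
        }
    }

    public static void main(String[] args) {
        int[] calls = {0};

        // Simulated connector that always fails, with no join timeout configured.
        boolean joined = tryJoin(() -> {
            calls[0]++;
            throw new IOException("Connection refused (simulated)");
        }, 3, 0L);

        System.out.println("joined=" + joined + ", attempts=" + calls[0]);
    }
}
```

With a connector that always fails, the sketch gives up after the third counted failure, which matches the "stops after 3 failed connections" behaviour reported with the fix.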


was (Author: shadowru):
Java 1.8.251/241 Oracle/Open

W/o fix - exception on spi.openSocket in ServerImpl.sendMessageDirectly.

IgniteSpiException : class org.apache.ignite.spi.IgniteSpiException: Failed to send message to address [addr=/127.0.0.1:47500, msg=TcpDiscoveryJoinRequestMessage [node=TcpDiscoveryNode [id=b712cdf2-5e80-4d05-b45c-a113bdeea34b, consistentId=127.0.0.1:47501, addrs=ArrayList [127.0.0.1], sockAddrs=HashSet [/127.0.0.1:47501], discPort=47501, order=0, intOrder=0, lastExchangeTime=1615819551929, loc=true, ver=2.11.0#20210310-sha1:e90f9666, isClient=false], dataPacket=org.apache.ignite.spi.discovery.tcp.internal.DiscoveryDataPacket@33a630fa, super=TcpDiscoveryAbstractMessage [sndNodeId=null, id=3ec89563871-b712cdf2-5e80-4d05-b45c-a113bdeea34b, verifierNodeId=null, topVer=0, pendingIdx=0, failedNodes=null, isClient=false]]]

The connection count doesn't increment and spi.joinTimeout is zero. This leads to an infinite cycle of cluster join attempts.

With the fix: a timeout exception occurs after opening the socket and sending the message, as expected, and the joining node stops correctly after 3 failed connections.

Got exception while starting (will rollback startup routine).
Command protocol successfully stopped: TCP binary

Java 1.8.261/271/281 Oracle/Open

Works with and without patch as well 

> Extend logging of client failure handling.
> ------------------------------------------
>
>                 Key: IGNITE-14224
>                 URL: https://issues.apache.org/jira/browse/IGNITE-14224
>             Project: Ignite
>          Issue Type: Improvement
>          Components: clients
>    Affects Versions: 2.9.1
>            Reporter: Stanilovsky Evgeny
>            Assignee: Stanilovsky Evgeny
>            Priority: Major
>             Fix For: 2.11
>
>          Time Spent: 1h 10m
>  Remaining Estimate: 0h
>
> # The "Finished serving remote node connection" message doesn't contain the node id; it should be added (check org.apache.ignite.spi.discovery.tcp.ServerImpl.SocketReader#body).
> # When a client worker is removed from clientMsgWorkers, org.apache.ignite.internal.util.IgniteUtils#closeQuiet(java.lang.AutoCloseable) is used to close the socket, so we don't see any exception that occurs. closeQuiet should be replaced with org.apache.ignite.internal.util.IgniteUtils#close(java.lang.AutoCloseable, org.apache.ignite.IgniteLogger), which works the same way but logs the exception.
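
The difference between the two close styles named in the second item can be sketched as follows. This is a minimal stand-alone illustration, not the IgniteUtils source: the class CloseLoggingSketch is a hypothetical name, and java.util.logging.Logger is used here as a stand-in for IgniteLogger so that the example has no Ignite dependency.

```java
import java.util.logging.Level;
import java.util.logging.Logger;

final class CloseLoggingSketch {
    /** Closes the resource and swallows any exception, so a failed close leaves no trace. */
    static void closeQuiet(AutoCloseable rsrc) {
        if (rsrc != null) {
            try {
                rsrc.close();
            }
            catch (Exception ignored) {
                // Intentionally silent: this is what hides socket close failures.
            }
        }
    }

    /** Closes the resource the same way, but reports a failure to the given logger. */
    static void close(AutoCloseable rsrc, Logger log) {
        if (rsrc != null) {
            try {
                rsrc.close();
            }
            catch (Exception e) {
                // Stand-in for logging through IgniteLogger (assumption for this sketch).
                log.log(Level.WARNING, "Failed to close resource: " + rsrc, e);
            }
        }
    }

    public static void main(String[] args) {
        Logger log = Logger.getLogger("close-sketch");

        // Simulated socket whose close() always fails.
        AutoCloseable failing = () -> {
            throw new java.io.IOException("Close failed (simulated)");
        };

        closeQuiet(failing);   // Nothing is reported.
        close(failing, log);   // A warning with the exception is logged.
    }
}
```

Neither variant propagates the exception to the caller; the only difference in this sketch is whether the failure is visible in the log.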


