You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@geode.apache.org by "ASF subversion and git services (JIRA)" <ji...@apache.org> on 2016/02/01 17:42:40 UTC

[jira] [Commented] (GEODE-871) Frequent use of executeOnAllServers causes build-up of sockets in TIME_WAIT on locator machine

    [ https://issues.apache.org/jira/browse/GEODE-871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15126526#comment-15126526 ] 

ASF subversion and git services commented on GEODE-871:
-------------------------------------------------------

Commit d22176bb1632dc5593a1755a1355aacd0360c5e8 in incubator-geode's branch refs/heads/develop from [~bschuchardt]
[ https://git-wip-us.apache.org/repos/asf?p=incubator-geode.git;h=d22176b ]

GEODE-871: Frequient use of executeOnAllServers causes build-up of TIME_WAIT sockets

Modified the fix to not abort the connection if a response wasn't read.
I also propagated this pattern to GMSHealthMonitor so that the "final
check" sockets are quickly released.


> Frequent use of executeOnAllServers causes build-up of sockets in TIME_WAIT on locator machine
> ----------------------------------------------------------------------------------------------
>
>                 Key: GEODE-871
>                 URL: https://issues.apache.org/jira/browse/GEODE-871
>             Project: Geode
>          Issue Type: Bug
>          Components: client/server
>            Reporter: Bruce Schuchardt
>            Assignee: Bruce Schuchardt
>             Fix For: 1.0.0-incubating.M2
>
>
> The executeOnAllServers implementation contacts a locator in order to get the current set of servers.  This opens a TCP/IP connection to the locator, interacts with the locator and then closes the TCP/IP connection.  The connection is ending up in TIME_WAIT state though, indicating that the locator is usually initiating the close of the connection instead of the client.
> These TIME_WAIT TCP/IP connections can build up rapidly and cause the locator to run out of ephemeral ports.  That puts the locator out of commission until the connections are timed out by the OS.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)