You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-issues@hadoop.apache.org by "Karthik Kambatla (JIRA)" <ji...@apache.org> on 2013/11/26 02:52:35 UTC

[jira] [Updated] (HADOOP-10127) Add ipc.client.connect.retry.interval to control the frequency of connection retries

     [ https://issues.apache.org/jira/browse/HADOOP-10127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Karthik Kambatla updated HADOOP-10127:
--------------------------------------

    Attachment: hadoop-10127-1.patch

Straight-forward patch that makes the retry-interval configurable. Tested manually in a YARN HA cluster with configured-failover. Sample output below: 

{noformat}
kasha@keka:~/install/rmha-tests/hadoop-3.0.0-SNAPSHOT$ sudo -u root bin/hadoop jar share/hadoop/mapreduce/hadoop-mapreduce-examples-3.0.0-SNAPSHOT.jar pi -Dipc.client.connect.max.retries=20 -Dipc.client.connect.retry.interval=50 2 10
Number of Maps  = 2
Samples per Map = 10
13/11/25 17:47:20 WARN util.NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
Wrote input for Map #0
Wrote input for Map #1
Starting Job
13/11/25 17:47:22 INFO ipc.Client: Retrying connect to server: rm-ha-2.ent.cloudera.com/10.20.195.36:23140. Already tried 0 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=20, sleepTime=50 MILLISECONDS)
13/11/25 17:47:22 INFO ipc.Client: Retrying connect to server: rm-ha-2.ent.cloudera.com/10.20.195.36:23140. Already tried 1 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=20, sleepTime=50 MILLISECONDS)
13/11/25 17:47:22 INFO ipc.Client: Retrying connect to server: rm-ha-2.ent.cloudera.com/10.20.195.36:23140. Already tried 2 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=20, sleepTime=50 MILLISECONDS)
13/11/25 17:47:22 INFO ipc.Client: Retrying connect to server: rm-ha-2.ent.cloudera.com/10.20.195.36:23140. Already tried 3 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=20, sleepTime=50 MILLISECONDS)
13/11/25 17:47:22 INFO ipc.Client: Retrying connect to server: rm-ha-2.ent.cloudera.com/10.20.195.36:23140. Already tried 4 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=20, sleepTime=50 MILLISECONDS)

{noformat}

> Add ipc.client.connect.retry.interval to control the frequency of connection retries
> ------------------------------------------------------------------------------------
>
>                 Key: HADOOP-10127
>                 URL: https://issues.apache.org/jira/browse/HADOOP-10127
>             Project: Hadoop Common
>          Issue Type: Bug
>          Components: ipc
>    Affects Versions: 2.2.0
>            Reporter: Karthik Kambatla
>            Assignee: Karthik Kambatla
>         Attachments: hadoop-10127-1.patch
>
>
> Currently, the client attempts to connect to the server every 1 second. It would be nice to make this configurable so that the client can connect more/less frequently. Changing the number of retries alone is not granular enough.



--
This message was sent by Atlassian JIRA
(v6.1#6144)