Posted to issues@ozone.apache.org by "Nanda kumar (Jira)" <ji...@apache.org> on 2020/04/02 18:03:00 UTC

[jira] [Commented] (HDDS-3088) maxRetries value is too large while trying to reconnect to SCM server

    [ https://issues.apache.org/jira/browse/HDDS-3088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17073960#comment-17073960 ] 

Nanda kumar commented on HDDS-3088:
-----------------------------------

[~nilotpalnandi]
The max retry count for the client is 10.
{code}
$ bin/ozone scmcli container list
Retrying connect to server: localhost/127.0.0.1:9860. Already tried 0 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
Retrying connect to server: localhost/127.0.0.1:9860. Already tried 1 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
Retrying connect to server: localhost/127.0.0.1:9860. Already tried 2 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
Retrying connect to server: localhost/127.0.0.1:9860. Already tried 3 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
Retrying connect to server: localhost/127.0.0.1:9860. Already tried 4 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
Retrying connect to server: localhost/127.0.0.1:9860. Already tried 5 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
Retrying connect to server: localhost/127.0.0.1:9860. Already tried 6 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
Retrying connect to server: localhost/127.0.0.1:9860. Already tried 7 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
Retrying connect to server: localhost/127.0.0.1:9860. Already tried 8 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
Retrying connect to server: localhost/127.0.0.1:9860. Already tried 9 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
Call From 20115.local/127.0.0.1 to localhost:9860 failed on connection exception: java.net.ConnectException: Connection refused; For more details see:  http://wiki.apache.org/hadoop/ConnectionRefused
{code}

The value 2147483647 (Integer.MAX_VALUE) applies only to datanode-to-SCM communication.
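
For illustration, here is a minimal sketch of how a "retry up to a maximum count with a fixed sleep" policy behaves. It is not the Hadoop implementation; the class and method names (FixedSleepRetry, callWithRetries, worstCaseSleepMillis) are hypothetical and exist only for this example.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.TimeUnit;

// Hypothetical helper, not part of Hadoop or Ozone.
final class FixedSleepRetry {

    // Calls op until it succeeds, retrying at most maxRetries times and
    // sleeping a fixed interval between attempts. Once the retries are
    // exhausted, the last exception is rethrown to the caller.
    static <T> T callWithRetries(Callable<T> op, int maxRetries,
                                 long sleepTime, TimeUnit unit) throws Exception {
        int retries = 0;
        while (true) {
            try {
                return op.call();
            } catch (Exception e) {
                if (retries >= maxRetries) {
                    throw e; // give up: retry budget used
                }
                System.err.printf("Already tried %d time(s); maxRetries=%d%n",
                        retries, maxRetries);
                retries++;
                unit.sleep(sleepTime);
            }
        }
    }

    // Total time spent sleeping if every attempt fails, in milliseconds.
    static long worstCaseSleepMillis(int maxRetries, long sleepTime, TimeUnit unit) {
        return unit.toMillis(sleepTime) * maxRetries;
    }
}
```

Under these assumptions, maxRetries=10 with a 1000 ms sleep gives up after about 10 seconds of sleeping, while maxRetries=2147483647 with the same sleep would keep going for roughly 68 years, which in practice means retrying until the server comes back.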

> maxRetries value is too large while trying to reconnect to SCM server
> ---------------------------------------------------------------------
>
>                 Key: HDDS-3088
>                 URL: https://issues.apache.org/jira/browse/HDDS-3088
>             Project: Hadoop Distributed Data Store
>          Issue Type: Bug
>          Components: SCM
>            Reporter: Nilotpal Nandi
>            Assignee: Nanda kumar
>            Priority: Major
>
> The maxRetries value is 2147483647, which is too high.
> It keeps retrying to connect to the SCM server.
>  
> {noformat}
> 2020-02-27 05:54:43,430 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: quasar-hqknwz-8.quasar-hqknwz.root.hwx.site/172.27.14.1:9861. Already tried 10535 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 MILLISECONDS)
> 2020-02-27 05:54:44,431 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: quasar-hqknwz-8.quasar-hqknwz.root.hwx.site/172.27.14.1:9861. Already tried 10536 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 MILLISECONDS)
> 2020-02-27 05:54:45,432 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: quasar-hqknwz-8.quasar-hqknwz.root.hwx.site/172.27.14.1:9861. Already tried 10537 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 MILLISECONDS)
> 2020-02-27 05:54:46,433 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: quasar-hqknwz-8.quasar-hqknwz.root.hwx.site/172.27.14.1:9861. Already tried 10538 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 MILLISECONDS){noformat}


