Posted to issues@ozone.apache.org by "Ethan Rose (Jira)" <ji...@apache.org> on 2021/10/20 20:37:10 UTC

[jira] [Updated] (HDDS-1583) EOFException for Ozone RPC client

     [ https://issues.apache.org/jira/browse/HDDS-1583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ethan Rose updated HDDS-1583:
-----------------------------
    Target Version/s: 1.3.0  (was: 1.2.0)

I am managing the 1.2.0 release, and we currently have more than 600 issues targeted for it, so I am moving this issue's target version to 1.3.0.

If you are actively working on this Jira and believe it should be targeted for the 1.2.0 release, please reach out to me via Apache email or Slack.

> EOFException for Ozone RPC client
> ---------------------------------
>
>                 Key: HDDS-1583
>                 URL: https://issues.apache.org/jira/browse/HDDS-1583
>             Project: Apache Ozone
>          Issue Type: Bug
>            Reporter: Eric Yang
>            Priority: Major
>
> We discovered a bug in the Ozone RPC client. If the server is still starting and has not completely started, calling new SCMCLI().createScmClient(); results in an EOFException. Most client software retries the connection for a brief period before throwing an error, so that transient failures do not unduly alarm client code. The experience can be improved by making the connection logic retry a few times before giving up. See the related stack trace:
> {code}java.io.EOFException: End of File Exception between local host is: "localhost.localdomain/127.0.0.1"; destination host is: "localhost.localdomain":9860; : java.io.EOFException; For more details see:  http://wiki.apache.org/hadoop/EOFException
>         at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
>         at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
>         at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
>         at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
>         at org.apache.hadoop.net.NetUtils.wrapWithMessage(NetUtils.java:831)
>         at org.apache.hadoop.net.NetUtils.wrapException(NetUtils.java:789)
>         at org.apache.hadoop.ipc.Client.getRpcResponse(Client.java:1515)
>         at org.apache.hadoop.ipc.Client.call(Client.java:1457)
>         at org.apache.hadoop.ipc.Client.call(Client.java:1367)
>         at org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:228)
>         at org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:116)
>         at com.sun.proxy.$Proxy12.inSafeMode(Unknown Source)
>         at org.apache.hadoop.hdds.scm.protocolPB.StorageContainerLocationProtocolClientSideTranslatorPB.inSafeMode(StorageContainerLocationProtocolClientSideTranslatorPB.java:383)
>         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>         at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>         at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>         at java.lang.reflect.Method.invoke(Method.java:498)
>         at org.apache.hadoop.hdds.tracing.TraceAllMethod.invoke(TraceAllMethod.java:66)
>         at com.sun.proxy.$Proxy13.inSafeMode(Unknown Source)
>         at org.apache.hadoop.hdds.scm.client.ContainerOperationClient.inSafeMode(ContainerOperationClient.java:456)
>         at org.apache.hadoop.ozone.ITDiskReadWrite.setUp(ITDiskReadWrite.java:43)
>         at junit.framework.TestCase.runBare(TestCase.java:139)
>         at junit.framework.TestResult$1.protect(TestResult.java:122)
>         at junit.framework.TestResult.runProtected(TestResult.java:142)
>         at junit.framework.TestResult.run(TestResult.java:125)
>         at junit.framework.TestCase.run(TestCase.java:129)
>         at junit.framework.TestSuite.runTest(TestSuite.java:255)
>         at junit.framework.TestSuite.run(TestSuite.java:250)
>         at org.junit.internal.runners.JUnit38ClassRunner.run(JUnit38ClassRunner.java:84)
>         at org.junit.runners.Suite.runChild(Suite.java:127)
>         at org.junit.runners.Suite.runChild(Suite.java:26)
>         at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
>         at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:63)
>         at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:236)
>         at org.junit.runners.ParentRunner.access$000(ParentRunner.java:53)
>         at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:229)
>         at org.junit.runners.ParentRunner.run(ParentRunner.java:309)
>         at org.apache.maven.surefire.junitcore.JUnitCore.run(JUnitCore.java:55)
>         at org.apache.maven.surefire.junitcore.JUnitCoreWrapper.createRequestAndRun(JUnitCoreWrapper.java:137)
>         at org.apache.maven.surefire.junitcore.JUnitCoreWrapper.executeEager(JUnitCoreWrapper.java:107)
>         at org.apache.maven.surefire.junitcore.JUnitCoreWrapper.execute(JUnitCoreWrapper.java:83)
>         at org.apache.maven.surefire.junitcore.JUnitCoreWrapper.execute(JUnitCoreWrapper.java:75)
>         at org.apache.maven.surefire.junitcore.JUnitCoreProvider.invoke(JUnitCoreProvider.java:158)
>         at org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:384)
>         at org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:345)
>         at org.apache.maven.surefire.booter.ForkedBooter.execute(ForkedBooter.java:126)
>         at org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:418)
> Caused by: java.io.EOFException
>         at java.io.DataInputStream.readInt(DataInputStream.java:392)
>         at org.apache.hadoop.ipc.Client$IpcStreams.readResponse(Client.java:1816)
>         at org.apache.hadoop.ipc.Client$Connection.receiveRpcResponse(Client.java:1173)
>         at org.apache.hadoop.ipc.Client$Connection.run(Client.java:1069)
> {code}
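
The retry behavior requested in the description could look roughly like the sketch below. It is an illustration only, not the Ozone or Hadoop implementation: RetryingCall, callWithRetry, maxAttempts and initialDelayMs are hypothetical names introduced here, and treating every IOException (EOFException is a subclass) as transient is an assumption.

```java
import java.io.IOException;
import java.util.concurrent.Callable;

/** Hypothetical helper for illustration; not part of Ozone or Hadoop. */
final class RetryingCall {
    private RetryingCall() {}

    /**
     * Runs the call up to maxAttempts times. After each failed attempt except
     * the last, sleeps and then doubles the delay. Only IOException (which
     * includes EOFException) is retried; other exceptions propagate at once.
     */
    static <T> T callWithRetry(Callable<T> call, int maxAttempts, long initialDelayMs)
            throws Exception {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be >= 1");
        }
        long delayMs = initialDelayMs;
        IOException last = null;
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            try {
                return call.call();
            } catch (IOException e) {
                // Assumed transient, e.g. the server is still starting up.
                last = e;
                if (attempt < maxAttempts) {
                    Thread.sleep(delayMs);
                    delayMs *= 2; // simple exponential backoff
                }
            }
        }
        // Every attempt failed: surface the most recent error to the caller.
        throw last;
    }
}
```

A caller might wrap the failing call from the trace above, for example the inSafeMode() invocation, in a lambda passed to callWithRetry with a small attempt count such as 5, so that a server that is still starting does not immediately surface an EOFException to client code.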



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
