You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@ozone.apache.org by "cchenaxchen (Jira)" <ji...@apache.org> on 2021/03/02 07:18:00 UTC

[jira] [Commented] (HDDS-4868) Ozone datanode initraftlog fail due to bad disk so can not communicate to SCM

    [ https://issues.apache.org/jira/browse/HDDS-4868?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17293452#comment-17293452 ] 

cchenaxchen commented on HDDS-4868:
-----------------------------------

[~adoroszlai],ok,this problem detail is when the bad disk occurs,the log reload from disk will error,so will throw runtime exception

the more log detail is 
WARN org.apache.hadoop.ozone.container.common.statemachine.EndpointStateMachine: Unable to communicate to SCM server at 10.51.87.181:9861 for past 34200 seconds.
java.io.IOException: java.lang.IllegalStateException
        at org.apache.ratis.util.IOUtils.asIOException(IOUtils.java:54)
        at org.apache.ratis.util.IOUtils.toIOException(IOUtils.java:61)
        at org.apache.ratis.util.IOUtils.getFromFuture(IOUtils.java:71)
        at org.apache.ratis.server.impl.RaftServerProxy.getImpls(RaftServerProxy.java:301)
        at org.apache.ratis.server.impl.RaftServerProxy.start(RaftServerProxy.java:318)
        at org.apache.hadoop.ozone.container.common.transport.server.ratis.XceiverServerRatis.start(XceiverServerRatis.java:461)
        at org.apache.hadoop.ozone.container.ozoneimpl.OzoneContainer.start(OzoneContainer.java:242)
        at org.apache.hadoop.ozone.container.common.states.endpoint.VersionEndpointTask.call(VersionEndpointTask.java:112)
        at org.apache.hadoop.ozone.container.common.states.endpoint.VersionEndpointTask.call(VersionEndpointTask.java:41)
        at java.util.concurrent.FutureTask.run(FutureTask.java:266)
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
        at java.lang.Thread.run(Thread.java:748)
Caused by: java.lang.IllegalStateException
        at org.apache.ratis.util.Preconditions.assertTrue(Preconditions.java:33)
        at org.apache.ratis.server.raftlog.segmented.SegmentedRaftLogCache.validateAdding(SegmentedRaftLogCache.java:401)
        at org.apache.ratis.server.raftlog.segmented.SegmentedRaftLogCache.addSegment(SegmentedRaftLogCache.java:406)
        at org.apache.ratis.server.raftlog.segmented.SegmentedRaftLogCache.loadSegment(SegmentedRaftLogCache.java:368)
        at org.apache.ratis.server.raftlog.segmented.SegmentedRaftLog.loadLogSegments(SegmentedRaftLog.java:249)
        at org.apache.ratis.server.raftlog.segmented.SegmentedRaftLog.openImpl(SegmentedRaftLog.java:217)
        at org.apache.ratis.server.raftlog.RaftLog.open(RaftLog.java:276)
        at org.apache.ratis.server.impl.ServerState.initRaftLog(ServerState.java:192)
        at org.apache.ratis.server.impl.ServerState.<init>(ServerState.java:122)
        at org.apache.ratis.server.impl.RaftServerImpl.<init>(RaftServerImpl.java:137)
        at org.apache.ratis.server.impl.RaftServerProxy.lambda$newRaftServerImpl$2(RaftServerProxy.java:221)
        at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1590)
        ... 3 more
 
thank you very much!

> Ozone datanode initraftlog fail due to bad disk so can not communicate to SCM
> -----------------------------------------------------------------------------
>
>                 Key: HDDS-4868
>                 URL: https://issues.apache.org/jira/browse/HDDS-4868
>             Project: Apache Ozone
>          Issue Type: Bug
>            Reporter: cchenaxchen
>            Assignee: cchenaxchen
>            Priority: Major
>
> because of bad disk,the datanode initraftlog fail and throw exception:
> java.io.IOException: java.lang.IllegalStateException
>         at org.apache.ratis.util.IOUtils.asIOException(IOUtils.java:54)
>         at org.apache.ratis.util.IOUtils.toIOException(IOUtils.java:61)
>         at org.apache.ratis.util.IOUtils.getFromFuture(IOUtils.java:71)
>         at org.apache.ratis.server.impl.RaftServerProxy.getImpls(RaftServerProxy.java:301)
>         at org.apache.ratis.server.impl.RaftServerProxy.start(RaftServerProxy.java:318)
>         at org.apache.hadoop.ozone.container.common.transport.server.ratis.XceiverServerRatis.start(XceiverServerRatis.java:461)
>         at org.apache.hadoop.ozone.container.ozoneimpl.OzoneContainer.start(OzoneContainer.java:242)
>         at org.apache.hadoop.ozone.container.common.states.endpoint.VersionEndpointTask.call(VersionEndpointTask.java:112)
>         at org.apache.hadoop.ozone.container.common.states.endpoint.VersionEndpointTask.call(VersionEndpointTask.java:41)
>         at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>         at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>         at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>         at java.lang.Thread.run(Thread.java:748)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@ozone.apache.org
For additional commands, e-mail: issues-help@ozone.apache.org