Posted to issues@ozone.apache.org by "cchenaxchen (Jira)" <ji...@apache.org> on 2021/03/02 07:18:00 UTC
[jira] [Commented] (HDDS-4868) Ozone datanode initraftlog fail due to bad disk so can not communicate to SCM
[ https://issues.apache.org/jira/browse/HDDS-4868?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17293452#comment-17293452 ]
cchenaxchen commented on HDDS-4868:
-----------------------------------
[~adoroszlai], OK. The problem in detail: when a disk goes bad, reloading the Raft log from that disk fails, so a runtime exception is thrown.
The more detailed log is:
WARN org.apache.hadoop.ozone.container.common.statemachine.EndpointStateMachine: Unable to communicate to SCM server at 10.51.87.181:9861 for past 34200 seconds.
java.io.IOException: java.lang.IllegalStateException
at org.apache.ratis.util.IOUtils.asIOException(IOUtils.java:54)
at org.apache.ratis.util.IOUtils.toIOException(IOUtils.java:61)
at org.apache.ratis.util.IOUtils.getFromFuture(IOUtils.java:71)
at org.apache.ratis.server.impl.RaftServerProxy.getImpls(RaftServerProxy.java:301)
at org.apache.ratis.server.impl.RaftServerProxy.start(RaftServerProxy.java:318)
at org.apache.hadoop.ozone.container.common.transport.server.ratis.XceiverServerRatis.start(XceiverServerRatis.java:461)
at org.apache.hadoop.ozone.container.ozoneimpl.OzoneContainer.start(OzoneContainer.java:242)
at org.apache.hadoop.ozone.container.common.states.endpoint.VersionEndpointTask.call(VersionEndpointTask.java:112)
at org.apache.hadoop.ozone.container.common.states.endpoint.VersionEndpointTask.call(VersionEndpointTask.java:41)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
Caused by: java.lang.IllegalStateException
at org.apache.ratis.util.Preconditions.assertTrue(Preconditions.java:33)
at org.apache.ratis.server.raftlog.segmented.SegmentedRaftLogCache.validateAdding(SegmentedRaftLogCache.java:401)
at org.apache.ratis.server.raftlog.segmented.SegmentedRaftLogCache.addSegment(SegmentedRaftLogCache.java:406)
at org.apache.ratis.server.raftlog.segmented.SegmentedRaftLogCache.loadSegment(SegmentedRaftLogCache.java:368)
at org.apache.ratis.server.raftlog.segmented.SegmentedRaftLog.loadLogSegments(SegmentedRaftLog.java:249)
at org.apache.ratis.server.raftlog.segmented.SegmentedRaftLog.openImpl(SegmentedRaftLog.java:217)
at org.apache.ratis.server.raftlog.RaftLog.open(RaftLog.java:276)
at org.apache.ratis.server.impl.ServerState.initRaftLog(ServerState.java:192)
at org.apache.ratis.server.impl.ServerState.<init>(ServerState.java:122)
at org.apache.ratis.server.impl.RaftServerImpl.<init>(RaftServerImpl.java:137)
at org.apache.ratis.server.impl.RaftServerProxy.lambda$newRaftServerImpl$2(RaftServerProxy.java:221)
at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1590)
... 3 more
Thank you very much!
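For readers following the trace, here is a minimal sketch of how an unchecked exception thrown on an async worker can surface as "java.io.IOException: java.lang.IllegalStateException". It is not the Ratis code: the class RaftLogInitSketch and its methods assertTrue, loadAsync and getFromFuture are hypothetical stand-ins named after frames in the trace, and the boolean flag only simulates a segment that fails validation after being read from a bad disk.

```java
import java.io.IOException;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutionException;

public class RaftLogInitSketch {
  // Hypothetical stand-in for a precondition check; throws an unchecked
  // IllegalStateException with no message when the condition is false.
  public static void assertTrue(boolean condition) {
    if (!condition) {
      throw new IllegalStateException();
    }
  }

  // Hypothetical stand-in for opening the log on a worker thread.
  // segmentLooksValid == false simulates a segment that fails validation.
  public static CompletableFuture<String> loadAsync(boolean segmentLooksValid) {
    return CompletableFuture.supplyAsync(() -> {
      assertTrue(segmentLooksValid);
      return "log opened";
    });
  }

  // Waits for the future and converts any failure into an IOException
  // whose cause is the original exception from the worker thread.
  public static String getFromFuture(CompletableFuture<String> future)
      throws IOException {
    try {
      return future.get();
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
      throw new IOException(e);
    } catch (ExecutionException e) {
      throw new IOException(e.getCause());
    }
  }

  public static void main(String[] args) {
    try {
      getFromFuture(loadAsync(false));
    } catch (IOException e) {
      // Prints the wrapper first, then the original cause.
      System.out.println(e);
      System.out.println("Caused by: " + e.getCause());
    }
  }
}
```

In this sketch the caller only ever sees the IOException, so a startup path that treats it as a generic failure has no direct way to tell that a single bad disk was the trigger, which would be consistent with the datanode staying unable to reach SCM.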
> Ozone datanode initraftlog fail due to bad disk so can not communicate to SCM
> -----------------------------------------------------------------------------
>
> Key: HDDS-4868
> URL: https://issues.apache.org/jira/browse/HDDS-4868
> Project: Apache Ozone
> Issue Type: Bug
> Reporter: cchenaxchen
> Assignee: cchenaxchen
> Priority: Major
>
> Because of a bad disk, the datanode's initRaftLog fails and throws an exception:
> java.io.IOException: java.lang.IllegalStateException
> at org.apache.ratis.util.IOUtils.asIOException(IOUtils.java:54)
> at org.apache.ratis.util.IOUtils.toIOException(IOUtils.java:61)
> at org.apache.ratis.util.IOUtils.getFromFuture(IOUtils.java:71)
> at org.apache.ratis.server.impl.RaftServerProxy.getImpls(RaftServerProxy.java:301)
> at org.apache.ratis.server.impl.RaftServerProxy.start(RaftServerProxy.java:318)
> at org.apache.hadoop.ozone.container.common.transport.server.ratis.XceiverServerRatis.start(XceiverServerRatis.java:461)
> at org.apache.hadoop.ozone.container.ozoneimpl.OzoneContainer.start(OzoneContainer.java:242)
> at org.apache.hadoop.ozone.container.common.states.endpoint.VersionEndpointTask.call(VersionEndpointTask.java:112)
> at org.apache.hadoop.ozone.container.common.states.endpoint.VersionEndpointTask.call(VersionEndpointTask.java:41)
> at java.util.concurrent.FutureTask.run(FutureTask.java:266)
> at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
> at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
> at java.lang.Thread.run(Thread.java:748)
--
This message was sent by Atlassian Jira
(v8.3.4#803005)