You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@ozone.apache.org by "Aswin Shakil Balasubramanian (Jira)" <ji...@apache.org> on 2021/08/11 16:06:00 UTC

[jira] [Created] (HDDS-5611) NullPointerException in ContainerStateMachine during Pipeline Close.

Aswin Shakil Balasubramanian created HDDS-5611:
--------------------------------------------------

             Summary: NullPointerException in ContainerStateMachine during Pipeline Close.
                 Key: HDDS-5611
                 URL: https://issues.apache.org/jira/browse/HDDS-5611
             Project: Apache Ozone
          Issue Type: Bug
            Reporter: Aswin Shakil Balasubramanian
            Assignee: Aswin Shakil Balasubramanian


{code:java}
2021-06-22 05:43:07,590 ERROR org.apache.hadoop.ozone.container.common.statemachine.commandhandler.ClosePipelineCommandHandler: Can't close pipeline PipelineID=0bcbc90b-5982-450
2-a5d6-ba14a461c307
java.io.IOException: java.lang.NullPointerException
        at org.apache.hadoop.ozone.container.common.transport.server.ratis.XceiverServerRatis.removeGroup(XceiverServerRatis.java:782)
        at org.apache.hadoop.ozone.container.common.statemachine.commandhandler.ClosePipelineCommandHandler.handle(ClosePipelineCommandHandler.java:74)
        at org.apache.hadoop.ozone.container.common.statemachine.commandhandler.CommandDispatcher.handle(CommandDispatcher.java:99)
        at org.apache.hadoop.ozone.container.common.statemachine.DatanodeStateMachine.lambda$initCommandHandlerThread$2(DatanodeStateMachine.java:497)
        at java.base/java.lang.Thread.run(Thread.java:834)
Caused by: java.io.IOException: java.lang.NullPointerException
        at org.apache.ratis.util.IOUtils.asIOException(IOUtils.java:54)
        at org.apache.ratis.server.impl.RaftServerImpl.waitForReply(RaftServerImpl.java:862)
        at org.apache.ratis.server.impl.RaftServerProxy.groupManagement(RaftServerProxy.java:432)
        at org.apache.hadoop.ozone.container.common.transport.server.ratis.XceiverServerRatis.removeGroup(XceiverServerRatis.java:780)
        ... 4 more
Caused by: java.lang.NullPointerException
        at org.apache.hadoop.ozone.container.ozoneimpl.ContainerController.markContainerForClose(ContainerController.java:83)
        at org.apache.hadoop.ozone.container.common.transport.server.ratis.ContainerStateMachine.notifyGroupRemove(ContainerStateMachine.java:883)
        at org.apache.ratis.server.impl.RaftServerImpl.groupRemove(RaftServerImpl.java:362)
        at org.apache.ratis.server.impl.RaftServerProxy.lambda$groupRemoveAsync$14(RaftServerProxy.java:499)
        at java.base/java.util.concurrent.CompletableFuture.uniApplyNow(CompletableFuture.java:680)
        at java.base/java.util.concurrent.CompletableFuture.uniApplyStage(CompletableFuture.java:658)
        at java.base/java.util.concurrent.CompletableFuture.thenApply(CompletableFuture.java:2094)
        at org.apache.ratis.server.impl.RaftServerProxy.groupRemoveAsync(RaftServerProxy.java:498)
        at org.apache.ratis.server.impl.RaftServerProxy.groupManagementAsync(RaftServerProxy.java:452)
        ... 6 more
{code}
During pipeline close, we iterate over the list of containers in Ratis snapshot to close them if needed. This will cause an NPE for missing containers. We need to make sure we also look at missing containers, and skip them in this step.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@ozone.apache.org
For additional commands, e-mail: issues-help@ozone.apache.org