You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-issues@hadoop.apache.org by "Zhijie Shen (JIRA)" <ji...@apache.org> on 2014/04/04 23:03:14 UTC
[jira] [Commented] (YARN-1903) TestNMClient fails occasionally
[ https://issues.apache.org/jira/browse/YARN-1903?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13960418#comment-13960418 ]
Zhijie Shen commented on YARN-1903:
-----------------------------------
I found the following log:
{code}
2014-04-04 05:08:01,361 INFO containermanager.ContainerManagerImpl (ContainerManagerImpl.java:getContainerStatusInternal(785)) - Returning ContainerStatus: [ContainerId: container_1396613275302_0001_01_000004, State: RUNNING, Diagnostics: , ExitStatus: -1000, ]
2014-04-04 05:08:01,365 INFO containermanager.ContainerManagerImpl (ContainerManagerImpl.java:stopContainerInternal(718)) - Stopping container with container Id: container_1396613275302_0001_01_000004
2014-04-04 05:08:01,366 INFO nodemanager.NMAuditLogger (NMAuditLogger.java:logSuccess(89)) - USER=jenkins IP=10.79.62.28 OPERATION=Stop Container Request TARGET=ContainerManageImpl RESULT=SUCCESS APPID=application_1396613275302_0001 CONTAINERID=container_1396613275302_0001_01_000004
2014-04-04 05:08:01,387 INFO monitor.ContainersMonitorImpl (ContainersMonitorImpl.java:isEnabled(169)) - Neither virutal-memory nor physical-memory monitoring is needed. Not running the monitor-thread
2014-04-04 05:08:01,387 INFO containermanager.AuxServices (AuxServices.java:handle(175)) - Got event CONTAINER_STOP for appId application_1396613275302_0001
2014-04-04 05:08:01,389 INFO application.Application (ApplicationImpl.java:transition(296)) - Adding container_1396613275302_0001_01_000004 to application application_1396613275302_0001
2014-04-04 05:08:01,389 INFO nodemanager.NMAuditLogger (NMAuditLogger.java:logSuccess(89)) - USER=jenkins OPERATION=Container Finished - Killed TARGET=ContainerImpl RESULT=SUCCESS APPID=application_1396613275302_0001 CONTAINERID=container_1396613275302_0001_01_000004
2014-04-04 05:08:01,389 INFO container.Container (ContainerImpl.java:handle(884)) - Container container_1396613275302_0001_01_000004 transitioned from NEW to DONE
2014-04-04 05:08:01,389 INFO application.Application (ApplicationImpl.java:transition(339)) - Removing container_1396613275302_0001_01_000004 from application application_1396613275302_0001
2014-04-04 05:08:01,390 INFO util.ProcfsBasedProcessTree (ProcfsBasedProcessTree.java:isAvailable(182)) - ProcfsBasedProcessTree currently is supported only on Linux.
2014-04-04 05:08:01,392 INFO rmcontainer.RMContainerImpl (RMContainerImpl.java:handle(321)) - container_1396613275302_0001_01_000004 Container Transitioned from ACQUIRED to RUNNING
2014-04-04 05:08:01,393 INFO containermanager.ContainerManagerImpl (ContainerManagerImpl.java:getContainerStatusInternal(771)) - Getting container-status for container_1396613275302_0001_01_000004
2014-04-04 05:08:01,393 INFO containermanager.ContainerManagerImpl (ContainerManagerImpl.java:getContainerStatusInternal(785)) - Returning ContainerStatus: [ContainerId: container_1396613275302_0001_01_000004, State: COMPLETE, Diagnostics: , ExitStatus: -1000, ]
{code}
When the kill event is received, the container is still at NEW, it is moved to DONE by going through ContainerDoneTransition, which won't set the killing related exitcode and diagnostics.
> TestNMClient fails occasionally
> -------------------------------
>
> Key: YARN-1903
> URL: https://issues.apache.org/jira/browse/YARN-1903
> Project: Hadoop YARN
> Issue Type: Bug
> Reporter: Zhijie Shen
> Assignee: Zhijie Shen
>
> The container status after stopping container is not expected.
> {code}
> java.lang.AssertionError: 4:
> at org.junit.Assert.fail(Assert.java:93)
> at org.junit.Assert.assertTrue(Assert.java:43)
> at org.apache.hadoop.yarn.client.api.impl.TestNMClient.testGetContainerStatus(TestNMClient.java:382)
> at org.apache.hadoop.yarn.client.api.impl.TestNMClient.testContainerManagement(TestNMClient.java:346)
> at org.apache.hadoop.yarn.client.api.impl.TestNMClient.testNMClient(TestNMClient.java:226)
> {code}
--
This message was sent by Atlassian JIRA
(v6.2#6252)