Posted to yarn-issues@hadoop.apache.org by "Steve Loughran (JIRA)" <ji...@apache.org> on 2013/11/22 15:21:35 UTC

[jira] [Commented] (YARN-1438) When a container fails, the text of the exception isn't included in the diagnostics

    [ https://issues.apache.org/jira/browse/YARN-1438?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13829995#comment-13829995 ] 

Steve Loughran commented on YARN-1438:
--------------------------------------

Here's an example: we know the container failed and that it threw an exception, but not what actually happened.

{code}

2013-11-22 02:37:22,991 INFO  [AMRM Callback Handler Thread] HoyaAppMaster.yarn (HoyaAppMaster.java:onContainersCompleted(762)) - onContainersCompleted([1]
2013-11-22 02:37:22,991 INFO  [AMRM Callback Handler Thread] HoyaAppMaster.yarn (HoyaAppMaster.java:onContainersCompleted(765)) - Container Completion for containerID=container_1384457454541_0020_01_000002, state=COMPLETE, exitStatus=1, diagnostics=Exception from container-launch: 
org.apache.hadoop.util.Shell$ExitCodeException: 
	at org.apache.hadoop.util.Shell.runCommand(Shell.java:464)
	at org.apache.hadoop.util.Shell.run(Shell.java:379)
	at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:589)
	at org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor.launchContainer(DefaultContainerExecutor.java:195)
	at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:283)
	at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:79)
	at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
	at java.util.concurrent.FutureTask.run(FutureTask.java:138)
	at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
	at java.lang.Thread.run(Thread.java:662)
{code}

> When a container fails, the text of the exception isn't included in the diagnostics
> -----------------------------------------------------------------------------------
>
>                 Key: YARN-1438
>                 URL: https://issues.apache.org/jira/browse/YARN-1438
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: nodemanager
>            Reporter: Steve Loughran
>
> The diagnostics text generated when a container execution throws an exception doesn't include the exception message, only the stack trace. This makes debugging harder than necessary.
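
As a rough illustration of what more useful diagnostics might look like, here is a minimal sketch that puts an explicit message line ahead of the stack trace and substitutes a placeholder when the exception carries no text. The class DiagnosticsSketch, its format method, and the "Message:" line layout are all hypothetical; none of this is taken from the NodeManager code, and it is not a proposed patch.

```java
import java.io.PrintWriter;
import java.io.StringWriter;

// Hypothetical helper for illustration only; not part of Hadoop or YARN.
final class DiagnosticsSketch {

    private DiagnosticsSketch() {}

    // Builds a diagnostics string with an explicit message line before the stack trace.
    static String format(String prefix, Throwable t) {
        String message = t.getMessage();
        if (message == null || message.trim().isEmpty()) {
            // Make an empty message visible instead of leaving a blank after the class name.
            message = "(no message supplied)";
        }
        StringWriter buffer = new StringWriter();
        PrintWriter out = new PrintWriter(buffer);
        out.println(prefix + ": " + t.getClass().getName());
        out.println("Message: " + message);
        t.printStackTrace(out);
        out.flush();
        return buffer.toString();
    }

    public static void main(String[] args) {
        // The exception type and text below are made up for the demonstration.
        System.out.println(format("Exception from container-launch",
                new IllegalStateException("launch script exited with code 1")));
    }
}
```

The placeholder is there because the trace in the example log shows nothing after the exception class name, so a reader cannot tell an empty message from a dropped one.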


