You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-issues@hadoop.apache.org by "Mohammad Kamrul Islam (JIRA)" <ji...@apache.org> on 2014/03/12 02:25:44 UTC
[jira] [Commented] (MAPREDUCE-5792) When
mapreduce.jobhistory.intermediate-done-dir isn't writable, application
fails with generic error
[ https://issues.apache.org/jira/browse/MAPREDUCE-5792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13931245#comment-13931245 ]
Mohammad Kamrul Islam commented on MAPREDUCE-5792:
--------------------------------------------------
Thanks [~jlowe] for stepping in.
I prefer the MR client to catch it before the submission. what do you think? Do you see any issue?
> When mapreduce.jobhistory.intermediate-done-dir isn't writable, application fails with generic error
> ----------------------------------------------------------------------------------------------------
>
> Key: MAPREDUCE-5792
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-5792
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Components: mr-am, mrv2
> Affects Versions: 2.3.0
> Reporter: Travis Thompson
> Assignee: Mohammad Kamrul Islam
>
> When trying to run an application and the permissions are wrong on {{mapreduce.jobhistory.intermediate-done-dir}}, the MapReduce AM fails with a non-descriptive error message:
> {noformat}
> Application application_1394227890066_0004 failed 2 times due to AM Container for appattempt_1394227890066_0004_000002 exited with exitCode: 1 due to: Exception from container-launch:
> org.apache.hadoop.util.Shell$ExitCodeException:
> at org.apache.hadoop.util.Shell.runCommand(Shell.java:505)
> at org.apache.hadoop.util.Shell.run(Shell.java:418)
> at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:650)
> at org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.launchContainer(LinuxContainerExecutor.java:279)
> at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:283)
> at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:79)
> at java.util.concurrent.FutureTask.run(FutureTask.java:262)
> at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
> at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
> at java.lang.Thread.run(Thread.java:744)
> main : command provided 1
> main : user is tthompso
> main : requested yarn user is tthompso
> Container exited with a non-zero exit code 1
> .Failing this attempt.. Failing the application.
> {noformat}
> When permissions are corrected on this dir, applications are able to run. There should probably be some sort of check on this dir before launching the AM so a more meaningful error message can be thrown.
--
This message was sent by Atlassian JIRA
(v6.2#6252)