You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-issues@hadoop.apache.org by "Jason Lowe (JIRA)" <ji...@apache.org> on 2017/07/11 22:38:00 UTC

[jira] [Commented] (YARN-6805) NPE in LinuxContainerExecutor due to null PrivilegedOperationException exit code

    [ https://issues.apache.org/jira/browse/YARN-6805?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16083110#comment-16083110 ] 

Jason Lowe commented on YARN-6805:
----------------------------------

Sample stack trace from a 2.8-based release:
{noformat}
2017-07-10 20:39:12,810 [LocalizerRunner for container_e03_1496686551678_8189060_01_005998] WARN privileged.PrivilegedOperationExecutor: IOException executing command: 
java.io.InterruptedIOException: java.lang.InterruptedException
        at org.apache.hadoop.util.Shell.runCommand(Shell.java:1007)
        at org.apache.hadoop.util.Shell.run(Shell.java:898)
        at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:1213)
        at org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.privileged.PrivilegedOperationExecutor.executePrivilegedOperation(PrivilegedOperationExecutor.java:151)
        at org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.startLocalizer(LinuxContainerExecutor.java:263)
        at org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.run(ResourceLocalizationService.java:1155)
Caused by: java.lang.InterruptedException
        at java.lang.Object.wait(Native Method)
        at java.lang.Object.wait(Object.java:502)
        at java.lang.UNIXProcess.waitFor(UNIXProcess.java:395)
        at org.apache.hadoop.util.Shell.runCommand(Shell.java:997)
        ... 5 more
2017-07-10 20:39:12,811 [LocalizerRunner for container_e03_1496686551678_8189060_01_005998] INFO localizer.ResourceLocalizationService: Localizer failed
java.lang.NullPointerException
        at org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.startLocalizer(LinuxContainerExecutor.java:267)
        at org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.run(ResourceLocalizationService.java:1155)
{noformat}


> NPE in LinuxContainerExecutor due to null PrivilegedOperationException exit code
> --------------------------------------------------------------------------------
>
>                 Key: YARN-6805
>                 URL: https://issues.apache.org/jira/browse/YARN-6805
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: nodemanager
>    Affects Versions: 2.8.1
>            Reporter: Jason Lowe
>
> The LinuxContainerExecutor contains a number of code snippets like this:
> {code}
>     } catch (PrivilegedOperationException e) {
>       int exitCode = e.getExitCode();
> {code}
> PrivilegedOperationException#getExitCode can return null if the operation was interrupted, so when the JVM does auto-unboxing on that last line it can NPE if there was no exit code.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org