Posted to yarn-issues@hadoop.apache.org by "Eric Badger (JIRA)" <ji...@apache.org> on 2016/09/13 14:33:21 UTC

[jira] [Commented] (YARN-5641) Localizer leaves behind tarballs after container is complete

    [ https://issues.apache.org/jira/browse/YARN-5641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15487350#comment-15487350 ] 

Eric Badger commented on YARN-5641:
-----------------------------------

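Here is an example log of the failure. After the localizer fails, the DeletionService tries to remove the localized filecache directories, but deleteAsUser returns exit code 255 (rmdir fails with "Permission denied"), so the directories are left behind.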
{noformat}
2016-09-09 00:11:10,483 [LocalizerRunner for container_e01_1469258802232_9766291_01_000505] INFO localizer.ResourceLocalizationService: Localizer failed
java.io.IOException: java.lang.InterruptedException
        at org.apache.hadoop.util.Shell.runCommand(Shell.java:565)
        at org.apache.hadoop.util.Shell.run(Shell.java:459)
        at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:747)
        at org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.startLocalizer(LinuxContainerExecutor.java:237)
        at org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.run(ResourceLocalizationService.java:1113)
2016-09-09 00:11:10,484 [DeletionService #0] INFO nodemanager.LinuxContainerExecutor: Deleting path : /tmp/yarn-local/usercache/foobar/appcache/application_1469258802232_9766291/filecache/13
2016-09-09 00:11:10,486 [DeletionService #0] INFO nodemanager.LinuxContainerExecutor: Deleting path : /tmp/yarn-local/usercache/foobar/appcache/application_1469258802232_9766291/filecache/13_tmp
2016-09-09 00:11:10,486 [DeletionService #0] INFO nodemanager.LinuxContainerExecutor: Deleting path : /tmp/yarn-local/usercache/foobar/appcache/application_1469258802232_9766291/filecache/12
2016-09-09 00:11:10,486 [DeletionService #0] INFO nodemanager.LinuxContainerExecutor: Deleting path : /tmp/yarn-local/usercache/foobar/appcache/application_1469258802232_9766291/filecache/12_tmp
2016-09-09 00:11:10,491 [DeletionService #0] ERROR nodemanager.LinuxContainerExecutor: DeleteAsUser for /tmp/yarn-local/usercache/foobar/appcache/application_1469258802232_9766291/filecache/13 /tmp/yarn-local/usercache/foobar/appcache/application_1469258802232_9766291/filecache/13_tmp /tmp/yarn-local/usercache/foobar/appcache/application_1469258802232_9766291/filecache/12 /tmp/yarn-local/usercache/foobar/appcache/application_1469258802232_9766291/filecache/12_tmp returned with exit code: 255
ExitCodeException exitCode=255: 
        at org.apache.hadoop.util.Shell.runCommand(Shell.java:562)
        at org.apache.hadoop.util.Shell.run(Shell.java:459)
        at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:747)
        at org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.deleteAsUser(LinuxContainerExecutor.java:424)
        at org.apache.hadoop.yarn.server.nodemanager.DeletionService$FileDeletionTask.run(DeletionService.java:295)
        at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
        at java.util.concurrent.FutureTask.run(FutureTask.java:266)
        at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180)
        at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
        at java.lang.Thread.run(Thread.java:745)
2016-09-09 00:11:10,491 [DeletionService #0] ERROR nodemanager.LinuxContainerExecutor: Output from LinuxContainerExecutor's deleteAsUser follows:
2016-09-09 00:11:10,491 [DeletionService #0] INFO nodemanager.ContainerExecutor: main : command provided 3
2016-09-09 00:11:10,491 [DeletionService #0] INFO nodemanager.ContainerExecutor: main : user is foobar
2016-09-09 00:11:10,491 [DeletionService #0] INFO nodemanager.ContainerExecutor: main : requested yarn user is foobar
2016-09-09 00:11:10,491 [DeletionService #0] INFO nodemanager.ContainerExecutor: rmdir of the/tmp/yarn-local/usercache/foobar/appcache/application_1469258802232_9766291/filecache/12/ failed - Permission denied
{noformat}
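
To check how often this happens on a node, something like the following could be run over a NodeManager log. This is only a rough sketch, not part of any patch: the function name find_failed_deletions and both regular expressions are hypothetical and are written against the exact line formats in the log above, and it assumes the localized paths contain no whitespace.

```python
import re

# Matches the ERROR line logged when deleteAsUser fails, using the format shown
# in the log above. Assumption: paths are separated by single spaces and contain
# no whitespace themselves.
DELETE_FAILED = re.compile(
    r"ERROR nodemanager\.LinuxContainerExecutor: DeleteAsUser for (?P<paths>.+?) "
    r"returned with exit code: (?P<code>\d+)"
)

# Matches the container-executor output line that names the directory and the
# reason. The log above has no space between "the" and the path, so the
# whitespace is treated as optional.
RMDIR_FAILED = re.compile(r"rmdir of the\s*(?P<path>\S+) failed - (?P<reason>.+)$")


def find_failed_deletions(lines):
    """Scan NodeManager log lines for failed deleteAsUser calls.

    Returns a pair (failed, rmdir_errors):
      failed       -- list of (path, exit_code) for every path in a failed call
      rmdir_errors -- list of (path, reason) taken from the rmdir failure lines
    """
    failed = []
    rmdir_errors = []
    for line in lines:
        match = DELETE_FAILED.search(line)
        if match:
            code = int(match.group("code"))
            failed.extend((path, code) for path in match.group("paths").split())
            continue
        match = RMDIR_FAILED.search(line)
        if match:
            rmdir_errors.append((match.group("path"), match.group("reason").strip()))
    return failed, rmdir_errors
```

Passing an open log file (for example, open("nodemanager.log"), a hypothetical file name) to find_failed_deletions would list every path that a failed deleteAsUser call was asked to remove, along with the directories whose rmdir was rejected.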

> Localizer leaves behind tarballs after container is complete
> ------------------------------------------------------------
>
>                 Key: YARN-5641
>                 URL: https://issues.apache.org/jira/browse/YARN-5641
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: Eric Badger
>            Assignee: Eric Badger
>
> The localizer sometimes fails to clean up extracted tarballs, leaving large footprints that persist on the nodes indefinitely.


