You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-issues@hadoop.apache.org by "Jason Lowe (JIRA)" <ji...@apache.org> on 2013/05/09 23:05:17 UTC

[jira] [Commented] (YARN-661) NM fails to cleanup local directories for users

    [ https://issues.apache.org/jira/browse/YARN-661?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13653198#comment-13653198 ] 

Jason Lowe commented on YARN-661:
---------------------------------

Sample log of the failure:

{noformat}
2013-05-09 10:38:05,107 [DeletionService #0] INFO org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor: Deleting absolute path : file:/tmp/yarn-local/usercache_DEL_1368077188083/someuser
2013-05-09 10:38:05,107 [DeletionService #0] INFO org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor:  -- DEBUG -- deleteAsUser: [/hadoop/current/bin/container-executor, someuser, 3, /tmp/yarn-local/usercache_DEL_1368077188083/someuser]
2013-05-09 10:38:05,142 [DeletionService #0] WARN org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor: Exit code from container is : 255
2013-05-09 10:38:05,142 [DeletionService #0] ERROR org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor: DeleteAsUser for /tmp/yarn-local/usercache_DEL_1368077188083/someuser returned with non-zero exit code255
{noformat}

The directory is owner by the user, but the parent directory which must be written as part of the delete is owned by the mapred user and is only writable by the mapred user.
                
> NM fails to cleanup local directories for users
> -----------------------------------------------
>
>                 Key: YARN-661
>                 URL: https://issues.apache.org/jira/browse/YARN-661
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: nodemanager
>    Affects Versions: 2.0.5-beta, 0.23.8
>            Reporter: Jason Lowe
>
> YARN-71 added deletion of local directories on startup, but in practice it fails to delete the directories because of permission problems.  The top-level usercache directory is owned by the user but is in a directory that is not writable by the user.  Therefore the deletion of the user's usercache directory, as the user, fails due to lack of permissions.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira