You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-dev@hadoop.apache.org by "Matthew Sharp (Jira)" <ji...@apache.org> on 2020/04/28 17:17:00 UTC

[jira] [Created] (YARN-10250) Container Relaunch - find: File system loop detected

Matthew Sharp created YARN-10250:
------------------------------------

             Summary: Container Relaunch - find: File system loop detected
                 Key: YARN-10250
                 URL: https://issues.apache.org/jira/browse/YARN-10250
             Project: Hadoop YARN
          Issue Type: Bug
    Affects Versions: 3.2.0
            Reporter: Matthew Sharp


Hive LLAP YARN service tries to relaunch from a container failure and when it retries on the same node we are seeing it fail with:
{code:java}
find: File system loop detected; ‘./lib/llap-27Apr2020.tar.gz’ is part of the same file system loop as ‘./lib’. {code}
 

YARN-8667 attempted to clean up the prior symlinks before relaunching, but in this case it still exists since it recreates the symlinks right before trying to output to directory.info for logging.

 

The following line appears to be the culprit:  [https://github.com/apache/hadoop/blob/trunk/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/main/java/org/apache/hadoop/yarn/server/nodemanager/containermanager/launcher/ContainerLaunch.java#L1346]

 

 

 

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-dev-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-dev-help@hadoop.apache.org