You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-dev@hadoop.apache.org by "KWON BYUNGCHANG (JIRA)" <ji...@apache.org> on 2019/06/25 06:24:00 UTC

[jira] [Created] (YARN-9647) Docker launch fails when local-dirs or log-dirs is unhealthy.

KWON BYUNGCHANG created YARN-9647:
-------------------------------------

             Summary: Docker launch fails when local-dirs or log-dirs is unhealthy.
                 Key: YARN-9647
                 URL: https://issues.apache.org/jira/browse/YARN-9647
             Project: Hadoop YARN
          Issue Type: Bug
          Components: nodemanager
            Reporter: KWON BYUNGCHANG


my /etc/hadoop/conf/container-executor.cfg

{code}
[docker]
   docker.allowed.ro-mounts=/data1/hadoop/yarn/local,/data2/hadoop/yarn/local
   docker.allowed.rw-mounts=/data1/hadoop/yarn/local,/data2/hadoop/yarn/local
{code}


if /data2 is unhealthy, docker launch fails  although container can use /data1 as local-dir, log-dir 

error message is below

{code}
[2019-06-25 14:55:26.168]Exception from container-launch. Container id: container_e50_1561100493387_5185_01_000597 Exit code: 29 Exception message: Launch container failed Shell error output: Could not determine real path of mount '/data2/hadoop/yarn/local' Could not determine real path of mount '/data2/hadoop/yarn/local' Unable to find permitted docker mounts on disk Error constructing docker command, docker error code=16, error message='Mount access error' Shell output: main : command provided 4 main : run as user is magnum main : requested yarn user is magnum Creating script paths... Creating local dirs... [2019-06-25 14:55:26.189]Container exited with a non-zero exit code 29. [2019-06-25 14:55:26.192]Container exited with a non-zero exit code 29. 
{code}


root cause is that normalize_mounts() in docker-util.c return -1  because it cannot resolve real path of /data2/hadoop/yarn/local.(note that /data2 is disk fault  at this point)
however disk of nm local dirs and nm log dirs can fail at any time.
docker launch should succeed if there are available local dirs and log dirs.




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-dev-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-dev-help@hadoop.apache.org