You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@mesos.apache.org by "Chun-Hung Hsiao (JIRA)" <ji...@apache.org> on 2018/07/05 18:06:00 UTC

[jira] [Created] (MESOS-9052) Default executor should commit suicide if it cannot receive HTTP responses for LAUNCH_NESTED_CONTAINER calls.

Chun-Hung Hsiao created MESOS-9052:
--------------------------------------

             Summary: Default executor should commit suicide if it cannot receive HTTP responses for LAUNCH_NESTED_CONTAINER calls.
                 Key: MESOS-9052
                 URL: https://issues.apache.org/jira/browse/MESOS-9052
             Project: Mesos
          Issue Type: Bug
          Components: executor
    Affects Versions: 1.6.0, 1.5.0, 1.4.0, 1.7.0
            Reporter: Chun-Hung Hsiao


If there is a network problem (e.g., a routing problem), it is possible that the agent has received {{LAUNCH_NESTED_CONTAINER}} calls from the default executor and launched the nested container, but the executor does not get the HTTP response. This would result in tasks stuck at {{TASK_STARTING}} forever. We should consider making the default executor commit suicide if it does not receive the response in a reasonable amount of time. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)