You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@mesos.apache.org by "Alexander Rukletsov (JIRA)" <ji...@apache.org> on 2016/12/07 15:13:58 UTC
[jira] [Created] (MESOS-6743) Docker executor hangs forever if
`docker stop` fails.
Alexander Rukletsov created MESOS-6743:
------------------------------------------
Summary: Docker executor hangs forever if `docker stop` fails.
Key: MESOS-6743
URL: https://issues.apache.org/jira/browse/MESOS-6743
Project: Mesos
Issue Type: Bug
Components: docker
Affects Versions: 1.0.1, 1.1.0
Reporter: Alexander Rukletsov
If {{docker stop}} finishes with an error status, the executor should catch this and react instead of indefinitely waiting for {{reaped}} to return.
An interesting question is _how_ to react. Here are possible solutions.
1. Retry {{docker stop}}. In this case it is unclear how many times to retry and what to do if {{docker stop}} continues to fail.
2. Unmark task as {{killed}}. This will allow frameworks to retry the kill. However, in this case it is unclear what status updates we should send: {[TASK_KILLING}} for every kill retry? an extra update when we failed to kill a task? or set a specific reason in {{TASK_KILLING}}?
3. Clean up and exit. In this case we should make sure the task container is killed or notify the framework and the operator that the container may still be running.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)