Posted to dev@mesos.apache.org by "Benjamin Mahler (JIRA)" <ji...@apache.org> on 2014/05/20 03:09:38 UTC

[jira] [Created] (MESOS-1389) Reconciliation can send TASK_LOST before a terminal update reaches the framework.

Benjamin Mahler created MESOS-1389:
--------------------------------------

             Summary: Reconciliation can send TASK_LOST before a terminal update reaches the framework.
                 Key: MESOS-1389
                 URL: https://issues.apache.org/jira/browse/MESOS-1389
             Project: Mesos
          Issue Type: Bug
    Affects Versions: 0.19.0
            Reporter: Benjamin Mahler
             Fix For: 0.19.0


There's an unfortunate case with reconciliation where the master sends TASK_LOST first, and only afterwards does the slave send the valid terminal status update.

When the slave re-registers with terminal tasks that have un-acked updates, the master does not store these tasks. So while the slave still needs to send the terminal status updates, the master will reply with TASK_LOST for reconciliation, because it has no record of them.
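
To make the ordering concrete, here is a minimal toy model of the race in Python. It is only a sketch and not Mesos code: the Slave and Master classes, their fields, and the simulate() helper are all hypothetical names made up for illustration. It assumes the master answers reconciliation for any task it does not know about with TASK_LOST.

```python
# Toy model of the race described above; NOT Mesos code.
# All class, field, and function names here are hypothetical.
from dataclasses import dataclass, field


@dataclass
class Slave:
    # Terminal updates the framework has not acknowledged yet: task_id -> state.
    unacked: dict = field(default_factory=dict)


@dataclass
class Master:
    # The master only tracks non-terminal tasks: task_id -> state.
    tasks: dict = field(default_factory=dict)

    def reregister(self, slave: Slave, running: dict) -> None:
        # Terminal tasks with un-acked updates (slave.unacked) are not stored.
        self.tasks.update(running)

    def reconcile(self, task_id: str) -> str:
        # Assumption: a task the master does not know about is reported as lost.
        return self.tasks.get(task_id, "TASK_LOST")


def simulate() -> list:
    slave = Slave(unacked={"t1": "TASK_FINISHED"})
    master = Master()
    master.reregister(slave, running={"t2": "TASK_RUNNING"})

    received = []  # updates in the order the framework sees them
    received.append(("t1", master.reconcile("t1")))  # reconciliation reply
    received.append(("t1", slave.unacked["t1"]))     # slave's retried update
    return received
```

In this model the framework sees TASK_LOST for t1 before the slave's retried TASK_FINISHED update arrives, which is the ordering problem this ticket describes.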

We may need to ensure that all status update acknowledgements go through the master to fix this.
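
One way to picture this idea, as a rough sketch only: if acknowledgements pass through the master, the master could keep a terminal task until its update is acked, and answer reconciliation with the real state in the meantime. The Master class and its methods below are hypothetical and do not reflect the actual Mesos implementation or any committed fix.

```python
# Toy model only; names are hypothetical and this is not the Mesos API.
TERMINAL = {"TASK_FINISHED", "TASK_FAILED", "TASK_KILLED", "TASK_LOST"}


class Master:
    def __init__(self):
        # task_id -> latest known state, terminal states included.
        self.tasks = {}

    def reregister(self, tasks):
        # Assumption: terminal tasks are kept while their updates are un-acked.
        self.tasks.update(tasks)

    def acknowledge(self, task_id):
        # The ack passes through the master, so it learns when the framework
        # has seen the terminal update and can forget the task.
        if self.tasks.get(task_id) in TERMINAL:
            del self.tasks[task_id]

    def reconcile(self, task_id):
        # A known task is answered with its stored state, not TASK_LOST.
        return self.tasks.get(task_id, "TASK_LOST")
```

In this sketch, a reconciliation request that arrives before the acknowledgement is answered with the stored terminal state, because the master still holds the task.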



--
This message was sent by Atlassian JIRA
(v6.2#6252)