You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@mesos.apache.org by "Anand Mazumdar (JIRA)" <ji...@apache.org> on 2017/09/11 16:13:03 UTC

[jira] [Updated] (MESOS-7867) Master doesn't handle scheduler driver downgrade from HTTP based to PID based

     [ https://issues.apache.org/jira/browse/MESOS-7867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Anand Mazumdar updated MESOS-7867:
----------------------------------
    Shepherd: Anand Mazumdar

> Master doesn't handle scheduler driver downgrade from HTTP based to PID based
> -----------------------------------------------------------------------------
>
>                 Key: MESOS-7867
>                 URL: https://issues.apache.org/jira/browse/MESOS-7867
>             Project: Mesos
>          Issue Type: Bug
>          Components: master
>    Affects Versions: 1.3.0
>            Reporter: Ilya Pronin
>            Assignee: Ilya Pronin
>
> When a framework upgrades from a PID based driver to an HTTP based driver, master removes its per-framework-principal metrics ({{messages_received}} and {{messages_processed}}) in {{Master::failoverFramework}}. When the same framework downgrades back to a PID based driver, the master doesn't reinstate those metrics. This causes a crash when the master receives a message from the failed over framework and increments {{messages_received}} counter in {{Master::visit(const MessageEvent&)}}.
> {noformat}
> I0807 18:17:45.713220 19095 master.cpp:2916] Framework 70822e80-ca38-4470-916e-e6da073a4742-0000 (TwitterScheduler) failed over
> F0807 18:18:20.725908 19079 master.cpp:1451] Check failed: metrics->frameworks.contains(principal.get())
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)