Posted to issues@mesos.apache.org by "Benjamin Bannier (JIRA)" <ji...@apache.org> on 2016/03/03 13:58:18 UTC

[jira] [Commented] (MESOS-2522) Add reason field for framework errors

    [ https://issues.apache.org/jira/browse/MESOS-2522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15177776#comment-15177776 ] 

Benjamin Bannier commented on MESOS-2522:
-----------------------------------------

The master currently responds with the following {{FrameworkErrorMessage}} contents:

||Master method ||{{FrameworkErrorMessage}} content||
| {{Master::exceededCapacity}}    |  Message {{$MESSAGE_NAME}} dropped: capacity ({{$CAPACITY}}) exceeded |
| {{Master::registerFramework}}   |  Registered with 'id' already set |
| {{Master::reregisterFramework}} |  Re-registering without an 'id' |
| {{Master::subscribe}}           |  Role '{{$FRAMEWORK_ROLE}}' is not present in the master's --roles |
|                             |  User 'root' is not allowed to run frameworks without --root_submissions set |
|                             |  Framework has been removed |
| {{Master::_subscribe}}          |  Authorization failure: {{$AUTHORIZATION_FAILURE}} |
|                             |  Not authorized to use role '{{$FRAMEWORK_ROLE}}' |
|                             |  Framework is already connected |
|                             |  Framework failed over |
| {{Master::failoverFramework}}   |  Framework failed over |

AFAIK, the only message that currently requires the scheduler to obtain a new {{FrameworkID}} is {{Framework has been removed}}.
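
To illustrate how the strings above could collapse into an enumeration, here is a rough scheduler-side sketch in Python. The {{ErrorReason}} names, the {{classify}} helper, and {{requires_new_framework_id}} are all hypothetical and not part of any Mesos API; the patterns are simply transcribed from the table, so they would break if the master's wording changed (which is the argument for sending a reason field instead).

```python
import enum
import re


class ErrorReason(enum.Enum):
    """Hypothetical reasons; these names are illustrative, not a Mesos API."""
    CAPACITY_EXCEEDED = enum.auto()
    INVALID_REGISTRATION = enum.auto()
    ROLE_NOT_WHITELISTED = enum.auto()
    ROOT_NOT_ALLOWED = enum.auto()
    FRAMEWORK_REMOVED = enum.auto()
    AUTHORIZATION_FAILED = enum.auto()
    UNAUTHORIZED_ROLE = enum.auto()
    ALREADY_CONNECTED = enum.auto()
    FAILED_OVER = enum.auto()
    UNKNOWN = enum.auto()


# Patterns transcribed from the message table in this comment.
_PATTERNS = [
    (r"Message .+ dropped: capacity \(.+\) exceeded", ErrorReason.CAPACITY_EXCEEDED),
    (r"Registered with 'id' already set", ErrorReason.INVALID_REGISTRATION),
    (r"Re-registering without an 'id'", ErrorReason.INVALID_REGISTRATION),
    (r"Role '.*' is not present in the master's --roles", ErrorReason.ROLE_NOT_WHITELISTED),
    (r"User 'root' is not allowed to run frameworks without --root_submissions set",
     ErrorReason.ROOT_NOT_ALLOWED),
    (r"Framework has been removed", ErrorReason.FRAMEWORK_REMOVED),
    (r"Authorization failure: .*", ErrorReason.AUTHORIZATION_FAILED),
    (r"Not authorized to use role '.*'", ErrorReason.UNAUTHORIZED_ROLE),
    (r"Framework is already connected", ErrorReason.ALREADY_CONNECTED),
    (r"Framework failed over", ErrorReason.FAILED_OVER),
]
_COMPILED = [(re.compile(pattern), reason) for pattern, reason in _PATTERNS]


def classify(message):
    """Map an error message string to a (hypothetical) ErrorReason."""
    for pattern, reason in _COMPILED:
        if pattern.fullmatch(message.strip()):
            return reason
    return ErrorReason.UNKNOWN


def requires_new_framework_id(reason):
    """Per the note above, only a removed framework needs a fresh FrameworkID."""
    return reason is ErrorReason.FRAMEWORK_REMOVED
```

With something like this, a scheduler would discard its persisted {{FrameworkID}} only when {{requires_new_framework_id(classify(message))}} is true and could treat the remaining reasons as retryable or fatal as it sees fit.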


> Add reason field for framework errors
> -------------------------------------
>
>                 Key: MESOS-2522
>                 URL: https://issues.apache.org/jira/browse/MESOS-2522
>             Project: Mesos
>          Issue Type: Improvement
>          Components: master
>    Affects Versions: 0.22.0
>            Reporter: Connor Doyle
>            Priority: Minor
>              Labels: mesosphere, newbie
>
> Currently, the only insight into framework errors is a message string.  Framework schedulers could probably be smarter about how to handle errors if the cause is known.  Since there are only a handful of distinct cases that could trigger an error, they could be captured by an enumeration.
> One specific use case for this feature follows. Frameworks that intend to survive failover typically persist the FrameworkID somewhere.  When a framework has been marked completed by the master for exceeding its configured failover timeout, then re-registration triggers a framework error.  Probably, the scheduler wants to disambiguate this kind of framework error from others in order to invalidate the stashed FrameworkID for the next attempt at (re)registration.


