You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@mesos.apache.org by "Yan Xu (JIRA)" <ji...@apache.org> on 2017/12/01 23:20:00 UTC

[jira] [Commented] (MESOS-8223) Master crashes when suppressed on subscribe is enabled.

    [ https://issues.apache.org/jira/browse/MESOS-8223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16275169#comment-16275169 ] 

Yan Xu commented on MESOS-8223:
-------------------------------

{noformat:title=}
commit 8c2f972b5c0c42e1519d09275cc26e1765a0c5de
Author: Jiang Yan Xu <xu...@apple.com>
Date:   Tue Nov 14 00:12:17 2017 -0800

    Fixed a bug that removed the suppressed framework from sorter.
    
    Review: https://reviews.apache.org/r/63831
{noformat}

> Master crashes when suppressed on subscribe is enabled.
> -------------------------------------------------------
>
>                 Key: MESOS-8223
>                 URL: https://issues.apache.org/jira/browse/MESOS-8223
>             Project: Mesos
>          Issue Type: Bug
>          Components: master
>    Affects Versions: 1.4.0
>            Reporter: Yan Xu
>            Assignee: Yan Xu
>            Priority: Critical
>             Fix For: 1.5.0
>
>
> Introduced in MESOS-7015, this feature is not actually turned on due to MESOS-8200. However once this is addressed and the feature enabled, the master crashes with:
> {noformat:title=}
> I1113 17:17:37.240901 11285 master.cpp:3309] Disconnecting framework 40f7bdc0-e54b-46da-ace1-48162171baf4-0110 (test-framework)
> I1113 17:17:37.240911 11285 master.cpp:1435] Giving framework 40f7bdc0-e54b-46da-ace1-48162171baf4-0110 (test-framework) 3days to failover
> I1113 17:17:37.241953 11285 master.cpp:2612] Received subscription request for HTTP framework 'test-framework'
> I1113 17:17:37.242807 11285 master.cpp:2748] Subscribing framework 'test-framework' with checkpointing enabled, roles { * } suppressed and capabilities [ SHARED_RESOURCES, TASK_KILLING_STATE ]
> I1113 17:17:37.242820 11285 master.cpp:6994] Updating info for framework 40f7bdc0-e54b-46da-ace1-48162171baf4-0110
> I1113 17:17:37.252637 11270 hierarchical.cpp:380] Activated framework 40f7bdc0-e54b-46da-ace1-48162171baf4-0110
> I1113 17:17:37.272457 11289 master.cpp:7723] Performing implicit task state reconciliation for framework 40f7bdc0-e54b-46da-ace1-48162171baf4-0110 (test-framework)
> I1113 17:17:37.272507 11289 master.cpp:7723] Performing implicit task state reconciliation for framework 40f7bdc0-e54b-46da-ace1-48162171baf4-0110 (test-framework)
> I1113 17:17:41.966331 11271 master.cpp:5564] Processing REVIVE call for framework 40f7bdc0-e54b-46da-ace1-48162171baf4-0110 (test-framework)
> F1113 17:17:41.966380 11280 sorter.cpp:270] Check failed: 'find(clientPath)' Must be non NULL
> *** Check failure stack trace: ***
>     @     0x7f3467efd0dd  (unknown)
> {noformat}
> This happens with a unsuppressed framework reregisters with suppressed roles and then revive.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)