Posted to issues@mesos.apache.org by "Klaus Ma (JIRA)" <ji...@apache.org> on 2016/05/29 10:58:12 UTC

[jira] [Commented] (MESOS-5482) mesos task stuck in staging after slave reboot

    [ https://issues.apache.org/jira/browse/MESOS-5482?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15305857#comment-15305857 ] 

Klaus Ma commented on MESOS-5482:
---------------------------------

[~gufranmmu@yahoo.com], could you share the mesos-master, mesos-slave, and Marathon logs? Is there any other service that can run on the rebooted node?

> mesos task stuck in staging after slave reboot
> ----------------------------------------------
>
>                 Key: MESOS-5482
>                 URL: https://issues.apache.org/jira/browse/MESOS-5482
>             Project: Mesos
>          Issue Type: Bug
>            Reporter: lutful karim
>            Priority: Blocker
>
> The main idea of Mesos/Marathon is to let us sleep well, but after a node reboot the Mesos task gets stuck in staging for about 4 hours.
> To reproduce the issue:
> - Set up a Mesos cluster in HA mode with systemd-enabled mesos-master and mesos-slave services.
> - Run a Docker registry with a constraint (hostname:LIKE:mesos-slave-1) on one node. Reboot the node and notice that the task gets stuck in staging.
> Possible workaround: running "service mesos-slave restart" fixes the issue.
> OS: CentOS 7.2
> Mesos version: 0.28.1
> Marathon: 1.1.1
> ZooKeeper: 3.4.8
> Docker: 1.9.1, Docker API version: 1.21
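
For anyone trying to reproduce this, the constraint from the steps above might look roughly like the following in a Marathon app definition. This is only a sketch: the app id, image name, and resource values are assumptions for illustration and are not taken from the report; only the constraint triple (hostname, LIKE, mesos-slave-1) comes from the description.

```python
import json


def registry_app(hostname_pattern: str) -> dict:
    """Build a Marathon-style app definition pinned to hosts matching a regex.

    Only the constraint comes from the issue description; every other
    value below is an illustrative assumption.
    """
    return {
        "id": "/docker-registry",  # hypothetical app id
        "cpus": 0.5,               # assumed resource values
        "mem": 256,
        "instances": 1,
        "container": {
            "type": "DOCKER",
            # assumed image name; the report only says "docker registry"
            "docker": {"image": "registry:2", "network": "BRIDGE"},
        },
        # Marathon expresses constraints as [field, operator, value] triples.
        "constraints": [["hostname", "LIKE", hostname_pattern]],
    }


app = registry_app("mesos-slave-1")
print(json.dumps(app, indent=2))
```

The printed JSON is the kind of body one would submit to Marathon when creating the app. Because the constraint only allows hosts matching mesos-slave-1, the task cannot be placed elsewhere while that node is rebooting.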



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)