You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@mesos.apache.org by "Gilbert Song (JIRA)" <ji...@apache.org> on 2019/01/02 22:18:00 UTC

[jira] [Created] (MESOS-9507) Agent could not recover due to empty docker volume checkpointed files.

Gilbert Song created MESOS-9507:
-----------------------------------

             Summary: Agent could not recover due to empty docker volume checkpointed files.
                 Key: MESOS-9507
                 URL: https://issues.apache.org/jira/browse/MESOS-9507
             Project: Mesos
          Issue Type: Bug
          Components: containerization
            Reporter: Gilbert Song


Agent could not recover due to empty docker volume checkpointed files. Please see logs:

{noformat}
Nov 12 17:12:00 guppy mesos-agent[38960]: E1112 17:12:00.978682 38969 slave.cpp:6279] EXIT with status 1: Failed to perform recovery: Collect failed: Collect failed: Failed to recover docker volumes for orphan container e1b04051-1e4a-47a9-b866-1d625cda1d22: JSON parse failed: syntax error at line 1 near:
Nov 12 17:12:00 guppy mesos-agent[38960]: To remedy this do as follows: 
Nov 12 17:12:00 guppy mesos-agent[38960]: Step 1: rm -f /var/lib/mesos/slave/meta/slaves/latest
Nov 12 17:12:00 guppy mesos-agent[38960]: This ensures agent doesn't recover old live executors.
Nov 12 17:12:00 guppy mesos-agent[38960]: Step 2: Restart the agent. 
Nov 12 17:12:00 guppy systemd[1]: dcos-mesos-slave.service: main process exited, code=exited, status=1/FAILURE
Nov 12 17:12:00 guppy systemd[1]: Unit dcos-mesos-slave.service entered failed state.
Nov 12 17:12:00 guppy systemd[1]: dcos-mesos-slave.service failed.
{noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)