You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@aurora.apache.org by "John Sirois (JIRA)" <ji...@apache.org> on 2016/02/12 18:22:18 UTC
[jira] [Created] (AURORA-1617) Install instructions should point
out the critical step of matching mesos slave -work_dir to the observer
--mesos-root
John Sirois created AURORA-1617:
-----------------------------------
Summary: Install instructions should point out the critical step of matching mesos slave -work_dir to the observer --mesos-root
Key: AURORA-1617
URL: https://issues.apache.org/jira/browse/AURORA-1617
Project: Aurora
Issue Type: Task
Components: Documentation, Observer
Reporter: John Sirois
Priority: Minor
As reported by Thorhs here: http://wilderness.apache.org/channels/?f=aurora/2016-02-12#1455288155
{noformat}
Fri Feb 12 14:44:09 2016 Thorhs: Hi All, I'm trying out the nightly build of aurora, and things are going fine until I try to view the task state in the observer. Clicking in the Host link in the Active Task page redirects me to port 1338/task/taskid, but it gives me a 404 error.
Fri Feb 12 14:44:30 2016 Thorhs: I think it may be mismatch between the checkpoint path between the executor and the observer.
Fri Feb 12 14:44:39 2016 Thorhs: Is this something that you have seen before?
Fri Feb 12 14:46:31 2016 Thorhs: I'm running on Centos7 with RPM aurora-scheduler-0.13.0snapshot.2016.02.10-1.el7.centos.aurora.x86_64
Fri Feb 12 14:53:22 2016 Thorhs: Looking at the thermos, it appears it is looking for a directory with checkpoints, defaulting to /var/run/thermos if I read it correctly. on the ps output for an executor, I see the checkpoint path is set to /tmp/mesos/slaves/886fc9bc-179b-43c4-a7c6-e706ab7ae96b-S0/frameworks/20160210-072614-3231125002-5050-1392-0000/executors/thermos-1455287988358-nobody-devel-hello_world-0-9fe076da-1037-447c-95e8-ff8ca7751834/runs/a3efc6bd-d108-49
Fri Feb 12 14:58:35 2016 igmor: Joined the channel
Fri Feb 12 15:08:04 2016 Thorhs: Never mind, work_dir was not set in /etc/mesos-slave. Once set to /var/lib/mesos and restarted, everything started working. I must have missed a step in the instructions.
{noformat}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)