You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mesos.apache.org by "Andy Konwinski (Commented) (JIRA)" <ji...@apache.org> on 2011/11/23 03:24:40 UTC

[jira] [Commented] (MESOS-35) Slave webui needs to correctly display stdout/stderr of executors.

    [ https://issues.apache.org/jira/browse/MESOS-35?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13155628#comment-13155628 ] 

Andy Konwinski commented on MESOS-35:
-------------------------------------

I believe this issue no longer manifests (as of SVN r1205259) if you don't specify a value for work_dir when running the mesos slave (i.e. if you let the default values be used for them, which is either mesos_home or ".").

Michael and I use deploylib (deploylib.com), in which we run mesos on EC2 and we set it up to use a config file in which we specify that work_dir = /mnt and this causes the links (in the slave webui) for a framework's executors' stderr and stdout to break. The erro message says that the /mnt/slave/... directory can't be found. When I manually look, the file that is looking for is in /mnt/work/slave/... (notice the extra "/work").

It appears that with these config settings, slave.cpp puts files in a different directory (relative to work_dir from conf) than webui/slave/webui.py looks for them in.

In src/slave/slave.cpp, when we're building up the directory where the executors' stderr and stdout will eventually go, if work_dir is present as a conf value, we should not append "/work" to it, but we currently do. See snippet below, starting at line 1381:

<code>
1381   string workDir = ".";
1382   if (conf.contains("work_dir")) {
1383     workDir = conf.get("work_dir", workDir);
1384   } else if (conf.contains("home")) {
1385     workDir = conf.get("home", workDir);
1386   }
1387 
1388   workDir = workDir + "/work";
1389 
1390   std::ostringstream out(std::ios_base::app | std::ios_base::out);
1391   out << workDir << "/slaves/" << id
1392       << "/frameworks/" << frameworkId
1393       << "/executors/" << executorId;

</code>

... having that extra "/work" in the path is breaking things, since when work_dir is set, src/slave/webui.cpp doesn't append "/work" to it (see starting at line 80):

<code>
 80   webuiPort = conf.get("webui_port", "8081");
 81   logDir = conf.get("log_dir", FLAGS_log_dir);
 82   if (conf.contains("work_dir")) {
 83     workDir = conf.get("work_dir", "");
 84   } else if (conf.contains("home")) {
 85     workDir = conf.get("home", "") + "/work";
 86   } else {
 87     workDir = "work";
 88   }
</code>

I'll attach a small patch that doesn't append "/work" to workDir in slave.cpp if workDir is set from the "work_dir" conf flag.
                
> Slave webui needs to correctly display stdout/stderr of executors.
> ------------------------------------------------------------------
>
>                 Key: MESOS-35
>                 URL: https://issues.apache.org/jira/browse/MESOS-35
>             Project: Mesos
>          Issue Type: Bug
>          Components: slave, webui
>            Reporter: Benjamin Hindman
>
> There have been some changes to the work directories for frameworks due to multi-executor support and so the slave webui is no longer properly linking to executors stdout/stderr.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira