You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mesos.apache.org by Ben Mahler <be...@gmail.com> on 2013/09/04 04:44:01 UTC
Re: Review Request 13904: Fixed CgroupsIsolator to listen for OOMs of
recovered executors.
-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/13904/
-----------------------------------------------------------
(Updated Sept. 4, 2013, 2:44 a.m.)
Review request for mesos, Benjamin Hindman and Vinod Kone.
Changes
-------
Rebase.
Bugs: MESOS-671
https://issues.apache.org/jira/browse/MESOS-671
Repository: mesos-git
Description
-------
See MESOS-671.
This appears to be the culprit behind triggering MESOS-662 on a frequent basis for recovered slaves.
Diffs (updated)
-----
src/slave/cgroups_isolator.cpp 676768e6b8bd13820467309814845257a9c47e02
Diff: https://reviews.apache.org/r/13904/diff/
Testing
-------
make check
This requires an integration test to catch, it appears the balloon test could be enhanced to have the slave recover, but punting on adding complexity there until we figure out a good testing strategy to ensure recovered slaves operate the same as non-recovered slaves.
Thanks,
Ben Mahler