You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@mesos.apache.org by "Greg Mann (JIRA)" <ji...@apache.org> on 2016/03/01 07:40:18 UTC

[jira] [Commented] (MESOS-4053) MemoryPressureMesosTest tests fail on CentOS 6.6

    [ https://issues.apache.org/jira/browse/MESOS-4053?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15173311#comment-15173311 ] 

Greg Mann commented on MESOS-4053:
----------------------------------

I just ran into this while testing 0.27.2-rc1 on Ubuntu 14.04, using gcc, with libevent and SSL enabled. This was after running several rounds of tests on this machine with different builds of Mesos:

{code}
[ RUN      ] MemoryPressureMesosTest.CGROUPS_ROOT_Statistics
I0301 06:36:33.444818 45724 process.cpp:2492] Spawned process files@172.31.18.71:50382
I0301 06:36:33.444849 45739 process.cpp:2502] Resuming files@172.31.18.71:50382 at 2016-03-01 06:36:33.444837888+00:00
I0301 06:36:33.445101 45745 process.cpp:2502] Resuming help@172.31.18.71:50382 at 2016-03-01 06:36:33.445021952+00:00
I0301 06:36:33.458566 45724 process.cpp:2492] Spawned process __latch__(2)@172.31.18.71:50382
I0301 06:36:33.458591 45746 process.cpp:2502] Resuming __gc__@172.31.18.71:50382 at 2016-03-01 06:36:33.458576896+00:00
I0301 06:36:33.458600 45738 process.cpp:2502] Resuming __latch__(2)@172.31.18.71:50382 at 2016-03-01 06:36:33.458589952+00:00
I0301 06:36:33.458652 45738 process.cpp:2607] Cleaning up __latch__(2)@172.31.18.71:50382
../../src/tests/mesos.cpp:955: Failure
(cgroups::destroy(hierarchy, cgroup)).failure(): Failed to remove cgroup '/sys/fs/cgroup/perf_event/mesos_test': Device or resource busy
-----------------------------------------------------------
We're very sorry but we can't seem to destroy existing
cgroups that we likely created as part of an earlier
invocation of the tests. Please manually destroy the cgroup
at '/sys/fs/cgroup/perf_event/mesos_test' by first
manually killing all the processes found in the file at '/sys/fs/cgroup/perf_event/mesos_test/tasks'
-----------------------------------------------------------
I0301 06:36:33.458739 45744 process.cpp:2502] Resuming __gc__@172.31.18.71:50382 at 2016-03-01 06:36:33.458727936+00:00
I0301 06:36:33.458854 45749 process.cpp:2502] Resuming AuthenticationRouter(1)@172.31.18.71:50382 at 2016-03-01 06:36:33.458842112+00:00
I0301 06:36:33.461118 45724 process.cpp:2492] Spawned process __latch__(3)@172.31.18.71:50382
I0301 06:36:33.461139 45752 process.cpp:2502] Resuming __gc__@172.31.18.71:50382 at 2016-03-01 06:36:33.461127936+00:00
I0301 06:36:33.461161 45748 process.cpp:2502] Resuming __latch__(3)@172.31.18.71:50382 at 2016-03-01 06:36:33.461143040+00:00
../../src/tests/mesos.cpp:989: Failure
(cgroups::destroy(hierarchy, cgroup)).failure(): Failed to remove cgroup '/sys/fs/cgroup/perf_event/mesos_test': Device or resource busy
I0301 06:36:33.461216 45748 process.cpp:2607] Cleaning up __latch__(3)@172.31.18.71:50382
I0301 06:36:33.461310 45742 process.cpp:2502] Resuming files@172.31.18.71:50382 at 2016-03-01 06:36:33.461299968+00:00
I0301 06:36:33.461315 45740 process.cpp:2502] Resuming __gc__@172.31.18.71:50382 at 2016-03-01 06:36:33.461304064+00:00
I0301 06:36:33.461503 45742 process.cpp:2607] Cleaning up files@172.31.18.71:50382
[  FAILED  ] MemoryPressureMesosTest.CGROUPS_ROOT_Statistics (17 ms)
{code}

> MemoryPressureMesosTest tests fail on CentOS 6.6
> ------------------------------------------------
>
>                 Key: MESOS-4053
>                 URL: https://issues.apache.org/jira/browse/MESOS-4053
>             Project: Mesos
>          Issue Type: Bug
>         Environment: CentOS 6.6
>            Reporter: Greg Mann
>            Assignee: Benjamin Hindman
>              Labels: mesosphere, test-failure
>
> {{MemoryPressureMesosTest.CGROUPS_ROOT_Statistics}} and {{MemoryPressureMesosTest.CGROUPS_ROOT_SlaveRecovery}} fail on CentOS 6.6. It seems that mounted cgroups are not properly cleaned up after previous tests, so multiple hierarchies are detected and thus an error is produced:
> {code}
> [ RUN      ] MemoryPressureMesosTest.CGROUPS_ROOT_Statistics
> ../../src/tests/mesos.cpp:849: Failure
> Value of: _baseHierarchy.get()
>   Actual: "/cgroup"
> Expected: baseHierarchy
> Which is: "/tmp/mesos_test_cgroup"
> -------------------------------------------------------------
> Multiple cgroups base hierarchies detected:
>   '/tmp/mesos_test_cgroup'
>   '/cgroup'
> Mesos does not support multiple cgroups base hierarchies.
> Please unmount the corresponding (or all) subsystems.
> -------------------------------------------------------------
> ../../src/tests/mesos.cpp:932: Failure
> (cgroups::destroy(hierarchy, cgroup)).failure(): Failed to remove cgroup '/tmp/mesos_test_cgroup/perf_event/mesos_test': Device or resource busy
> [  FAILED  ] MemoryPressureMesosTest.CGROUPS_ROOT_Statistics (12 ms)
> [ RUN      ] MemoryPressureMesosTest.CGROUPS_ROOT_SlaveRecovery
> ../../src/tests/mesos.cpp:849: Failure
> Value of: _baseHierarchy.get()
>   Actual: "/cgroup"
> Expected: baseHierarchy
> Which is: "/tmp/mesos_test_cgroup"
> -------------------------------------------------------------
> Multiple cgroups base hierarchies detected:
>   '/tmp/mesos_test_cgroup'
>   '/cgroup'
> Mesos does not support multiple cgroups base hierarchies.
> Please unmount the corresponding (or all) subsystems.
> -------------------------------------------------------------
> ../../src/tests/mesos.cpp:932: Failure
> (cgroups::destroy(hierarchy, cgroup)).failure(): Failed to remove cgroup '/tmp/mesos_test_cgroup/perf_event/mesos_test': Device or resource busy
> [  FAILED  ] MemoryPressureMesosTest.CGROUPS_ROOT_SlaveRecovery (7 ms)
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)