You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@mesos.apache.org by "Alexander Rukletsov (JIRA)" <ji...@apache.org> on 2016/10/07 09:36:20 UTC

[jira] [Commented] (MESOS-6321) CHECK failure in HierarchicalAllocatorTest.NoDoubleAccounting

    [ https://issues.apache.org/jira/browse/MESOS-6321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15554654#comment-15554654 ] 

Alexander Rukletsov commented on MESOS-6321:
--------------------------------------------

Good run should look like this:
{noformat}
[ RUN      ] HierarchicalAllocatorTest.NoDoubleAccounting
I1007 11:29:37.357229 3211264 hierarchical.cpp:149] Initialized hierarchical allocator process
I1007 11:29:37.357724 1601536 hierarchical.cpp:275] Added framework framework1
I1007 11:29:37.357810 1601536 hierarchical.cpp:1694] No allocations performed
I1007 11:29:37.357842 1601536 hierarchical.cpp:1789] No inverse offers to send out!
I1007 11:29:37.357875 1601536 hierarchical.cpp:1286] Performed allocation for 0 agents in 127us
I1007 11:29:37.358070 1601536 hierarchical.cpp:485] Added agent agent1 (agent1) with cpus(*):1 (allocated: cpus(*):1)
I1007 11:29:37.358151 1601536 hierarchical.cpp:1694] No allocations performed
I1007 11:29:37.358165 1601536 hierarchical.cpp:1789] No inverse offers to send out!
I1007 11:29:37.358182 1601536 hierarchical.cpp:1309] Performed allocation for agent agent1 in 87us
I1007 11:29:37.358243 1601536 hierarchical.cpp:485] Added agent agent2 (agent2) with cpus(*):1 (allocated: cpus(*):1)
I1007 11:29:37.358337 1601536 hierarchical.cpp:1694] No allocations performed
I1007 11:29:37.358361 1601536 hierarchical.cpp:1789] No inverse offers to send out!
I1007 11:29:37.358373 1601536 hierarchical.cpp:1309] Performed allocation for agent agent2 in 102us
I1007 11:29:37.358554 1601536 hierarchical.cpp:275] Added framework framework2
I1007 11:29:37.358619 1601536 hierarchical.cpp:1694] No allocations performed
I1007 11:29:37.358649 1601536 hierarchical.cpp:1789] No inverse offers to send out!
I1007 11:29:37.358662 1601536 hierarchical.cpp:1286] Performed allocation for 2 agents in 95us
I1007 11:29:37.358786 1064960 process.cpp:3377] Handling HTTP event for process 'metrics' with path: '/metrics/snapshot'
[       OK ] HierarchicalAllocatorTest.NoDoubleAccounting (18 ms)
{noformat}

The test failed because allocation events are processed after the metrics event, meaning metrics do not contain information we are looking for. The fix would be to make sure allocation events are processed *before* querying metrics.

> CHECK failure in HierarchicalAllocatorTest.NoDoubleAccounting
> -------------------------------------------------------------
>
>                 Key: MESOS-6321
>                 URL: https://issues.apache.org/jira/browse/MESOS-6321
>             Project: Mesos
>          Issue Type: Bug
>            Reporter: Neil Conway
>            Assignee: Alexander Rukletsov
>              Labels: mesosphere
>
> Observed in internal CI:
> {noformat}
> [15:52:21] :     [Step 10/10] [ RUN      ] HierarchicalAllocatorTest.NoDoubleAccounting
> [15:52:21]W:     [Step 10/10] I1006 15:52:21.813817 23713 hierarchical.cpp:275] Added framework framework1
> [15:52:21]W:     [Step 10/10] I1006 15:52:21.814100 23713 hierarchical.cpp:1694] No allocations performed
> [15:52:21]W:     [Step 10/10] I1006 15:52:21.814102 23712 process.cpp:3377] Handling HTTP event for process 'metrics' with path: '/metrics/snapshot'
> [15:52:21]W:     [Step 10/10] I1006 15:52:21.814121 23713 hierarchical.cpp:1789] No inverse offers to send out!
> [15:52:21]W:     [Step 10/10] I1006 15:52:21.814146 23713 hierarchical.cpp:1286] Performed allocation for 0 agents in 52445ns
> [15:52:21]W:     [Step 10/10] I1006 15:52:21.814206 23713 hierarchical.cpp:485] Added agent agent1 (agent1) with cpus(*):1 (allocated: cpus(*):1)
> [15:52:21]W:     [Step 10/10] I1006 15:52:21.814237 23713 hierarchical.cpp:1694] No allocations performed
> [15:52:21]W:     [Step 10/10] I1006 15:52:21.814247 23713 hierarchical.cpp:1789] No inverse offers to send out!
> [15:52:21]W:     [Step 10/10] I1006 15:52:21.814259 23713 hierarchical.cpp:1309] Performed allocation for agent agent1 in 33887ns
> [15:52:21]W:     [Step 10/10] I1006 15:52:21.814294 23713 hierarchical.cpp:485] Added agent agent2 (agent2) with cpus(*):1 (allocated: cpus(*):1)
> [15:52:21]W:     [Step 10/10] I1006 15:52:21.814332 23713 hierarchical.cpp:1694] No allocations performed
> [15:52:21]W:     [Step 10/10] I1006 15:52:21.814342 23713 hierarchical.cpp:1789] No inverse offers to send out!
> [15:52:21]W:     [Step 10/10] I1006 15:52:21.814349 23713 hierarchical.cpp:1309] Performed allocation for agent agent2 in 42682ns
> [15:52:21]W:     [Step 10/10] I1006 15:52:21.814417 23713 hierarchical.cpp:275] Added framework framework2
> [15:52:21]W:     [Step 10/10] I1006 15:52:21.814445 23713 hierarchical.cpp:1694] No allocations performed
> [15:52:21]W:     [Step 10/10] I1006 15:52:21.814455 23713 hierarchical.cpp:1789] No inverse offers to send out!
> [15:52:21]W:     [Step 10/10] I1006 15:52:21.814469 23713 hierarchical.cpp:1286] Performed allocation for 2 agents in 37976ns
> [15:52:21]W:     [Step 10/10] F1006 15:52:21.824954 23692 json.hpp:334] Check failed: 'boost::get<T>(this)' Must be non NULL
> [15:52:21]W:     [Step 10/10] *** Check failure stack trace: ***
> [15:52:21]W:     [Step 10/10]     @     0x7fe953bbd71d  google::LogMessage::Fail()
> [15:52:21]W:     [Step 10/10]     @     0x7fe953bbf55d  google::LogMessage::SendToLog()
> [15:52:21]W:     [Step 10/10]     @     0x7fe953bbd30c  google::LogMessage::Flush()
> [15:52:21]W:     [Step 10/10]     @     0x7fe953bbfe59  google::LogMessageFatal::~LogMessageFatal()
> [15:52:21]W:     [Step 10/10]     @           0x7cc903  JSON::Value::as<>()
> [15:52:21]W:     [Step 10/10]     @           0x8b633c  mesos::internal::tests::HierarchicalAllocatorTest_NoDoubleAccounting_Test::TestBody()
> [15:52:21]W:     [Step 10/10]     @          0x129ce23  testing::internal::HandleExceptionsInMethodIfSupported<>()
> [15:52:21]W:     [Step 10/10]     @          0x1292f07  testing::Test::Run()
> [15:52:21]W:     [Step 10/10]     @          0x1292fae  testing::TestInfo::Run()
> [15:52:21]W:     [Step 10/10]     @          0x12930b5  testing::TestCase::Run()
> [15:52:21]W:     [Step 10/10]     @          0x1293368  testing::internal::UnitTestImpl::RunAllTests()
> [15:52:21]W:     [Step 10/10]     @          0x1293624  testing::UnitTest::Run()
> [15:52:21]W:     [Step 10/10]     @           0x507254  main
> [15:52:21]W:     [Step 10/10]     @     0x7fe95122876d  (unknown)
> [15:52:21]W:     [Step 10/10]     @           0x51e341  (unknown)
> [15:52:21]W:     [Step 10/10] Aborted (core dumped)
> [15:52:21]W:     [Step 10/10] Process exited with code 134
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)