Posted to issues@mesos.apache.org by "Alexander Rukletsov (JIRA)" <ji...@apache.org> on 2017/10/27 14:42:00 UTC

[jira] [Updated] (MESOS-1262) MultipleExecutorsTest.TasksExecutorInfoDiffers is flaky

     [ https://issues.apache.org/jira/browse/MESOS-1262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Alexander Rukletsov updated MESOS-1262:
---------------------------------------
    Labels: flaky flaky-test  (was: flaky)

> MultipleExecutorsTest.TasksExecutorInfoDiffers is flaky
> -------------------------------------------------------
>
>                 Key: MESOS-1262
>                 URL: https://issues.apache.org/jira/browse/MESOS-1262
>             Project: Mesos
>          Issue Type: Bug
>          Components: test
>            Reporter: Vinod Kone
>              Labels: flaky, flaky-test
>
> [ RUN      ] MultipleExecutorsTest.TasksExecutorInfoDiffers
> I0428 19:05:48.540652 24547 leveldb.cpp:174] Opened db in 20.021997ms
> I0428 19:05:48.541288 24547 leveldb.cpp:181] Compacted db in 562623ns
> I0428 19:05:48.541317 24547 leveldb.cpp:196] Created db iterator in 8388ns
> I0428 19:05:48.541326 24547 leveldb.cpp:202] Seeked to beginning of db in 1139ns
> I0428 19:05:48.541334 24547 leveldb.cpp:271] Iterated through 0 keys in the db in 484ns
> I0428 19:05:48.541360 24547 replica.cpp:729] Replica recovered with log positions 0 -> 0 with 1 holes and 0 unlearned
> I0428 19:05:48.541730 24584 recover.cpp:425] Starting replica recovery
> I0428 19:05:48.541844 24584 recover.cpp:451] Replica is in EMPTY status
> I0428 19:05:48.542312 24584 replica.cpp:626] Replica in EMPTY status received a broadcasted recover request
> I0428 19:05:48.542394 24584 recover.cpp:188] Received a recover response from a replica in EMPTY status
> I0428 19:05:48.542531 24584 recover.cpp:542] Updating replica status to STARTING
> I0428 19:05:48.543365 24584 leveldb.cpp:304] Persisting metadata (8 bytes) to leveldb took 757883ns
> I0428 19:05:48.543386 24584 replica.cpp:320] Persisted replica status to STARTING
> I0428 19:05:48.543460 24584 recover.cpp:451] Replica is in STARTING status
> I0428 19:05:48.543798 24584 replica.cpp:626] Replica in STARTING status received a broadcasted recover request
> I0428 19:05:48.543859 24584 recover.cpp:188] Received a recover response from a replica in STARTING status
> I0428 19:05:48.543967 24584 recover.cpp:542] Updating replica status to VOTING
> I0428 19:05:48.544258 24584 leveldb.cpp:304] Persisting metadata (8 bytes) to leveldb took 239912ns
> I0428 19:05:48.544271 24584 replica.cpp:320] Persisted replica status to VOTING
> I0428 19:05:48.544307 24584 recover.cpp:556] Successfully joined the Paxos group
> I0428 19:05:48.544369 24584 recover.cpp:440] Recover process terminated
> I0428 19:05:48.545640 24584 master.cpp:266] Master 20140428-190548-143311683-40673-24547 (minerva.apache.org) started on 67.195.138.8:40673
> I0428 19:05:48.545666 24584 master.cpp:303] Master only allowing authenticated frameworks to register
> I0428 19:05:48.545673 24584 master.cpp:308] Master only allowing authenticated slaves to register
> I0428 19:05:48.545680 24584 credentials.hpp:35] Loading credentials for authentication
> W0428 19:05:48.545733 24584 credentials.hpp:48] Failed to stat credentials file 'file:///tmp/MultipleExecutorsTest_TasksExecutorInfoDiffers_zyeSd0/credentials': No such file or directory
> I0428 19:05:48.546339 24584 hierarchical_allocator_process.hpp:302] Initializing hierarchical allocator process with master : master@67.195.138.8:40673
> I0428 19:05:48.546376 24584 master.cpp:104] No whitelist given. Advertising offers for all slaves
> I0428 19:05:48.546589 24584 master.cpp:922] The newly elected leader is master@67.195.138.8:40673 with id 20140428-190548-143311683-40673-24547
> I0428 19:05:48.546600 24584 master.cpp:932] Elected as the leading master!
> I0428 19:05:48.546608 24584 master.cpp:753] Recovering from registrar
> I0428 19:05:48.546699 24584 registrar.cpp:275] Recovering registrar
> I0428 19:05:48.547029 24584 log.cpp:656] Attempting to start the writer
> I0428 19:05:48.547441 24584 replica.cpp:474] Replica received implicit promise request with proposal 1
> I0428 19:05:48.547695 24584 leveldb.cpp:304] Persisting metadata (8 bytes) to leveldb took 239042ns
> I0428 19:05:48.547708 24584 replica.cpp:342] Persisted promised to 1
> I0428 19:05:48.547904 24584 coordinator.cpp:229] Coordinator attemping to fill missing position
> I0428 19:05:48.548322 24584 replica.cpp:375] Replica received explicit promise request for position 0 with proposal 2
> I0428 19:05:48.548477 24584 leveldb.cpp:341] Persisting action (8 bytes) to leveldb took 138289ns
> I0428 19:05:48.548493 24584 replica.cpp:664] Persisted action at 0
> I0428 19:05:48.559414 24586 replica.cpp:508] Replica received write request for position 0
> I0428 19:05:48.559535 24586 leveldb.cpp:436] Reading position from leveldb took 46649ns
> I0428 19:05:48.559965 24586 leveldb.cpp:341] Persisting action (14 bytes) to leveldb took 413113ns
> I0428 19:05:48.559980 24586 replica.cpp:664] Persisted action at 0
> I0428 19:05:48.560171 24586 replica.cpp:643] Replica received learned notice for position 0
> I0428 19:05:48.560372 24586 leveldb.cpp:341] Persisting action (16 bytes) to leveldb took 186478ns
> I0428 19:05:48.560385 24586 replica.cpp:664] Persisted action at 0
> I0428 19:05:48.560395 24586 replica.cpp:649] Replica learned NOP action at position 0
> I0428 19:05:48.560659 24586 log.cpp:672] Writer started with ending position 0
> I0428 19:05:48.561046 24586 leveldb.cpp:436] Reading position from leveldb took 11812ns
> I0428 19:05:48.562821 24586 registrar.cpp:308] Successfully recovered registrar
> I0428 19:05:48.562868 24586 registrar.cpp:379] Attempting to update the 'registry'
> I0428 19:05:48.564551 24586 log.cpp:680] Attempting to append 137 bytes to the log
> I0428 19:05:48.564638 24586 coordinator.cpp:339] Coordinator attempting to write APPEND action at position 1
> I0428 19:05:48.565008 24586 replica.cpp:508] Replica received write request for position 1
> I0428 19:05:48.571110 24586 leveldb.cpp:341] Persisting action (156 bytes) to leveldb took 6.053569ms
> I0428 19:05:48.571272 24586 replica.cpp:664] Persisted action at 1
> I0428 19:05:48.579241 24585 replica.cpp:643] Replica received learned notice for position 1
> I0428 19:05:48.579629 24585 leveldb.cpp:341] Persisting action (158 bytes) to leveldb took 332299ns
> I0428 19:05:48.579644 24585 replica.cpp:664] Persisted action at 1
> I0428 19:05:48.579656 24585 replica.cpp:649] Replica learned APPEND action at position 1
> I0428 19:05:48.580095 24585 registrar.cpp:427] Successfully updated 'registry'
> I0428 19:05:48.580178 24585 log.cpp:699] Attempting to truncate the log to 1
> I0428 19:05:48.580258 24585 master.cpp:780] Recovered 0 slaves from the Registry (99B) ; allowing 10mins for slaves to re-register
> I0428 19:05:48.580320 24585 coordinator.cpp:339] Coordinator attempting to write TRUNCATE action at position 2
> I0428 19:05:48.580610 24585 replica.cpp:508] Replica received write request for position 2
> I0428 19:05:48.580724 24585 leveldb.cpp:341] Persisting action (16 bytes) to leveldb took 99788ns
> I0428 19:05:48.580735 24585 replica.cpp:664] Persisted action at 2
> I0428 19:05:48.580899 24585 replica.cpp:643] Replica received learned notice for position 2
> I0428 19:05:48.580988 24585 leveldb.cpp:341] Persisting action (18 bytes) to leveldb took 78815ns
> I0428 19:05:48.581009 24585 leveldb.cpp:399] Deleting ~1 keys from leveldb took 10110ns
> I0428 19:05:48.581018 24585 replica.cpp:664] Persisted action at 2
> I0428 19:05:48.581027 24585 replica.cpp:649] Replica learned TRUNCATE action at position 2
> I0428 19:05:49.549613 24585 hierarchical_allocator_process.hpp:726] No resources available to allocate!
> I0428 19:05:49.549664 24585 hierarchical_allocator_process.hpp:688] Performed allocation for 0 slaves in 94678ns
> I0428 19:05:50.550092 24585 hierarchical_allocator_process.hpp:726] No resources available to allocate!
> I0428 19:05:50.550153 24585 hierarchical_allocator_process.hpp:688] Performed allocation for 0 slaves in 78268ns
> 2014-04-28 19:05:51,026:24547(0x2b174c200700):ZOO_ERROR@handle_socket_error_msg@1697: Socket [127.0.0.1:49011] zk retcode=-4, errno=111(Connection refused): server refused to accept the client
> I0428 19:05:51.550614 24585 hierarchical_allocator_process.hpp:726] No resources available to allocate!
> I0428 19:05:51.550695 24585 hierarchical_allocator_process.hpp:688] Performed allocation for 0 slaves in 125181ns
> I0428 19:05:52.551409 24585 hierarchical_allocator_process.hpp:726] No resources available to allocate!
> I0428 19:05:52.551448 24585 hierarchical_allocator_process.hpp:688] Performed allocation for 0 slaves in 58922ns
> I0428 19:05:53.547530 24584 master.cpp:104] No whitelist given. Advertising offers for all slaves
> I0428 19:05:53.552539 24589 hierarchical_allocator_process.hpp:726] No resources available to allocate!
> I0428 19:05:53.552562 24589 hierarchical_allocator_process.hpp:688] Performed allocation for 0 slaves in 77109ns
> 2014-04-28 19:05:54,362:24547(0x2b174c200700):ZOO_ERROR@handle_socket_error_msg@1697: Socket [127.0.0.1:49011] zk retcode=-4, errno=111(Connection refused): server refused to accept the client
> I0428 19:05:54.553315 24590 hierarchical_allocator_process.hpp:726] No resources available to allocate!
> I0428 19:05:54.553380 24590 hierarchical_allocator_process.hpp:688] Performed allocation for 0 slaves in 80265ns
> I0428 19:05:55.553812 24587 hierarchical_allocator_process.hpp:726] No resources available to allocate!
> I0428 19:05:55.553856 24587 hierarchical_allocator_process.hpp:688] Performed allocation for 0 slaves in 90273ns
> I0428 19:05:56.558269 24589 hierarchical_allocator_process.hpp:726] No resources available to allocate!
> I0428 19:05:56.558323 24589 hierarchical_allocator_process.hpp:688] Performed allocation for 0 slaves in 71014ns
> I0428 19:05:57.559363 24589 hierarchical_allocator_process.hpp:726] No resources available to allocate!
> I0428 19:05:57.559417 24589 hierarchical_allocator_process.hpp:688] Performed allocation for 0 slaves in 74060ns
> 2014-04-28 19:05:57,698:24547(0x2b174c200700):ZOO_ERROR@handle_socket_error_msg@1697: Socket [127.0.0.1:49011] zk retcode=-4, errno=111(Connection refused): server refused to accept the client
> I0428 19:05:58.550842 24589 master.cpp:104] No whitelist given. Advertising offers for all slaves
> I0428 19:05:58.562800 24589 hierarchical_allocator_process.hpp:726] No resources available to allocate!
> I0428 19:05:58.562861 24589 hierarchical_allocator_process.hpp:688] Performed allocation for 0 slaves in 79561ns
> F0428 19:05:58.582340 24547 cluster.hpp:373] Failed to wait for _recover
> *** Check failure stack trace: ***
>     @     0x2b16240b41ed  google::LogMessage::Fail()
>     @     0x2b16240b627f  google::LogMessage::SendToLog()
>     @     0x2b16240b3ddc  google::LogMessage::Flush()
>     @     0x2b16240b6aed  google::LogMessageFatal::~LogMessageFatal()
>     @           0x72b4f9  mesos::internal::tests::Cluster::Masters::start()
>     @           0x726233  mesos::internal::tests::MesosTest::StartMaster()
>     @           0x76803d  MultipleExecutorsTest_TasksExecutorInfoDiffers_Test::TestBody()
>     @           0x8a571d  testing::internal::HandleExceptionsInMethodIfSupported<>()
>     @           0x89dd51  testing::Test::Run()
>     @           0x89de36  testing::TestInfo::Run()
>     @           0x89df77  testing::TestCase::Run()
>     @           0x89e2de  testing::internal::UnitTestImpl::RunAllTests()
>     @           0x8a529d  testing::internal::HandleExceptionsInMethodIfSupported<>()
>     @           0x89d3ae  testing::UnitTest::Run()
>     @           0x4a2b90  main
>     @     0x2b162550176d  (unknown)
>     @           0x4adf81  (unknown)
> make[4]: *** [check-local] Aborted



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)