You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@mesos.apache.org by "Andrei Budnik (JIRA)" <ji...@apache.org> on 2019/04/16 17:08:00 UTC
[jira] [Commented] (MESOS-8983)
SlaveRecoveryTest/0.PingTimeoutDuringRecovery is flaky
[ https://issues.apache.org/jira/browse/MESOS-8983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16819281#comment-16819281 ]
Andrei Budnik commented on MESOS-8983:
--------------------------------------
ThisĀ testĀ fails pretty often on ARM.
> SlaveRecoveryTest/0.PingTimeoutDuringRecovery is flaky
> ------------------------------------------------------
>
> Key: MESOS-8983
> URL: https://issues.apache.org/jira/browse/MESOS-8983
> Project: Mesos
> Issue Type: Bug
> Affects Versions: 1.7.0, 1.8.0
> Reporter: Alexander Rojas
> Assignee: Joseph Wu
> Priority: Major
> Labels: flaky-test, foundations
>
> During an unrelated change in a PR, the apache build bot sent the following error:
> {noformat}
> @ 00007FF71117D888 std::invoke<<lambda_9f5bb6c728b761604e288ae85a7b250c>,process::Future<Option<mesos::MasterInfo> >,process::ProcessBase *>
> @ 00007FF71119257B lambda::internal::Partial<<lambda_9f5bb6c728b761604e288ae85a7b250c>,process::Future<Option<mesos::MasterInfo> >,std::_Ph<1> >::invoke_expand<<lambda_9f5bb6c728b761604e288ae85a7b250c>,std::tuple<process::Future<Option<mesos::MasterInfo> >,std::_Ph<1> >,st
> @ 00007FF7110C08BA )<process::ProcessBase *
> @ 00007FF7110F058C std::_Invoker_functor::_Call<lambda::internal::Partial<<lambda_9f5bb6c728b761604e288ae85a7b250c>,process::Future<Option<mesos::MasterInfo> >,std::_Ph<1> >,process::ProcessBase *>
> @ 00007FF711183EBC std::invoke<lambda::internal::Partial<<lambda_9f5bb6c728b761604e288ae85a7b250c>,process::Future<Option<mesos::MasterInfo> >,std::_Ph<1> >,process::ProcessBase *>
> @ 00007FF7110C9F21 )<lambda::internal::Partial<<lambda_9f5bb6c728b761604e288ae85a7b250c>,process::Future<Option<mesos::MasterInfo> >,std::_Ph<1> >,process::ProcessBase *
> @ 00007FF711236416 process::ProcessBase *)>::CallableFn<lambda::internal::Partial<<lambda_9f5bb6c728b761604e288ae85a7b250c>,process::Future<Option<mesos::MasterInfo> >,std::_Ph<1> > >::operator(
> @ 00007FF712C1A25D process::ProcessBase *)>::operator(
> @ 00007FF712ACB2F9 process::ProcessBase::consume
> @ 00007FF712C738CA process::DispatchEvent::consume
> @ 00007FF70ECE7B07 process::ProcessBase::serve
> @ 00007FF712AD93B0 process::ProcessManager::resume
> @ 00007FF712C07371 ??
> @ 00007FF712B2B130 std::_Invoker_functor::_Call<<lambda_124422ac022fa041208b80c1460630d7> >
> @ 00007FF712B8B8E0 std::invoke<<lambda_124422ac022fa041208b80c1460630d7> >
> @ 00007FF712B4076C std::_LaunchPad<std::unique_ptr<std::tuple<<lambda_124422ac022fa041208b80c1460630d7> >,std::default_delete<std::tuple<<lambda_124422ac022fa041208b80c1460630d7> > > > >::_Execute<0>
> @ 00007FF712C5A60A std::_LaunchPad<std::unique_ptr<std::tuple<<lambda_124422ac022fa041208b80c1460630d7> >,std::default_delete<std::tuple<<lambda_124422ac022fa041208b80c1460630d7> > > > >::_Run
> @ 00007FF712C45E78 std::_LaunchPad<std::unique_ptr<std::tuple<<lambda_124422ac022fa041208b80c1460630d7> >,std::default_delete<std::tuple<<lambda_124422ac022fa041208b80c1460630d7> > > > >::_Go
> @ 00007FF712C2C3CD std::_Pad::_Call_func
> @ 00007FFF9BE53428 _register_onexit_function
> @ 00007FFF9BE53071 _register_onexit_function
> @ 00007FFFB6391FE4 BaseThreadInitThunk
> @ 00007FFFB69FF061 RtlUserThreadStart
> ll containerizers
> I0606 10:25:26.680230 18356 slave.cpp:7158] Recovering executors
> I0606 10:25:26.680230 18356 slave.cpp:7182] Sending reconnect request to executor '3f11d255-bb7b-4e99-967b-055fef95b595' of framework 62cf792a-dc69-4e3c-b54f-d83f98fb9451-0000 at executor(1)@192.10.1.5:55652
> I0606 10:25:26.688225 22560 slave.cpp:4984] Received re-registration message from executor '3f11d255-bb7b-4e99-967b-055fef95b595' of framework 62cf792a-dc69-4e3c-b54f-d83f98fb9451-0000
> I0606 10:25:26.691216 22888 slave.cpp:5901] No pings from master received within 75secs
> F0606 10:25:26.692219 22888 slave.cpp:1249] Check failed: state == DISCONNECTED || state == RUNNING || state == TERMINATING RECOVERING
> {noformat}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)