Posted to issues@mesos.apache.org by "Neil Conway (JIRA)" <ji...@apache.org> on 2016/02/22 23:49:18 UTC

[jira] [Commented] (MESOS-4739) libprocess CHECK failure in SlaveRecoveryTest/0.ReconnectHTTPExecutor

    [ https://issues.apache.org/jira/browse/MESOS-4739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15157869#comment-15157869 ] 

Neil Conway commented on MESOS-4739:
------------------------------------

cc [~anandmazumdar] [~mcypark]

> libprocess CHECK failure in SlaveRecoveryTest/0.ReconnectHTTPExecutor
> ---------------------------------------------------------------------
>
>                 Key: MESOS-4739
>                 URL: https://issues.apache.org/jira/browse/MESOS-4739
>             Project: Mesos
>          Issue Type: Bug
>          Components: HTTP API, libprocess
>            Reporter: Neil Conway
>              Labels: flaky-test, libprocess, mesosphere
>
> {noformat}
> [ RUN      ] SlaveRecoveryTest/0.ReconnectHTTPExecutor
> I0223 09:38:55.434953 11158 executor.cpp:172] Version: 0.28.0
> Received a SUBSCRIBED event
> Starting task 1
> Finishing task 1
> Received an ERROR event
> Received an ERROR event
> E0223 09:38:55.504820 11159 executor.cpp:553] End-Of-File received from agent. The agent closed the event stream
> Received an ERROR event
> Received an ERROR event
> Received an ERROR event
> F0223 09:39:00.535778 22159 process.cpp:1114] Check failed: items.size() > 0
> *** Check failure stack trace: ***
> Received an ERROR event
> Received an ERROR event
>     @     0x7f4affd0e754  google::LogMessage::Fail()
> Received an ERROR event
> Received an ERROR event
> Received an ERROR event
> Received an ERROR event
>     @     0x7f4affd0e6ad  google::LogMessage::SendToLog()
>     @     0x7f4affd0e0a3  google::LogMessage::Flush()
>     @     0x7f4affd10f14  google::LogMessageFatal::~LogMessageFatal()
>     @     0x7f4affc618d4  process::HttpProxy::waited()
>     @     0x7f4affc8f57f  _ZZN7process8dispatchINS_9HttpProxyERKNS_6FutureINS_4http8ResponseEEES5_EEvRKNS_3PIDIT_EEMS9_FvT0_ET1_ENKUlPNS_11ProcessBaseEE_clESI_
>     @     0x7f4affcac946  _ZNSt17_Function_handlerIFvPN7process11ProcessBaseEEZNS0_8dispatchINS0_9HttpProxyERKNS0_6FutureINS0_4http8ResponseEEES9_EEvRKNS0_3PIDIT_EEMSD_FvT0_ET1_EUlS2_E_E9_M_invokeERKSt9_Any_dataOS2_
>     @     0x7f4affc89961  std::function<>::operator()()
>     @     0x7f4affc6ef02  process::ProcessBase::visit()
>     @     0x7f4affc74e52  process::DispatchEvent::visit()
>     @           0xa3afe8  process::ProcessBase::serve()
>     @     0x7f4affc6b073  process::ProcessManager::resume()
>     @     0x7f4affc6813b  _ZZN7process14ProcessManager12init_threadsEvENKUlRKSt6atomicIbEE_clES4_
>     @     0x7f4affc745fa  _ZNSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt6atomicIbEE_St17reference_wrapperIS4_EEE6__callIvJEJLm0EEEET_OSt5tupleIJDpT0_EESt12_Index_tupleIJXspT1_EEE
>     @     0x7f4affc745a8  _ZNSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt6atomicIbEE_St17reference_wrapperIS4_EEEclIJEvEET0_DpOT_
>     @     0x7f4affc74556  _ZNSt12_Bind_simpleIFSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt6atomicIbEE_St17reference_wrapperIS5_EEEvEE9_M_invokeIJEEEvSt12_Index_tupleIJXspT_EEE
>     @     0x7f4affc744bf  _ZNSt12_Bind_simpleIFSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt6atomicIbEE_St17reference_wrapperIS5_EEEvEEclEv
>     @     0x7f4affc7445e  _ZNSt6thread5_ImplISt12_Bind_simpleIFSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt6atomicIbEE_St17reference_wrapperIS7_EEEvEEE6_M_runEv
>     @     0x7f4afa6ddc40  execute_native_thread_routine
>     @     0x7f4afadba424  start_thread
>     @     0x7f4af9e50cbd  __clone
>     @              (nil)  (unknown)
> Aborted (core dumped)
> {noformat}
> This crash was observed on a recent Arch Linux VM (VirtualBox) while {{stress --cpu 4}} was running concurrently. Reproduced with {{./src/mesos-tests --gtest_filter="SlaveRecovery*" --gtest_repeat=100 --gtest_break_on_failure}}; it took about 20 iterations to trigger the crash.


