Posted to dev@mesos.apache.org by Davies Liu <da...@gmail.com> on 2011/12/22 06:20:23 UTC

Mesos master crash

Hi, Devs:

Our Mesos master (built from the latest github/master) crashed with the following check failures:

F1221 11:30:07.528009 10986 master.cpp:690] Check failed: offer->framework_id() == frameworkId
*** Check failure stack trace: ***
    @           0x5aa61d  google::LogMessage::Fail()
    @           0x5ace2f  google::LogMessage::SendToLog()
    @           0x5aa206  google::LogMessage::Flush()
    @           0x5ad6ad  google::LogMessageFatal::~LogMessageFatal()
    @           0x44eed9  mesos::internal::master::Master::launchTasks()
    @           0x46e42a  ProtobufProcess<>::handler4<>()
    @           0x450ed5  std::tr1::_Function_handler<>::_M_invoke()
    @           0x440158  mesos::internal::master::Master::operator()()
    @           0x5bfc39  process::ProcessManager::run()
    @           0x5bfd8a  process::trampoline()
    @     0x7f25a2f08160  (unknown)

and

F1221 11:30:18.039479 10256 master.cpp:1641] Check failed: !framework->hasExecutor(slave->id, task.executor_id())
*** Check failure stack trace: ***
    @           0x5aa61d  google::LogMessage::Fail()
    @           0x5ace2f  google::LogMessage::SendToLog()
    @           0x5aa206  google::LogMessage::Flush()
    @           0x5ad6ad  google::LogMessageFatal::~LogMessageFatal()
    @           0x446b31  mesos::internal::master::Master::readdSlave()
    @           0x45af80  std::tr1::_Function_handler<>::_M_invoke()
    @           0x5b9cd6  process::ProcessBase::serve()
    @           0x440178  mesos::internal::master::Master::operator()()
    @           0x5bfc39  process::ProcessManager::run()
    @           0x5bfd8a  process::trampoline()
    @     0x7f1a0de85160  (unknown)

I1221 19:50:07.753770 10361 master.cpp:1184] Sending 1 offers to framework 201112211130-0000000387-0130
I1221 19:50:07.765286 10361 master.cpp:679] Received reply for offer 201112211130-0000000387-101663
F1221 19:50:07.776593 10361 master.cpp:690] Check failed: offer->framework_id() == frameworkId
*** Check failure stack trace: ***
    @           0x5aa61d  google::LogMessage::Fail()
    @           0x5ace2f  google::LogMessage::SendToLog()
    @           0x5aa206  google::LogMessage::Flush()
    @           0x5ad6ad  google::LogMessageFatal::~LogMessageFatal()
    @           0x44eed9  mesos::internal::master::Master::launchTasks()
    @           0x46e42a  ProtobufProcess<>::handler4<>()
    @           0x450ed5  std::tr1::_Function_handler<>::_M_invoke()
    @           0x440158  mesos::internal::master::Master::operator()()
    @           0x5bfc39  process::ProcessManager::run()
    @           0x5bfd8a  process::trampoline()
    @     0x7f40e01d5160  (unknown)

-- 
 - Davies

Re: Mesos master crash

Posted by Matei Zaharia <ma...@eecs.berkeley.edu>.
Hi Davies,

Do you have any info on what was happening around the time of the crash?
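
One way to gather that context is to pull out the log lines that precede each fatal check failure. The following is a minimal Python sketch, not part of Mesos. It assumes the usual glog line prefix (severity letter, mmdd, time, thread id, file:line]) as seen in the traces in this thread. The names `parse_glog` and `context_before_fatal`, the `year` argument, and the 30-second window are hypothetical choices made for illustration.

```python
import re
from datetime import datetime, timedelta

# glog prefix: severity, mmdd, hh:mm:ss.uuuuuu, thread id, file:line] message
GLOG_LINE = re.compile(
    r"^([IWEF])(\d{4}) (\d{2}:\d{2}:\d{2}\.\d{6})\s+(\d+) ([^\s:]+):(\d+)\] ?(.*)$"
)


def parse_glog(lines, year=2011):
    """Return a list of log records. glog omits the year, so it is passed in."""
    records = []
    for raw in lines:
        line = raw.rstrip("\n")
        m = GLOG_LINE.match(line)
        if m:
            severity, mmdd, clock, tid, src, lineno, msg = m.groups()
            when = datetime.strptime(f"{year}{mmdd} {clock}", "%Y%m%d %H:%M:%S.%f")
            records.append({
                "severity": severity,
                "time": when,
                "thread": int(tid),
                "source": f"{src}:{lineno}",
                "msg": msg,
            })
        elif records and line.strip() and not line.lstrip().startswith(("***", "@")):
            # Continuation of the previous message (for example, a line
            # wrapped by a mail client); stack-trace lines are skipped.
            records[-1]["msg"] += " " + line.strip()
    return records


def context_before_fatal(records, window_seconds=30):
    """Pair each fatal (F) record with the records logged shortly before it."""
    pairs = []
    for i, rec in enumerate(records):
        if rec["severity"] != "F":
            continue
        start = rec["time"] - timedelta(seconds=window_seconds)
        context = [r for r in records[:i] if start <= r["time"] <= rec["time"]]
        pairs.append((rec, context))
    return pairs
```

Feeding the master's log file to `parse_glog` and then calling `context_before_fatal` on the result gives, for each `Check failed` line, the info and warning messages (offers sent, replies received, slaves re-registering) that led up to it.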

Matei
