You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mesos.apache.org by "Benjamin Hindman (JIRA)" <ji...@apache.org> on 2013/05/22 17:21:19 UTC

[jira] [Commented] (MESOS-479) SlaveRecoveryTest/0.CleanupExecutor failure.

    [ https://issues.apache.org/jira/browse/MESOS-479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13664180#comment-13664180 ] 

Benjamin Hindman commented on MESOS-479:
----------------------------------------

The test also hung on failure here:

Thread 1 (Thread 0x7fc8364b5720 (LWP 36613)):
#0  0x00007fc8349daee9 in pthread_cond_wait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0
#1  0x00007fc835223c9b in arrive (this=0x292e450, pid=<value optimized out>) at ../../../third_party/libprocess/src/gate.hpp:73
#2  process::ProcessManager::wait (this=0x292e450, pid=<value optimized out>) at ../../../third_party/libprocess/src/process.cpp:2669
#3  0x00007fc835233545 in process::wait (pid=..., duration=...) at ../../../third_party/libprocess/src/process.cpp:3109
#4  0x00007fc834ef091e in wait (this=0x7fff81049190, __in_chrg=<value optimized out>) at ../../third_party/libprocess/include/process/process.hpp:357
#5  mesos::MesosSchedulerDriver::~MesosSchedulerDriver (this=0x7fff81049190, __in_chrg=<value optimized out>) at ../../src/sched/sched.cpp:757
#6  0x00000000005d8f23 in SlaveRecoveryTest_CleanupExecutor_Test<mesos::internal::slave::ProcessIsolator>::TestBody (this=<value optimized out>)
    at ../../src/tests/slave_recovery_tests.cpp:771
#7  0x00000000008646eb in testing::internal::HandleExceptionsInMethodIfSupported<testing::Test, void> (object=0x7fc8240055b0, 
    method=&virtual testing::Test::TestBody(), location=0x92fd37 "the test body") at gmock-1.6.0/gtest/src/gtest.cc:2126
#8  0x0000000000858579 in testing::Test::Run (this=0x7fc8240055b0) at gmock-1.6.0/gtest/src/gtest.cc:2162
#9  0x00000000008586b5 in testing::TestInfo::Run (this=0x2926f30) at gmock-1.6.0/gtest/src/gtest.cc:2338
#10 0x000000000085876d in testing::TestCase::Run (this=0x2925be0) at gmock-1.6.0/gtest/src/gtest.cc:2445
#11 0x000000000085b94d in testing::internal::UnitTestImpl::RunAllTests (this=0x2907f30) at gmock-1.6.0/gtest/src/gtest.cc:4237
#12 0x000000000085c345 in testing::internal::HandleSehExceptionsInMethodIfSupported<testing::internal::UnitTestImpl, bool> (object=<value optimized out>, 
    method=(bool (testing::internal::UnitTestImpl::*)(testing::internal::UnitTestImpl *)) 0x85b6d0 <testing::internal::UnitTestImpl::RunAllTests()>, 
    location=0xffffffffffffffff <Address 0xffffffffffffffff out of bounds>) at gmock-1.6.0/gtest/src/gtest.cc:2090
#13 0x000000000086426e in testing::internal::HandleExceptionsInMethodIfSupported<testing::internal::UnitTestImpl, bool> (object=0x2907f30, method=
    (bool (testing::internal::UnitTestImpl::*)(testing::internal::UnitTestImpl *)) 0x85b6d0 <testing::internal::UnitTestImpl::RunAllTests()>, 
---Type <return> to continue, or q <return> to quit---
    location=0x9319c0 "auxiliary test code (environments or event listeners)") at gmock-1.6.0/gtest/src/gtest.cc:2126
#14 0x0000000000857afa in testing::UnitTest::Run (this=<value optimized out>) at gmock-1.6.0/gtest/src/gtest.cc:3874
#15 0x000000000048c3a2 in main (argc=1, argv=0x7fc82c002020) at ../../src/tests/main.cpp:109
                
> SlaveRecoveryTest/0.CleanupExecutor failure.
> --------------------------------------------
>
>                 Key: MESOS-479
>                 URL: https://issues.apache.org/jira/browse/MESOS-479
>             Project: Mesos
>          Issue Type: Bug
>            Reporter: Benjamin Hindman
>
> [ RUN      ] SlaveRecoveryTest/0.CleanupExecutor
> Checkpointing SlaveInfo to '/tmp/SlaveRecoveryTest_0_CleanupExecutor_WDbjOB/meta/slaves/201305220702-1828659978-49643-36613-0/slave.info'
> Checkpointing FrameworkInfo to '/tmp/SlaveRecoveryTest_0_CleanupExecutor_WDbjOB/meta/slaves/201305220702-1828659978-49643-36613-0/frameworks/201305220702-1828659978-49643-36613-0000/framework.info'
> Checkpointing 'scheduler(84)@10.35.255.108:49643' to '/tmp/SlaveRecoveryTest_0_CleanupExecutor_WDbjOB/meta/slaves/201305220702-1828659978-49643-36613-0/frameworks/201305220702-1828659978-49643-36613-0000/framework.pid'
> Checkpointing ExecutorInfo to '/tmp/SlaveRecoveryTest_0_CleanupExecutor_WDbjOB/meta/slaves/201305220702-1828659978-49643-36613-0/frameworks/201305220702-1828659978-49643-36613-0000/executors/07360cb3-7b42-44b5-9942-b27802a18224/executor.info'
> Checkpointing Task to '/tmp/SlaveRecoveryTest_0_CleanupExecutor_WDbjOB/meta/slaves/201305220702-1828659978-49643-36613-0/frameworks/201305220702-1828659978-49643-36613-0000/executors/07360cb3-7b42-44b5-9942-b27802a18224/runs/60657969-3cdc-46e3-ba9e-51c8db502ef9/tasks/07360cb3-7b42-44b5-9942-b27802a18224/task.info'
> Checkpointing forked pid 38518
> Checkpointing '38518' to '/tmp/SlaveRecoveryTest_0_CleanupExecutor_WDbjOB/meta/slaves/201305220702-1828659978-49643-36613-0/frameworks/201305220702-1828659978-49643-36613-0000/executors/07360cb3-7b42-44b5-9942-b27802a18224/runs/60657969-3cdc-46e3-ba9e-51c8db502ef9/pids/forked.pid'
> Fetching resources into '/tmp/SlaveRecoveryTest_0_CleanupExecutor_WDbjOB/slaves/201305220702-1828659978-49643-36613-0/frameworks/201305220702-1828659978-49643-36613-0000/executors/07360cb3-7b42-44b5-9942-b27802a18224/runs/60657969-3cdc-46e3-ba9e-51c8db502ef9'
> Checkpointing 'executor(1)@10.35.255.108:48801' to '/tmp/SlaveRecoveryTest_0_CleanupExecutor_WDbjOB/meta/slaves/201305220702-1828659978-49643-36613-0/frameworks/201305220702-1828659978-49643-36613-0000/executors/07360cb3-7b42-44b5-9942-b27802a18224/runs/60657969-3cdc-46e3-ba9e-51c8db502ef9/pids/libprocess.pid'
> Registered executor on smfd-atr-11-sr1.devel.twitter.com
> Starting task 07360cb3-7b42-44b5-9942-b27802a18224
> Forked command at 38572
> sh -c 'sleep 1000'
> Checkpointing 'scheduler(84)@10.35.255.108:49643' to '/tmp/SlaveRecoveryTest_0_CleanupExecutor_WDbjOB/meta/slaves/201305220702-1828659978-49643-36613-0/frameworks/201305220702-1828659978-49643-36613-0000/framework.pid'
> Checkpointing 'scheduler(84)@10.35.255.108:49643' to '/tmp/SlaveRecoveryTest_0_CleanupExecutor_WDbjOB/meta/slaves/201305220702-1828659978-49643-36613-0/frameworks/201305220702-1828659978-49643-36613-0000/framework.pid'
> libprocess: process-isolator(19)@10.35.255.108:49643 terminating due to basic_filebuf::underflow error reading the file
> Waited on process 38572, returned status 15
> WARNING: Logging before InitGoogleLogging() is written to STDERR
> I0522 07:02:54.414633 38554 process_utils.hpp:64] Stopping ... 38572
> Group members:
>   PID  PPID  PGID  SESS COMMAND
> 38572 38518 38572 38572 sleep 1000
> Session members:
>   PID  PPID  PGID  SESS COMMAND
> 38572 38518 38572 38572 sleep 1000
> Sent signal to 38572
> GMOCK WARNING:
> Uninteresting mock function call - returning directly.
>     Function call: slaveLost(0x7fff81049190, @0x7fc814001eb0 201305220702-1828659978-49643-36613-0)
> Stack trace:
> ../../src/tests/slave_recovery_tests.cpp:764: Failure
> Value of: status.get().state()
>   Actual: TASK_LOST
> Expected: TASK_FAILED

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira