You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@mesos.apache.org by "Chun-Hung Hsiao (JIRA)" <ji...@apache.org> on 2018/02/26 21:46:00 UTC
[jira] [Created] (MESOS-8613) Test
`MasterAllocatorTest/*.TaskFinished` is flaky
Chun-Hung Hsiao created MESOS-8613:
--------------------------------------
Summary: Test `MasterAllocatorTest/*.TaskFinished` is flaky
Key: MESOS-8613
URL: https://issues.apache.org/jira/browse/MESOS-8613
Project: Mesos
Issue Type: Bug
Affects Versions: 1.5.0
Reporter: Chun-Hung Hsiao
Assignee: Till Toenshoff
Occasionally the test would crash with the following logs:
{noformat}
I0225 06:34:21.732908 1835 slave.cpp:3467] Launching container 46824279-01b3-4dcb-9be0-696cdacefe2f for executor 'default' of framework f1caa559-f62f-40f6-9786-100401bc9062-0000
I0225 06:34:21.733026 1835 slave.cpp:5705] Forwarding the update TASK_LOST (Status UUID: 29c6ae17-0e46-4dea-be36-e9e64e1d95c5) for task 1 of framework f1caa559-f62f-40f6-9786-100401bc9062-0000 to master@172.16.10.21:58470
I0225 06:34:21.733083 1835 slave.cpp:5598] Task status update manager successfully handled status update TASK_LOST (Status UUID: 29c6ae17-0e46-4dea-be36-e9e64e1d95c5) for task 1 of framework f1caa559-f62f-40f6-9786-100401bc9062-0000
I0225 06:34:21.733202 1835 master.cpp:7894] Status update TASK_LOST (Status UUID: 29c6ae17-0e46-4dea-be36-e9e64e1d95c5) for task 1 of framework f1caa559-f62f-40f6-9786-100401bc9062-0000 from agent f1caa559-f62f-40f6-9786-100401bc9062-S0 at sla
ve(128)@172.16.10.21:58470 (ip-172-16-10-21.ec2.internal)
I0225 06:34:21.733232 1835 master.cpp:7950] Forwarding status update TASK_LOST (Status UUID: 29c6ae17-0e46-4dea-be36-e9e64e1d95c5) for task 1 of framework f1caa559-f62f-40f6-9786-100401bc9062-0000
I0225 06:34:21.733273 1835 master.cpp:10258] Updating the state of task 1 of framework f1caa559-f62f-40f6-9786-100401bc9062-0000 (latest state: TASK_LOST, status update state: TASK_LOST)
*** Aborted at 1519540461 (unix time) try "date -d @1519540461" if you are using GNU date ***
I0225 06:34:21.733470 1838 sched.cpp:1027] Scheduler::statusUpdate took 8126ns
I0225 06:34:21.733700 1838 hierarchical.cpp:1192] Recovered cpus(allocated: *):1; mem(allocated: *):256 (total: cpus:3; mem:1024; disk:35056; ports:[31000-32000], allocated: cpus(allocated: *):1; mem(allocated: *):256) on agent f1caa559-f62f-4
0f6-9786-100401bc9062-S0 from framework f1caa559-f62f-40f6-9786-100401bc9062-0000
WPC: @ 0x20 (unknown)
0225 06:34:21.734422 1836 process.cpp:2805] Attempted to spawn already running process version@172.16.10.21:58470
I0225 06:34:21.734592 1836 exec.cpp:162] Version: 1.6.0
I0225 06:34:21.734761 1837 exec.cpp:212] Executor started at: executor(32)@172.16.10.21:58470 with pid 1814
I0225 06:34:21.734835 1837 slave.cpp:4747] Got registration for executor 'default' of framework f1caa559-f62f-40f6-9786-100401bc9062-0000 from executor(32)@172.16.10.21:58470
I0225 06:34:21.734966 1840 exec.cpp:236] Executor registered on agent f1caa559-f62f-40f6-9786-100401bc9062-S0
I0225 06:34:21.734990 1840 exec.cpp:248] Executor::registered took 9639ns
I0225 06:34:21.740001 1836 slave.cpp:3199] Sending queued task '0' to executor 'default' of framework f1caa559-f62f-40f6-9786-100401bc9062-0000 at executor(32)@172.16.10.21:58470
I0225 06:34:21.740236 1840 exec.cpp:330] Executor asked to run task '0'
I0225 06:34:21.740284 1840 exec.cpp:339] Executor::launchTask took 28783ns
I0225 06:34:21.740320 1840 exec.cpp:581] Executor sending status update TASK_RUNNING (Status UUID: 7cf0069b-a6dc-4d7e-9e16-7ade7a451334) for task 0 of framework f1caa559-f62f-40f6-9786-100401bc9062-0000
I0225 06:34:21.740419 1840 slave.cpp:5213] Handling status update TASK_RUNNING (Status UUID: 7cf0069b-a6dc-4d7e-9e16-7ade7a451334) for task 0 of framework f1caa559-f62f-40f6-9786-100401bc9062-0000 from executor(32)@172.16.10.21:58470
I0225 06:34:21.740563 1840 task_status_update_manager.cpp:328] Received task status update TASK_RUNNING (Status UUID: 7cf0069b-a6dc-4d7e-9e16-7ade7a451334) for task 0 of framework f1caa559-f62f-40f6-9786-100401bc9062-0000
I0225 06:34:21.740583 1840 task_status_update_manager.cpp:507] Creating StatusUpdate stream for task 0 of framework f1caa559-f62f-40f6-9786-100401bc9062-0000
I0225 06:34:21.740676 1840 task_status_update_manager.cpp:383] Forwarding task status update TASK_RUNNING (Status UUID: 7cf0069b-a6dc-4d7e-9e16-7ade7a451334) for task 0 of framework f1caa559-f62f-40f6-9786-100401bc9062-0000 to the agent
I0225 06:34:21.740733 1840 slave.cpp:5705] Forwarding the update TASK_RUNNING (Status UUID: 7cf0069b-a6dc-4d7e-9e16-7ade7a451334) for task 0 of framework f1caa559-f62f-40f6-9786-100401bc9062-0000 to master@172.16.10.21:58470
I0225 06:34:21.740799 1840 slave.cpp:5598] Task status update manager successfully handled status update TASK_RUNNING (Status UUID: 7cf0069b-a6dc-4d7e-9e16-7ade7a451334) for task 0 of framework f1caa559-f62f-40f6-9786-100401bc9062-0000
I0225 06:34:21.740828 1840 slave.cpp:5614] Sending acknowledgement for status update TASK_RUNNING (Status UUID: 7cf0069b-a6dc-4d7e-9e16-7ade7a451334) for task 0 of framework f1caa559-f62f-40f6-9786-100401bc9062-0000 to executor(32)@172.16.10.21:58470
I0225 06:34:21.740886 1840 exec.cpp:398] Executor received status update acknowledgement 7cf0069b-a6dc-4d7e-9e16-7ade7a451334 for task 0 of framework f1caa559-f62f-40f6-9786-100401bc9062-0000
{noformat}
This has been observed multiple times and every time the test crashed right after
{noformat}
I0225 06:34:21.733273 1835 master.cpp:10258] Updating the state of task 1 of framework f1caa559-f62f-40f6-9786-100401bc9062-0000 (latest state: TASK_LOST, status update state: TASK_LOST)
{noformat}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)