Posted to dev@mesos.apache.org by "Yan Xu (JIRA)" <ji...@apache.org> on 2014/01/13 19:58:54 UTC

[jira] [Created] (MESOS-901) ZooKeeperMasterContenderDetectorTest.MasterDetectorExpireSlaveZKSessionNewMaster is flaky

Yan Xu created MESOS-901:
----------------------------

             Summary: ZooKeeperMasterContenderDetectorTest.MasterDetectorExpireSlaveZKSessionNewMaster is flaky
                 Key: MESOS-901
                 URL: https://issues.apache.org/jira/browse/MESOS-901
             Project: Mesos
          Issue Type: Bug
          Components: test
    Affects Versions: 0.16.0
         Environment: ubuntu-13.04-gcc
            Reporter: Yan Xu


I0110 19:40:52.043822 10697 zookeeper_test_server.cpp:141] Started ZooKeeperTestServer on port 59493
I0110 19:40:52.045255 10718 contender.cpp:124] Joining the ZK group with data: '@128.150.152.0:10000'
2014-01-10 19:40:52,046:10697(0x2ab24b3d3700):ZOO_INFO@log_env@658: Client environment:zookeeper.version=zookeeper C client 3.3.4
2014-01-10 19:40:52,046:10697(0x2ab24b3d3700):ZOO_INFO@log_env@662: Client environment:host.name=raring
2014-01-10 19:40:52,046:10697(0x2ab24b3d3700):ZOO_INFO@log_env@669: Client environment:os.name=Linux
2014-01-10 19:40:52,046:10697(0x2ab24b3d3700):ZOO_INFO@log_env@670: Client environment:os.arch=3.8.0-35-generic
2014-01-10 19:40:52,046:10697(0x2ab24b3d3700):ZOO_INFO@log_env@671: Client environment:os.version=#50-Ubuntu SMP Tue Dec 3 01:24:59 UTC 2013
2014-01-10 19:40:52,046:10697(0x2ab24b3d3700):ZOO_INFO@log_env@679: Client environment:user.name=(null)
2014-01-10 19:40:52,046:10697(0x2ab24b3d3700):ZOO_INFO@log_env@687: Client environment:user.home=/home/jenkins
2014-01-10 19:40:52,046:10697(0x2ab24b3d3700):ZOO_INFO@log_env@699: Client environment:user.dir=<http://a.b.c.d:8080/job/mesos-ubuntu-13.04-gcc/ws/src>
2014-01-10 19:40:52,046:10697(0x2ab24b3d3700):ZOO_INFO@zookeeper_init@727: Initiating client connection, host=127.0.0.1:59493 sessionTimeout=10000 watcher=0x2ab2438ee760 sessionId=0 sessionPasswd=<null> context=0x2ab264009bb0 flags=0
2014-01-10 19:40:52,049:10697(0x2ab274c9e700):ZOO_INFO@check_events@1585: initiated connection to server [127.0.0.1:59493]
2014-01-10 19:40:52,056:10697(0x2ab274c9e700):ZOO_INFO@check_events@1632: session establishment complete on server [127.0.0.1:59493], sessionId=0x1437f6351c80000, negotiated timeout=10000
I0110 19:40:52.058683 10723 group.cpp:307] Group process ((972)@127.0.1.1:46375) connected to ZooKeeper
I0110 19:40:52.058718 10723 group.cpp:724] Syncing group operations: queue size (joins, cancels, datas) = (1, 0, 0)
I0110 19:40:52.058729 10723 group.cpp:364] Trying to create path '/mesos' in ZooKeeper
I0110 19:40:52.075853 10717 contender.cpp:221] New candidate (id='0', data='@128.150.152.0:10000') has entered the contest for leadership
I0110 19:40:52.077625 10723 detector.cpp:120] Detected a new leader: '(id='0')
I0110 19:40:52.077756 10723 group.cpp:611] Trying to get '/mesos/0000000000' in ZooKeeper
I0110 19:40:52.080947 10723 detector.cpp:310] A new leading master (UPID=@128.150.152.0:10000) is detected
I0110 19:40:52.081331 10718 contender.cpp:124] Joining the ZK group with data: '@129.150.152.0:10001'
2014-01-10 19:40:52,081:10697(0x2ab24add0700):ZOO_INFO@log_env@658: Client environment:zookeeper.version=zookeeper C client 3.3.4
2014-01-10 19:40:52,081:10697(0x2ab24add0700):ZOO_INFO@log_env@662: Client environment:host.name=raring
2014-01-10 19:40:52,081:10697(0x2ab24add0700):ZOO_INFO@log_env@669: Client environment:os.name=Linux
2014-01-10 19:40:52,081:10697(0x2ab24add0700):ZOO_INFO@log_env@670: Client environment:os.arch=3.8.0-35-generic
2014-01-10 19:40:52,081:10697(0x2ab24add0700):ZOO_INFO@log_env@671: Client environment:os.version=#50-Ubuntu SMP Tue Dec 3 01:24:59 UTC 2013
2014-01-10 19:40:52,081:10697(0x2ab24add0700):ZOO_INFO@log_env@679: Client environment:user.name=(null)
2014-01-10 19:40:52,081:10697(0x2ab24add0700):ZOO_INFO@log_env@687: Client environment:user.home=/home/jenkins
2014-01-10 19:40:52,081:10697(0x2ab24add0700):ZOO_INFO@log_env@699: Client environment:user.dir=<http://a.b.c.d:8080/job/mesos-ubuntu-13.04-gcc/ws/src>
2014-01-10 19:40:52,081:10697(0x2ab24add0700):ZOO_INFO@zookeeper_init@727: Initiating client connection, host=127.0.0.1:59493 sessionTimeout=10000 watcher=0x2ab2438ee760 sessionId=0 sessionPasswd=<null> context=0x2ab26c005ff0 flags=0
2014-01-10 19:40:52,087:10697(0x2ab273e97700):ZOO_INFO@check_events@1585: initiated connection to server [127.0.0.1:59493]
2014-01-10 19:40:52,095:10697(0x2ab273e97700):ZOO_INFO@check_events@1632: session establishment complete on server [127.0.0.1:59493], sessionId=0x1437f6351c80001, negotiated timeout=10000
I0110 19:40:52.096133 10721 group.cpp:307] Group process ((978)@127.0.1.1:46375) connected to ZooKeeper
I0110 19:40:52.096168 10721 group.cpp:724] Syncing group operations: queue size (joins, cancels, datas) = (1, 0, 0)
I0110 19:40:52.096194 10721 group.cpp:364] Trying to create path '/mesos' in ZooKeeper
I0110 19:40:52.106248 10719 contender.cpp:221] New candidate (id='1', data='@129.150.152.0:10001') has entered the contest for leadership
I0110 19:40:52.108146 10721 detector.cpp:120] Detected a new leader: '(id='0')
I0110 19:40:52.108271 10721 group.cpp:611] Trying to get '/mesos/0000000000' in ZooKeeper
I0110 19:40:52.109643 10721 detector.cpp:310] A new leading master (UPID=@128.150.152.0:10000) is detected
2014-01-10 19:40:52,109:10697(0x2ab24b7d5700):ZOO_INFO@log_env@658: Client environment:zookeeper.version=zookeeper C client 3.3.4
2014-01-10 19:40:52,110:10697(0x2ab24b7d5700):ZOO_INFO@log_env@662: Client environment:host.name=raring
2014-01-10 19:40:52,110:10697(0x2ab24b7d5700):ZOO_INFO@log_env@669: Client environment:os.name=Linux
2014-01-10 19:40:52,110:10697(0x2ab24b7d5700):ZOO_INFO@log_env@670: Client environment:os.arch=3.8.0-35-generic
2014-01-10 19:40:52,110:10697(0x2ab24b7d5700):ZOO_INFO@log_env@671: Client environment:os.version=#50-Ubuntu SMP Tue Dec 3 01:24:59 UTC 2013
2014-01-10 19:40:52,110:10697(0x2ab24b7d5700):ZOO_INFO@log_env@679: Client environment:user.name=(null)
2014-01-10 19:40:52,110:10697(0x2ab24b7d5700):ZOO_INFO@log_env@687: Client environment:user.home=/home/jenkins
2014-01-10 19:40:52,110:10697(0x2ab24b7d5700):ZOO_INFO@log_env@699: Client environment:user.dir=<http://a.b.c.d:8080/job/mesos-ubuntu-13.04-gcc/ws/src>
2014-01-10 19:40:52,110:10697(0x2ab24b7d5700):ZOO_INFO@zookeeper_init@727: Initiating client connection, host=127.0.0.1:59493 sessionTimeout=10000 watcher=0x2ab2438ee760 sessionId=0 sessionPasswd=<null> context=0x2ab24c015b20 flags=0
2014-01-10 19:40:52,112:10697(0x2ab274a9d700):ZOO_INFO@check_events@1585: initiated connection to server [127.0.0.1:59493]
2014-01-10 19:40:52,123:10697(0x2ab274a9d700):ZOO_INFO@check_events@1632: session establishment complete on server [127.0.0.1:59493], sessionId=0x1437f6351c80002, negotiated timeout=10000
I0110 19:40:52.125962 10720 group.cpp:307] Group process ((984)@127.0.1.1:46375) connected to ZooKeeper
I0110 19:40:52.125998 10720 group.cpp:724] Syncing group operations: queue size (joins, cancels, datas) = (0, 0, 0)
I0110 19:40:52.126008 10720 group.cpp:364] Trying to create path '/mesos' in ZooKeeper
I0110 19:40:52.141446 10720 detector.cpp:120] Detected a new leader: '(id='0')
I0110 19:40:52.141608 10720 group.cpp:611] Trying to get '/mesos/0000000000' in ZooKeeper
I0110 19:40:52.143333 10720 detector.cpp:310] A new leading master (UPID=@128.150.152.0:10000) is detected
2014-01-10 19:40:52,145:10697(0x2ab274a9d700):ZOO_ERROR@handle_socket_error_msg@1603: Socket [127.0.0.1:59493] zk retcode=-4, errno=112(Host is down): failed while receiving a server response
I0110 19:40:52.146263 10718 group.cpp:397] Lost connection to ZooKeeper, attempting to reconnect ...
2014-01-10 19:40:52,148:10697(0x2ab274c9e700):ZOO_ERROR@handle_socket_error_msg@1603: Socket [127.0.0.1:59493] zk retcode=-4, errno=112(Host is down): failed while receiving a server response
I0110 19:40:52.148707 10718 group.cpp:397] Lost connection to ZooKeeper, attempting to reconnect ...
I0110 19:40:52.150110 10716 detector.cpp:108] The current leader (id=0) is lost
I0110 19:40:52.150136 10716 detector.cpp:120] Detected a new leader: '(id='1')
I0110 19:40:52.150254 10716 group.cpp:611] Trying to get '/mesos/0000000001' in ZooKeeper
I0110 19:40:52.151603 10716 detector.cpp:310] A new leading master (UPID=@129.150.152.0:10001) is detected
2014-01-10 19:40:53,883:10697(0x2ab274098700):ZOO_ERROR@handle_socket_error_msg@1579: Socket [127.0.0.1:53165] zk retcode=-4, errno=111(Connection refused): server refused to accept the client
2014-01-10 19:40:55,482:10697(0x2ab274a9d700):ZOO_INFO@check_events@1585: initiated connection to server [127.0.0.1:59493]
2014-01-10 19:40:55,482:10697(0x2ab274c9e700):ZOO_INFO@check_events@1585: initiated connection to server [127.0.0.1:59493]
2014-01-10 19:40:55,489:10697(0x2ab274a9d700):ZOO_ERROR@handle_socket_error_msg@1621: Socket [127.0.0.1:59493] zk retcode=-112, errno=116(Stale NFS file handle): sessionId=0x1437f6351c80002 has expired.
2014-01-10 19:40:55,490:10697(0x2ab24abcf700):ZOO_INFO@zookeeper_close@2321: Freeing zookeeper resources for sessionId=0x1437f6351c80002

2014-01-10 19:40:55,490:10697(0x2ab24abcf700):ZOO_INFO@log_env@658: Client environment:zookeeper.version=zookeeper C client 3.3.4
2014-01-10 19:40:55,490:10697(0x2ab24abcf700):ZOO_INFO@log_env@662: Client environment:host.name=raring
2014-01-10 19:40:55,490:10697(0x2ab24abcf700):ZOO_INFO@log_env@669: Client environment:os.name=Linux
2014-01-10 19:40:55,490:10697(0x2ab24abcf700):ZOO_INFO@log_env@670: Client environment:os.arch=3.8.0-35-generic
2014-01-10 19:40:55,490:10697(0x2ab24abcf700):ZOO_INFO@log_env@671: Client environment:os.version=#50-Ubuntu SMP Tue Dec 3 01:24:59 UTC 2013
2014-01-10 19:40:55,490:10697(0x2ab24abcf700):ZOO_INFO@log_env@679: Client environment:user.name=(null)
2014-01-10 19:40:55,490:10697(0x2ab24abcf700):ZOO_INFO@log_env@687: Client environment:user.home=/home/jenkins
2014-01-10 19:40:55,490:10697(0x2ab24abcf700):ZOO_INFO@log_env@699: Client environment:user.dir=<http://a.b.c.d:8080/job/mesos-ubuntu-13.04-gcc/ws/src>
2014-01-10 19:40:55,490:10697(0x2ab24abcf700):ZOO_INFO@zookeeper_init@727: Initiating client connection, host=127.0.0.1:59493 sessionTimeout=10000 watcher=0x2ab2438ee760 sessionId=0 sessionPasswd=<null> context=0x2ab26c008b30 flags=0
2014-01-10 19:40:55,492:10697(0x2ab274c9e700):ZOO_ERROR@handle_socket_error_msg@1621: Socket [127.0.0.1:59493] zk retcode=-112, errno=116(Stale NFS file handle): sessionId=0x1437f6351c80000 has expired.
I0110 19:40:55.492712 10722 contender.cpp:185] Membership cancelled: 0
2014-01-10 19:40:55,492:10697(0x2ab24b1d2700):ZOO_INFO@zookeeper_close@2321: Freeing zookeeper resources for sessionId=0x1437f6351c80000

2014-01-10 19:40:55,493:10697(0x2ab24b1d2700):ZOO_INFO@log_env@658: Client environment:zookeeper.version=zookeeper C client 3.3.4
2014-01-10 19:40:55,493:10697(0x2ab24b1d2700):ZOO_INFO@log_env@662: Client environment:host.name=raring
2014-01-10 19:40:55,493:10697(0x2ab24b1d2700):ZOO_INFO@log_env@669: Client environment:os.name=Linux
2014-01-10 19:40:55,493:10697(0x2ab24b1d2700):ZOO_INFO@log_env@670: Client environment:os.arch=3.8.0-35-generic
2014-01-10 19:40:55,493:10697(0x2ab24b1d2700):ZOO_INFO@log_env@671: Client environment:os.version=#50-Ubuntu SMP Tue Dec 3 01:24:59 UTC 2013
2014-01-10 19:40:55,493:10697(0x2ab24b1d2700):ZOO_INFO@log_env@679: Client environment:user.name=(null)
2014-01-10 19:40:55,493:10697(0x2ab24b1d2700):ZOO_INFO@log_env@687: Client environment:user.home=/home/jenkins
2014-01-10 19:40:55,493:10697(0x2ab24b1d2700):ZOO_INFO@log_env@699: Client environment:user.dir=<http://a.b.c.d:8080/job/mesos-ubuntu-13.04-gcc/ws/src>
2014-01-10 19:40:55,493:10697(0x2ab24b1d2700):ZOO_INFO@zookeeper_init@727: Initiating client connection, host=127.0.0.1:59493 sessionTimeout=10000 watcher=0x2ab2438ee760 sessionId=0 sessionPasswd=<null> context=0x2ab25c00f1f0 flags=0
2014-01-10 19:40:55,494:10697(0x2ab274299700):ZOO_INFO@check_events@1585: initiated connection to server [127.0.0.1:59493]
2014-01-10 19:40:55,502:10697(0x2ab274299700):ZOO_INFO@check_events@1632: session establishment complete on server [127.0.0.1:59493], sessionId=0x1437f6351c80003, negotiated timeout=10000
I0110 19:40:55.505771 10723 group.cpp:307] Group process ((972)@127.0.1.1:46375) connected to ZooKeeper
I0110 19:40:55.505795 10723 group.cpp:724] Syncing group operations: queue size (joins, cancels, datas) = (0, 0, 0)
I0110 19:40:55.505805 10723 group.cpp:364] Trying to create path '/mesos' in ZooKeeper
2014-01-10 19:40:55,504:10697(0x2ab2754a2700):ZOO_INFO@check_events@1585: initiated connection to server [127.0.0.1:59493]
2014-01-10 19:40:55,509:10697(0x2ab2754a2700):ZOO_INFO@check_events@1632: session establishment complete on server [127.0.0.1:59493], sessionId=0x1437f6351c80004, negotiated timeout=10000
I0110 19:40:55.510289 10720 group.cpp:307] Group process ((984)@127.0.1.1:46375) connected to ZooKeeper
I0110 19:40:55.510308 10720 group.cpp:724] Syncing group operations: queue size (joins, cancels, datas) = (0, 0, 0)
I0110 19:40:55.510315 10720 group.cpp:364] Trying to create path '/mesos' in ZooKeeper
I0110 19:40:55.521286 10720 detector.cpp:108] The current leader (id=0) is lost
I0110 19:40:55.521314 10720 detector.cpp:120] Detected a new leader: '(id='1')
I0110 19:40:55.521425 10720 group.cpp:611] Trying to get '/mesos/0000000001' in ZooKeeper
I0110 19:40:55.522613 10723 detector.cpp:108] The current leader (id=0) is lost
I0110 19:40:55.522634 10723 detector.cpp:120] Detected a new leader: '(id='1')
I0110 19:40:55.522704 10723 group.cpp:611] Trying to get '/mesos/0000000001' in ZooKeeper
I0110 19:40:55.526757 10720 detector.cpp:310] A new leading master (UPID=@129.150.152.0:10001) is detected
2014-01-10 19:40:55,527:10697(0x2ab243034b40):ZOO_INFO@zookeeper_close@2304: Closing zookeeper sessionId=0x1437f6351c80004 to [127.0.0.1:59493]

I0110 19:40:55.527711 10697 contender.cpp:175] Now cancelling the membership: 1
2014-01-10 19:40:55,527:10697(0x2ab243034b40):ZOO_INFO@zookeeper_close@2304: Closing zookeeper sessionId=0x1437f6351c80001 to [127.0.0.1:59493]

I0110 19:40:55.528144 10697 contender.cpp:175] Now cancelling the membership: 0
2014-01-10 19:40:55,534:10697(0x2ab243034b40):ZOO_INFO@zookeeper_close@2304: Closing zookeeper sessionId=0x1437f6351c80003 to [127.0.0.1:59493]

pure virtual method called
terminate called without an active exception


