Posted to dev@mesos.apache.org by "Yan Xu (JIRA)" <ji...@apache.org> on 2014/06/23 20:56:25 UTC

[jira] [Resolved] (MESOS-928) AllocatorZooKeeperTest/0.SlaveReregistersFirst is flaky

     [ https://issues.apache.org/jira/browse/MESOS-928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Yan Xu resolved MESOS-928.
--------------------------

    Resolution: Cannot Reproduce

Superseded by MESOS-1532, which seems to be the latest symptom of this test's flakiness.

> AllocatorZooKeeperTest/0.SlaveReregistersFirst is flaky
> -------------------------------------------------------
>
>                 Key: MESOS-928
>                 URL: https://issues.apache.org/jira/browse/MESOS-928
>             Project: Mesos
>          Issue Type: Bug
>          Components: test
>            Reporter: Yan Xu
>
> https://builds.apache.org/job/Mesos-Trunk-Ubuntu-Build-In-Src-Set-JAVA_HOME/1575/consoleFull
> I0117 22:45:27.010951  5602 sched.cpp:261] Authenticating with master master@67.195.138.8:58517
> GMOCK WARNING:
> Uninteresting mock function call - taking default action specified at:
> ./tests/mesos.hpp:407:
>     Function call: slaveAdded(@0x2b5164037140 201401172245-143311683-58517-5141-0, @0x2b51640370d0 hostname: "minerva.apache.org"
> webui_hostname: "minerva.apache.org"
> resources {
>   name: "cpus"
>   type: SCALAR
>   scalar {
>     value: 2
>   }
>   role: "*"
> }
> resources {
>   name: "mem"
>   type: SCALAR
>   scalar {
>     value: 1024
>   }
>   role: "*"
> }
> resources {
>   name: "disk"
>   type: SCALAR
>   scalar {
>     value: 23038
>   }
>   role: "*"
> }
> resources {
>   name: "ports"
>   type: RANGES
>   ranges {
>     range {
>       begin: 31000
>       end: 32000
>     }
>   }
>   role: "*"
> }
> id {
>   value: "201401172245-143311683-58517-5141-0"
> }
> checkpoint: false
> port: 58517
> , @0x2b51640370a0 { (201401172245-143311683-58517-5141-0000, { cpus(*):1, mem(*):500 }) })
> Stack trace:
> I0117 22:45:27.010987  5602 sched.cpp:230] Detecting new master
> I0117 22:45:27.011076  5601 hierarchical_allocator_process.hpp:445] Added slave 201401172245-143311683-58517-5141-0 (minerva.apache.org) with cpus(*):2; mem(*):1024; disk(*):23038; ports(*):[31000-32000] (and cpus(*):1; mem(*):524; disk(*):23038; ports(*):[31000-32000] available)
> I0117 22:45:27.011111  5601 hierarchical_allocator_process.hpp:708] Performed allocation for slave 201401172245-143311683-58517-5141-0 in 4802ns
> I0117 22:45:27.011157  5603 authenticatee.hpp:124] Creating new client SASL connection
> I0117 22:45:27.011283  5602 master.cpp:1836] Authenticating framework at scheduler(132)@67.195.138.8:58517
> I0117 22:45:27.011338  5603 authenticator.hpp:140] Creating new server SASL connection
> I0117 22:45:27.011437  5602 authenticatee.hpp:212] Received SASL authentication mechanisms: CRAM-MD5
> I0117 22:45:27.011454  5602 authenticatee.hpp:238] Attempting to authenticate with mechanism 'CRAM-MD5'
> I0117 22:45:27.011486  5602 authenticator.hpp:243] Received SASL authentication start
> I0117 22:45:27.011534  5602 authenticator.hpp:325] Authentication requires more steps
> I0117 22:45:27.011564  5602 authenticatee.hpp:258] Received SASL authentication step
> I0117 22:45:27.011606  5602 authenticator.hpp:271] Received SASL authentication step
> I0117 22:45:27.011620  5602 auxprop.cpp:81] Request to lookup properties for user: 'test-principal' realm: 'minerva.apache.org' server FQDN: 'minerva.apache.org' SASL_AUXPROP_VERIFY_AGAINST_HASH: false SASL_AUXPROP_OVERRIDE: false SASL_AUXPROP_AUTHZID: false 
> I0117 22:45:27.011631  5602 auxprop.cpp:153] Looking up auxiliary property '*userPassword'
> I0117 22:45:27.011643  5602 auxprop.cpp:153] Looking up auxiliary property '*cmusaslsecretCRAM-MD5'
> I0117 22:45:27.011653  5602 auxprop.cpp:81] Request to lookup properties for user: 'test-principal' realm: 'minerva.apache.org' server FQDN: 'minerva.apache.org' SASL_AUXPROP_VERIFY_AGAINST_HASH: false SASL_AUXPROP_OVERRIDE: false SASL_AUXPROP_AUTHZID: true 
> I0117 22:45:27.011659  5602 auxprop.cpp:103] Skipping auxiliary property '*userPassword' since SASL_AUXPROP_AUTHZID == true
> I0117 22:45:27.011664  5602 auxprop.cpp:103] Skipping auxiliary property '*cmusaslsecretCRAM-MD5' since SASL_AUXPROP_AUTHZID == true
> I0117 22:45:27.011675  5602 authenticator.hpp:317] Authentication success
> I0117 22:45:27.011710  5603 authenticatee.hpp:298] Authentication success
> I0117 22:45:27.011803  5603 sched.cpp:335] Successfully authenticated with master master@67.195.138.8:58517
> I0117 22:45:27.011803  5602 master.cpp:1876] Successfully authenticated framework at scheduler(132)@67.195.138.8:58517
> 2014-01-17 22:45:27,438:5141(0x2b5274803700):ZOO_ERROR@handle_socket_error_msg@1697: Socket [127.0.0.1:48985] zk retcode=-4, errno=111(Connection refused): server refused to accept the client
> I0117 22:45:27.980269  5603 monitor.cpp:193] Publishing resource usage for executor 'default' of framework '201401172245-143311683-58517-5141-0000'
> I0117 22:45:27.999601  5604 monitor.cpp:193] Publishing resource usage for executor 'default' of framework '201401172245-143311683-58517-5141-0000'
> I0117 22:45:28.006669  5599 hierarchical_allocator_process.hpp:688] Performed allocation for 1 slaves in 20519ns
> I0117 22:45:28.981165  5601 monitor.cpp:193] Publishing resource usage for executor 'default' of framework '201401172245-143311683-58517-5141-0000'
> I0117 22:45:29.000334  5599 monitor.cpp:193] Publishing resource usage for executor 'default' of framework '201401172245-143311683-58517-5141-0000'
> I0117 22:45:29.007385  5603 hierarchical_allocator_process.hpp:688] Performed allocation for 1 slaves in 22274ns
> I0117 22:45:29.981925  5601 monitor.cpp:193] Publishing resource usage for executor 'default' of framework '201401172245-143311683-58517-5141-0000'
> I0117 22:45:30.001018  5604 monitor.cpp:193] Publishing resource usage for executor 'default' of framework '201401172245-143311683-58517-5141-0000'
> I0117 22:45:30.008090  5603 hierarchical_allocator_process.hpp:688] Performed allocation for 1 slaves in 24251ns
> 2014-01-17 22:45:30,774:5141(0x2b5274803700):ZOO_ERROR@handle_socket_error_msg@1697: Socket [127.0.0.1:48985] zk retcode=-4, errno=111(Connection refused): server refused to accept the client
> I0117 22:45:30.982780  5602 monitor.cpp:193] Publishing resource usage for executor 'default' of framework '201401172245-143311683-58517-5141-0000'
> I0117 22:45:31.002218  5604 monitor.cpp:193] Publishing resource usage for executor 'default' of framework '201401172245-143311683-58517-5141-0000'
> I0117 22:45:31.009212  5604 hierarchical_allocator_process.hpp:688] Performed allocation for 1 slaves in 23330ns
> I0117 22:45:31.983933  5601 monitor.cpp:193] Publishing resource usage for executor 'default' of framework '201401172245-143311683-58517-5141-0000'
> I0117 22:45:32.003159  5600 monitor.cpp:193] Publishing resource usage for executor 'default' of framework '201401172245-143311683-58517-5141-0000'
> I0117 22:45:32.006222  5600 master.cpp:85] No whitelist given. Advertising offers for all slaves
> I0117 22:45:32.009299  5600 hierarchical_allocator_process.hpp:688] Performed allocation for 1 slaves in 19118ns
> I0117 22:45:32.984730  5600 monitor.cpp:193] Publishing resource usage for executor 'default' of framework '201401172245-143311683-58517-5141-0000'
> I0117 22:45:33.004012  5600 monitor.cpp:193] Publishing resource usage for executor 'default' of framework '201401172245-143311683-58517-5141-0000'
> I0117 22:45:33.010018  5599 hierarchical_allocator_process.hpp:688] Performed allocation for 1 slaves in 24777ns
> I0117 22:45:33.985347  5603 monitor.cpp:193] Publishing resource usage for executor 'default' of framework '201401172245-143311683-58517-5141-0000'
> I0117 22:45:34.004447  5603 monitor.cpp:193] Publishing resource usage for executor 'default' of framework '201401172245-143311683-58517-5141-0000'
> I0117 22:45:34.010575  5606 hierarchical_allocator_process.hpp:688] Performed allocation for 1 slaves in 27435ns
> 2014-01-17 22:45:34,111:5141(0x2b5274803700):ZOO_ERROR@handle_socket_error_msg@1697: Socket [127.0.0.1:48985] zk retcode=-4, errno=111(Connection refused): server refused to accept the client
> I0117 22:45:34.985859  5602 monitor.cpp:193] Publishing resource usage for executor 'default' of framework '201401172245-143311683-58517-5141-0000'
> I0117 22:45:35.005079  5599 monitor.cpp:193] Publishing resource usage for executor 'default' of framework '201401172245-143311683-58517-5141-0000'
> I0117 22:45:35.011052  5599 hierarchical_allocator_process.hpp:688] Performed allocation for 1 slaves in 22058ns
> I0117 22:45:35.986865  5600 monitor.cpp:193] Publishing resource usage for executor 'default' of framework '201401172245-143311683-58517-5141-0000'
> I0117 22:45:36.006248  5606 monitor.cpp:193] Publishing resource usage for executor 'default' of framework '201401172245-143311683-58517-5141-0000'
> I0117 22:45:36.011289  5606 hierarchical_allocator_process.hpp:688] Performed allocation for 1 slaves in 11259ns
> I0117 22:45:36.987643  5606 monitor.cpp:193] Publishing resource usage for executor 'default' of framework '201401172245-143311683-58517-5141-0000'
> I0117 22:45:37.007050  5599 master.cpp:85] No whitelist given. Advertising offers for all slaves
> I0117 22:45:37.007182  5599 monitor.cpp:193] Publishing resource usage for executor 'default' of framework '201401172245-143311683-58517-5141-0000'
> I0117 22:45:37.012279  5601 hierarchical_allocator_process.hpp:688] Performed allocation for 1 slaves in 44495ns
> tests/allocator_zookeeper_tests.cpp:290: Failure
> Failed to wait 10secs for slaveAdded
> tests/allocator_zookeeper_tests.cpp:284: Failure
> Actual function call count doesn't match EXPECT_CALL(allocator2, slaveAdded(_, _, _))...
>          Expected: to be called once
>            Actual: never called - unsatisfied and active



--
This message was sent by Atlassian JIRA
(v6.2#6252)