Posted to issues@geode.apache.org by "Swapnil Bawaskar (JIRA)" <ji...@apache.org> on 2016/02/10 00:20:18 UTC

[jira] [Commented] (GEODE-446) CI failure: PartitionedRegionHAFailureAndRecoveryDUnitTest.testRecoveryOfSingleMemberFailure

    [ https://issues.apache.org/jira/browse/GEODE-446?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15139983#comment-15139983 ] 

Swapnil Bawaskar commented on GEODE-446:
----------------------------------------

One of the members was not able to talk to the locator for about 32 seconds. The test failed after 30 seconds.
In the new log, I see a 15-second warning, followed by an IOException.

{noformat}
[vm_0][warn 2015/10/26 10:29:05.610 PDT <RMI TCP Connection(10)-10.118.33.207> tid=0xf39] 15 seconds have elapsed while waiting for replies: <UpdateAttributesProcessor$UpdateAttributesReplyProcessor 33155 waiting for 1 replies from [cc8-rh64(8770:locator)<v0>:42343]> on cc8-rh64(8782)<v257>:7795 whose current membership list is: [[cc8-rh64(8770:locator)<v0>:42343, cc8-rh64(8800)<v250>:64079, cc8-rh64(8791)<v255>:12640, cc8-rh64(8782)<v257>:7795, cc8-rh64(8795)<v249>:61553]]
[vm_0]
[locator][info 2015/10/26 10:29:20.766 PDT <Pooled High Priority Message Processor 1> tid=0x41] Connection: shared=true ordered=false failed to connect to peer cc8-rh64(8782)<v257>:7795 because: java.io.IOException: 30000 millis timeout while waiting for channel to be ready for connect. ch : java.nio.channels.SocketChannel[closed]
[locator]
{noformat}

The test fails after waiting for 30 seconds, and yet after 32 seconds, the attempt to reconnect is successful and UpdateAttributesProcessor gets a reply:
{noformat}
[locator][info 2015/10/26 10:29:22.778 PDT <Pooled High Priority Message Processor 1> tid=0x41] cc8-rh64(8770:locator)<v0>:42343: Successfully reestablished connection to peer cc8-rh64(8782)<v257>:7795
[locator]
[vm_0][info 2015/10/26 10:29:22.778 PDT <P2P message reader for cc8-rh64(8770:locator)<v0>:42343 shared unordered uid=129 port=55337> tid=0x102e] Member cc8-rh64(8770:locator)<v0>:42343 is equivalent or in the same redundancy zone.
[vm_0]
[vm_0][info 2015/10/26 10:29:22.779 PDT <RMI TCP Connection(10)-10.118.33.207> tid=0xf39] UpdateAttributesProcessor$UpdateAttributesReplyProcessor wait for replies completed
[vm_0]
{noformat}
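
For reference, the 30 and 32 second figures can be checked against the timestamps in the two excerpts above. The following is only a rough sketch: the class and method names are made up for illustration, and it assumes the "15 seconds have elapsed" warning is logged 15 seconds after the wait begins, which the log itself does not state.

```java
import java.time.Duration;
import java.time.LocalDateTime;
import java.time.format.DateTimeFormatter;

// Hypothetical helper, not part of Geode: replays the timestamps quoted above.
class ReplyWaitTimeline {
    // Layout of the log timestamps, with the trailing zone abbreviation (PDT) left out.
    private static final DateTimeFormatter LOG_TIME =
        DateTimeFormatter.ofPattern("uuuu/MM/dd HH:mm:ss.SSS");

    static LocalDateTime parse(String stamp) {
        return LocalDateTime.parse(stamp, LOG_TIME);
    }

    static long millisBetween(LocalDateTime from, LocalDateTime to) {
        return Duration.between(from, to).toMillis();
    }

    public static void main(String[] args) {
        LocalDateTime warning = parse("2015/10/26 10:29:05.610");       // vm_0: 15 second warning
        LocalDateTime connectFailed = parse("2015/10/26 10:29:20.766"); // locator: IOException
        LocalDateTime replied = parse("2015/10/26 10:29:22.779");       // vm_0: wait completed

        // Assumption: the wait for replies began 15 seconds before the warning.
        LocalDateTime waitStart = warning.minusSeconds(15);

        System.out.println("connect timeout at ms: " + millisBetween(waitStart, connectFailed));
        System.out.println("reply received at ms:  " + millisBetween(waitStart, replied));
    }
}
```

Under that assumption, the locator's 30000 ms connect timeout expires about 30.2 seconds into the wait and the reply arrives about 32.2 seconds in, which matches the 30 and 32 second figures above.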

This is rather odd as the two processes were running on the same machine. Bruce, can you please take a look?

> CI failure: PartitionedRegionHAFailureAndRecoveryDUnitTest.testRecoveryOfSingleMemberFailure
> --------------------------------------------------------------------------------------------
>
>                 Key: GEODE-446
>                 URL: https://issues.apache.org/jira/browse/GEODE-446
>             Project: Geode
>          Issue Type: Bug
>            Reporter: Kirk Lund
>            Assignee: Darrel Schneider
>              Labels: CI
>         Attachments: GEODE-446-failure.log, com.gemstone.gemfire.internal.cache.PartitionedRegionHAFailureAndRecoveryDUnitTest.html
>
>
> {noformat}
> dunit.RMIException: While invoking com.gemstone.gemfire.internal.cache.PartitionedRegionHAFailureAndRecoveryDUnitTest$7.run in VM 1 running on Host angola.gemstone.com with 4 VMs
> 	at dunit.VM.invoke(VM.java:360)
> 	at dunit.VM.invoke(VM.java:303)
> 	at dunit.VM.invoke(VM.java:257)
> 	at com.gemstone.gemfire.internal.cache.PartitionedRegionHAFailureAndRecoveryDUnitTest.testRecoveryOfSingleMemberFailure(PartitionedRegionHAFailureAndRecoveryDUnitTest.java:458)
> 	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> 	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> 	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> 	at java.lang.reflect.Method.invoke(Method.java:497)
> 	at junit.framework.TestCase.runTest(TestCase.java:176)
> 	at junit.framework.TestCase.runBare(TestCase.java:141)
> 	at junit.framework.TestResult$1.protect(TestResult.java:122)
> 	at junit.framework.TestResult.runProtected(TestResult.java:142)
> 	at junit.framework.TestResult.run(TestResult.java:125)
> 	at junit.framework.TestCase.run(TestCase.java:129)
> 	at junit.framework.TestSuite.runTest(TestSuite.java:252)
> 	at junit.framework.TestSuite.run(TestSuite.java:247)
> 	at org.junit.internal.runners.JUnit38ClassRunner.run(JUnit38ClassRunner.java:86)
> 	at org.gradle.api.internal.tasks.testing.junit.JUnitTestClassExecuter.runTestClass(JUnitTestClassExecuter.java:86)
> 	at org.gradle.api.internal.tasks.testing.junit.JUnitTestClassExecuter.execute(JUnitTestClassExecuter.java:49)
> 	at org.gradle.api.internal.tasks.testing.junit.JUnitTestClassProcessor.processTestClass(JUnitTestClassProcessor.java:64)
> 	at org.gradle.api.internal.tasks.testing.SuiteTestClassProcessor.processTestClass(SuiteTestClassProcessor.java:50)
> 	at sun.reflect.GeneratedMethodAccessor177.invoke(Unknown Source)
> 	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> 	at java.lang.reflect.Method.invoke(Method.java:497)
> 	at org.gradle.messaging.dispatch.ReflectionDispatch.dispatch(ReflectionDispatch.java:35)
> 	at org.gradle.messaging.dispatch.ReflectionDispatch.dispatch(ReflectionDispatch.java:24)
> 	at org.gradle.messaging.dispatch.ContextClassLoaderDispatch.dispatch(ContextClassLoaderDispatch.java:32)
> 	at org.gradle.messaging.dispatch.ProxyDispatchAdapter$DispatchingInvocationHandler.invoke(ProxyDispatchAdapter.java:93)
> 	at com.sun.proxy.$Proxy2.processTestClass(Unknown Source)
> 	at org.gradle.api.internal.tasks.testing.worker.TestWorker.processTestClass(TestWorker.java:106)
> 	at sun.reflect.GeneratedMethodAccessor176.invoke(Unknown Source)
> 	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> 	at java.lang.reflect.Method.invoke(Method.java:497)
> 	at org.gradle.messaging.dispatch.ReflectionDispatch.dispatch(ReflectionDispatch.java:35)
> 	at org.gradle.messaging.dispatch.ReflectionDispatch.dispatch(ReflectionDispatch.java:24)
> 	at org.gradle.messaging.remote.internal.hub.MessageHub$Handler.run(MessageHub.java:360)
> 	at org.gradle.internal.concurrent.DefaultExecutorFactory$StoppableExecutorImpl$1.run(DefaultExecutorFactory.java:64)
> 	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
> 	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
> 	at java.lang.Thread.run(Thread.java:745)
> Caused by: junit.framework.AssertionFailedError: Test waited more than 120 seconds for redundancy recover
> 	at junit.framework.Assert.fail(Assert.java:57)
> 	at junit.framework.TestCase.fail(TestCase.java:227)
> 	at com.gemstone.gemfire.internal.cache.PartitionedRegionHAFailureAndRecoveryDUnitTest$7.run(PartitionedRegionHAFailureAndRecoveryDUnitTest.java:472)
> 	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> 	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> 	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> 	at java.lang.reflect.Method.invoke(Method.java:497)
> 	at hydra.MethExecutor.executeObject(MethExecutor.java:258)
> 	at dunit.standalone.RemoteDUnitVM.executeMethodOnObject(RemoteDUnitVM.java:68)
> 	at sun.reflect.GeneratedMethodAccessor57.invoke(Unknown Source)
> 	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> 	at java.lang.reflect.Method.invoke(Method.java:497)
> 	at sun.rmi.server.UnicastServerRef.dispatch(UnicastServerRef.java:323)
> 	at sun.rmi.transport.Transport$1.run(Transport.java:200)
> 	at sun.rmi.transport.Transport$1.run(Transport.java:197)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at sun.rmi.transport.Transport.serviceCall(Transport.java:196)
> 	at sun.rmi.transport.tcp.TCPTransport.handleMessages(TCPTransport.java:568)
> 	at sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.run0(TCPTransport.java:826)
> 	at sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.lambda$run$241(TCPTransport.java:683)
> 	at sun.rmi.transport.tcp.TCPTransport$ConnectionHandler$$Lambda$1/868282079.run(Unknown Source)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.run(TCPTransport.java:682)
> 	... 3 more
> {noformat}
> Previously run tests: [PartitionedRegionSingleHopDUnitTest, PartitionedRegionCacheXMLExampleDUnitTest, DeltaSizingDUnitTest, OffHeapEvictionStatsDUnitTest, Bug39079DUnitTest, PRBadToDataDUnitTest, PartitionedRegionBucketCreationDistributionDUnitTest, DeltaPropagationDUnitTest, DiskRegByteArrayDUnitTest, Bug41733DUnitTest, IteratorDUnitTest, BackupDUnitTest, ConcurrentDestroySubRegionDUnitTest, RemoteTransactionDUnitTest, EvictionObjectSizerDUnitTest, Bug42010StatsDUnitTest, PartitionedRegionInvalidateDUnitTest, PartitionedRegionHADUnitTest, CacheAdvisorDUnitTest, PartitionedRegionMultipleDUnitTest, PartitionedRegionDelayedRecoveryDUnitTest, ClientServerTransactionDUnitTest, P2PDeltaPropagationDUnitTest, PartitionedRegionLocalMaxMemoryDUnitTest, PartitionedRegionRedundancyZoneDUnitTest, PartitionedRegionHAFailureAndRecoveryDUnitTest]


