Posted to issues@ozone.apache.org by "Raju Balpande (Jira)" <ji...@apache.org> on 2024/04/17 11:10:00 UTC

[jira] [Comment Edited] (HDDS-8184) Intermittent timeout in TestContainerReplication

    [ https://issues.apache.org/jira/browse/HDDS-8184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838110#comment-17838110 ] 

Raju Balpande edited comment on HDDS-8184 at 4/17/24 11:09 AM:
---------------------------------------------------------------

As a first try, I ran this as a flaky test (10x10) and observed no failures: https://github.com/raju-balpande/apache_ozone/actions/runs/8710014704/job/23891065129. Looking into it further.


was (Author: JIRAUSER296391):
As a first try, I ran this as a flaky test (10x10) and observed no failures. Looking into it further.

> Intermittent timeout in TestContainerReplication
> ------------------------------------------------
>
>                 Key: HDDS-8184
>                 URL: https://issues.apache.org/jira/browse/HDDS-8184
>             Project: Apache Ozone
>          Issue Type: Sub-task
>    Affects Versions: 1.4.0
>            Reporter: Attila Doroszlai
>            Assignee: Raju Balpande
>            Priority: Minor
>
> {code:title=https://github.com/adoroszlai/ozone-build-results/blob/master/2023/03/15/20796/it-ozone/hadoop-ozone/integration-test/org.apache.hadoop.ozone.container.replication.TestContainerReplication.txt}
> org.apache.hadoop.ozone.container.replication.TestContainerReplication.testPush(CopyContainerCompression)[5]  Time elapsed: 30.108 s  <<< ERROR!
> java.util.concurrent.TimeoutException: 
> ...
>   at org.apache.hadoop.ozone.container.replication.TestContainerReplication.queueAndWaitForCompletion(TestContainerReplication.java:187)
>   at org.apache.hadoop.ozone.container.replication.TestContainerReplication.testPush(TestContainerReplication.java:111)
> {code}
> {code:title=https://github.com/adoroszlai/ozone-build-results/blob/master/2023/03/15/20796/it-ozone/hadoop-ozone/integration-test/org.apache.hadoop.ozone.container.replication.TestContainerReplication-output.txt}
> 2023-03-15 18:13:33,572 [ContainerReplicationThread-0] INFO  replication.PushReplicator (PushReplicator.java:replicate(58)) - Starting replication of container 1 to 2dde47c4-9545-41ac-a72b-8075b684fab1(fv-az196-962.ad143mqho3xu3jekw201s0oc5a.jx.internal.cloudapp.net/10.1.0.30) using NO_COMPRESSION
> 2023-03-15 18:13:33,583 [ContainerReplicationThread-0] WARN  replication.PushReplicator (PushReplicator.java:replicate(73)) - Container 1 replication was unsuccessful.
> org.apache.hadoop.hdds.scm.container.common.helpers.StorageContainerException: Container 1 is not found.
> 	at org.apache.hadoop.ozone.container.replication.OnDemandContainerReplicationSource.copyData(OnDemandContainerReplicationSource.java:58)
> 	at org.apache.hadoop.ozone.container.replication.PushReplicator.replicate(PushReplicator.java:67)
> 	at org.apache.hadoop.ozone.container.replication.MeasuredReplicator.replicate(MeasuredReplicator.java:83)
> 	at org.apache.hadoop.ozone.container.replication.ReplicationTask.runTask(ReplicationTask.java:122)
> 	at org.apache.hadoop.ozone.container.replication.ReplicationSupervisor$TaskRunner.run(ReplicationSupervisor.java:215)
> 	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
> 	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
> 	at java.lang.Thread.run(Thread.java:750)
> 2023-03-15 18:13:33,589 [ContainerReplicationThread-0] INFO  replication.GrpcOutputStream (GrpcOutputStream.java:close(111)) - Sent 0 bytes for container 1
> 2023-03-15 18:13:33,590 [grpc-default-executor-4] WARN  replication.SendContainerRequestHandler (SendContainerRequestHandler.java:onCompleted(104)) - Received container without any parts
> ...
> 2023-03-15 18:13:38,590 [ContainerReplicationThread-0] WARN  replication.ReplicationSupervisor (ReplicationSupervisor.java:run(217)) - Failed FAILED replicateContainerCommand: containerId=1, replicaIndex=0, targetNode=2dde47c4-9545-41ac-a72b-8075b684fab1(fv-az196-962.ad143mqho3xu3jekw201s0oc5a.jx.internal.cloudapp.net/10.1.0.30), priority=NORMAL
> {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)