Posted to issues@ozone.apache.org by "Raju Balpande (Jira)" <ji...@apache.org> on 2024/04/17 11:10:00 UTC
[jira] [Comment Edited] (HDDS-8184) Intermittent timeout in TestContainerReplication
[ https://issues.apache.org/jira/browse/HDDS-8184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838110#comment-17838110 ]
Raju Balpande edited comment on HDDS-8184 at 4/17/24 11:09 AM:
---------------------------------------------------------------
As a first try, I ran this as a flaky test for 10x10 and observed no failures (https://github.com/raju-balpande/apache_ozone/actions/runs/8710014704/job/23891065129). Looking into it further.
was (Author: JIRAUSER296391):
As a first try, I ran this as a flaky test for 10x10 and observed no failures. Looking into it further.
> Intermittent timeout in TestContainerReplication
> ------------------------------------------------
>
> Key: HDDS-8184
> URL: https://issues.apache.org/jira/browse/HDDS-8184
> Project: Apache Ozone
> Issue Type: Sub-task
> Affects Versions: 1.4.0
> Reporter: Attila Doroszlai
> Assignee: Raju Balpande
> Priority: Minor
>
> {code:title=https://github.com/adoroszlai/ozone-build-results/blob/master/2023/03/15/20796/it-ozone/hadoop-ozone/integration-test/org.apache.hadoop.ozone.container.replication.TestContainerReplication.txt}
> org.apache.hadoop.ozone.container.replication.TestContainerReplication.testPush(CopyContainerCompression)[5] Time elapsed: 30.108 s <<< ERROR!
> java.util.concurrent.TimeoutException:
> ...
> at org.apache.hadoop.ozone.container.replication.TestContainerReplication.queueAndWaitForCompletion(TestContainerReplication.java:187)
> at org.apache.hadoop.ozone.container.replication.TestContainerReplication.testPush(TestContainerReplication.java:111)
> {code}
> {code:title=https://github.com/adoroszlai/ozone-build-results/blob/master/2023/03/15/20796/it-ozone/hadoop-ozone/integration-test/org.apache.hadoop.ozone.container.replication.TestContainerReplication-output.txt}
> 2023-03-15 18:13:33,572 [ContainerReplicationThread-0] INFO replication.PushReplicator (PushReplicator.java:replicate(58)) - Starting replication of container 1 to 2dde47c4-9545-41ac-a72b-8075b684fab1(fv-az196-962.ad143mqho3xu3jekw201s0oc5a.jx.internal.cloudapp.net/10.1.0.30) using NO_COMPRESSION
> 2023-03-15 18:13:33,583 [ContainerReplicationThread-0] WARN replication.PushReplicator (PushReplicator.java:replicate(73)) - Container 1 replication was unsuccessful.
> org.apache.hadoop.hdds.scm.container.common.helpers.StorageContainerException: Container 1 is not found.
> at org.apache.hadoop.ozone.container.replication.OnDemandContainerReplicationSource.copyData(OnDemandContainerReplicationSource.java:58)
> at org.apache.hadoop.ozone.container.replication.PushReplicator.replicate(PushReplicator.java:67)
> at org.apache.hadoop.ozone.container.replication.MeasuredReplicator.replicate(MeasuredReplicator.java:83)
> at org.apache.hadoop.ozone.container.replication.ReplicationTask.runTask(ReplicationTask.java:122)
> at org.apache.hadoop.ozone.container.replication.ReplicationSupervisor$TaskRunner.run(ReplicationSupervisor.java:215)
> at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
> at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
> at java.lang.Thread.run(Thread.java:750)
> 2023-03-15 18:13:33,589 [ContainerReplicationThread-0] INFO replication.GrpcOutputStream (GrpcOutputStream.java:close(111)) - Sent 0 bytes for container 1
> 2023-03-15 18:13:33,590 [grpc-default-executor-4] WARN replication.SendContainerRequestHandler (SendContainerRequestHandler.java:onCompleted(104)) - Received container without any parts
> ...
> 2023-03-15 18:13:38,590 [ContainerReplicationThread-0] WARN replication.ReplicationSupervisor (ReplicationSupervisor.java:run(217)) - Failed FAILED replicateContainerCommand: containerId=1, replicaIndex=0, targetNode=2dde47c4-9545-41ac-a72b-8075b684fab1(fv-az196-962.ad143mqho3xu3jekw201s0oc5a.jx.internal.cloudapp.net/10.1.0.30), priority=NORMAL
> {code}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)