You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@ozone.apache.org by "István Fajth (Jira)" <ji...@apache.org> on 2023/07/15 00:14:00 UTC

[jira] [Commented] (HDDS-9013) Intermittent timeout waiting for scm4 role

    [ https://issues.apache.org/jira/browse/HDDS-9013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17743324#comment-17743324 ] 

István Fajth commented on HDDS-9013:
------------------------------------

Looking at the logs, it seems that the initialiaztion of SCM4 in this case happened in parallel with a rootCA rotation, and the newly added SCM4 was initialized with a certificate for which the trust anchor is rootCA-3, while SCM2 the leader at that point in time started to use a certificate signed by rootCA-4, hence SCM4 was not able to establish trust and the SSL handshake problem is hidden behind a Ratis NOPROGRESS exception.

This is a valid issue with a very low chance of happening in production, and is happening in tests sometimes due to the short certificate lifetimes in the tests.

We will need to check into this and see how we can solve the problem, as I do not have anything trivial that can fix this.

> Intermittent timeout waiting for scm4 role
> ------------------------------------------
>
>                 Key: HDDS-9013
>                 URL: https://issues.apache.org/jira/browse/HDDS-9013
>             Project: Apache Ozone
>          Issue Type: Sub-task
>    Affects Versions: 1.4.0
>            Reporter: Attila Doroszlai
>            Assignee: István Fajth
>            Priority: Major
>
> {code:title=https://github.com/adoroszlai/ozone-build-results/blob/master/2023/07/13/24228/acceptance-HA-secure/output.log}
> Port 9894 is available on scm4.org
> ...
> ozone admin scm roles | grep scm4.org hasn't succeed yet
> Timed out waiting on ozone admin scm roles | grep scm4.org to be successful
> ...
> ERROR: Test execution of ozonesecure-ha/test-root-ca-rotation.sh is FAILED!!!!
> {code}
> CC [~Sammi], [~sgal]



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@ozone.apache.org
For additional commands, e-mail: issues-help@ozone.apache.org