You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@ozone.apache.org by GitBox <gi...@apache.org> on 2021/07/26 10:29:31 UTC

[GitHub] [ozone] adoroszlai opened a new pull request #2461: HDDS-5494. Reduce retry in Kubernetes test

adoroszlai opened a new pull request #2461:
URL: https://github.com/apache/ozone/pull/2461


   ## What changes were proposed in this pull request?
   
   _kubernetes_ tests wait for cluster startup, checking some conditions with retry.  In worst case all conditions are checked 100 times with 3 seconds delay, so the test may take 15 minutes to fail.
   
   Skip waiting for SCM and OM readiness if retries for previous conditions are exhausted.
   
   https://issues.apache.org/jira/browse/HDDS-5494
   
   ## How was this patch tested?
   
   Currently the test in `ozone` env. fails to start the cluster, so the change is verified by the failing CI check:
   
   ```
   ...
   99 'all_pods_are_running' is failed...
   4 pods are running out from the 5
   100 'all_pods_are_running' is failed...
   
   **** Executing robot tests scm-0 ****
   
   ...
   ```
   
   https://github.com/adoroszlai/hadoop-ozone/runs/3159798487#step:6:797
   
   The happy path is verified by successful startup in `getting-started` env.:
   
   ```
   ...
   -1 pods are running. Waiting for more.
   12 'all_pods_are_running' is failed...
   5 pods are running out from the 6
   13 'all_pods_are_running' is failed...
   1 'grep_log scm-0 SCM exiting safe mode.' is failed...
   2 'grep_log scm-0 SCM exiting safe mode.' is failed...
   3 'grep_log scm-0 SCM exiting safe mode.' is failed...
   4 'grep_log scm-0 SCM exiting safe mode.' is failed...
   5 'grep_log scm-0 SCM exiting safe mode.' is failed...
   6 'grep_log scm-0 SCM exiting safe mode.' is failed...
   7 'grep_log scm-0 SCM exiting safe mode.' is failed...
   8 'grep_log scm-0 SCM exiting safe mode.' is failed...
   2021-07-26 09:01:01 INFO  SCMSafeModeManager:248 - SCM exiting safe mode.
   2021-07-26 09:01:02 INFO  BaseHttpServer:329 - HTTP server of ozoneManager listening at http://0.0.0.0:9874
   
   **** Cluster is up and running ****
   
   ...
   ```
   
   https://github.com/adoroszlai/hadoop-ozone/runs/3159798487#step:6:172


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@ozone.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@ozone.apache.org
For additional commands, e-mail: issues-help@ozone.apache.org


[GitHub] [ozone] GeorgeJahad commented on pull request #2461: HDDS-5494. Reduce retry in Kubernetes test

Posted by GitBox <gi...@apache.org>.
GeorgeJahad commented on pull request #2461:
URL: https://github.com/apache/ozone/pull/2461#issuecomment-887854240


   This looks like a useful fix to me. (I ran into the same problem working on this kubernetes PR: https://github.com/apache/ozone/pull/2464)
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@ozone.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@ozone.apache.org
For additional commands, e-mail: issues-help@ozone.apache.org


[GitHub] [ozone] adoroszlai merged pull request #2461: HDDS-5494. Reduce retry in Kubernetes test

Posted by GitBox <gi...@apache.org>.
adoroszlai merged pull request #2461:
URL: https://github.com/apache/ozone/pull/2461


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@ozone.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@ozone.apache.org
For additional commands, e-mail: issues-help@ozone.apache.org


[GitHub] [ozone] adoroszlai commented on pull request #2461: HDDS-5494. Reduce retry in Kubernetes test

Posted by GitBox <gi...@apache.org>.
adoroszlai commented on pull request #2461:
URL: https://github.com/apache/ozone/pull/2461#issuecomment-888025483


   Thanks @GeorgeJahad and @smengcl for the review.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@ozone.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@ozone.apache.org
For additional commands, e-mail: issues-help@ozone.apache.org