You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@yunikorn.apache.org by "Kinga Marton (Jira)" <ji...@apache.org> on 2020/11/21 11:25:00 UTC

[jira] [Created] (YUNIKORN-465) scheduler health check REST API

Kinga Marton created YUNIKORN-465:
-------------------------------------

             Summary: scheduler health check REST API
                 Key: YUNIKORN-465
                 URL: https://issues.apache.org/jira/browse/YUNIKORN-465
             Project: Apache YuniKorn
          Issue Type: Bug
            Reporter: Kinga Marton
            Assignee: Kinga Marton


We need to build a health check REST API for the scheduler
This is needed for chaos monkey tests, the validation script can call the API to verify the scheduler state periodically
We should leverage scheduler metrics to do the validation, things to validate like:
 # Negative resources on node/app/cluster
 # Consistency of the data, e.g sum of allocated resource of apps = allocated resource in the partition
 # critical errors logged in the metrics (things should not happen but happened)
 # ...



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@yunikorn.apache.org
For additional commands, e-mail: dev-help@yunikorn.apache.org