You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@yunikorn.apache.org by "Peter Bacsko (Jira)" <ji...@apache.org> on 2023/03/11 09:35:00 UTC

[jira] [Created] (YUNIKORN-1628) YuniKorn application traceability

Peter Bacsko created YUNIKORN-1628:
--------------------------------------

             Summary: YuniKorn application traceability
                 Key: YUNIKORN-1628
                 URL: https://issues.apache.org/jira/browse/YUNIKORN-1628
             Project: Apache YuniKorn
          Issue Type: New Feature
          Components: core - scheduler
            Reporter: Peter Bacsko


The current implementation of YuniKorn is focused on the application and the states of the application. K8s does not and cannot provide details on what happens inside the application. This limits what we can offer at a YuniKorn level for applications.

To increase supportability, we need to understand what happens inside the core scheduler and how we got into a certain state.

We want to record a stream of events in memory when something relevant happens which is related to the application or nodes:
* Partition changed (nodes added / removed, capacity changed, etc.)
* Application created / removed
* An ask is created / removed
* An allocation is created / removed
* Reservation occurs
* Placeholder is replaced
* etc.

The recorded events should be queried from the REST interface.

The number of stored events can be limited by two settings: maximum number of events or expiration time (eg. events from the past 5 minutes).




--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@yunikorn.apache.org
For additional commands, e-mail: dev-help@yunikorn.apache.org