Posted to issues@yunikorn.apache.org by "Chenya Zhang (Jira)" <ji...@apache.org> on 2021/05/25 19:29:00 UTC

[jira] [Commented] (YUNIKORN-647) Add new metrics to monitor pending applications: "long_pending_app"

    [ https://issues.apache.org/jira/browse/YUNIKORN-647?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17351318#comment-17351318 ] 

Chenya Zhang commented on YUNIKORN-647:
---------------------------------------

Discussed with [~yuchaoran2011]. From what we have observed over the past two weeks, a long-pending application is no longer a good indicator of scheduler failure. Closing this ticket for now.

> Add new metrics to monitor pending applications: "long_pending_app"
> -------------------------------------------------------------------
>
>                 Key: YUNIKORN-647
>                 URL: https://issues.apache.org/jira/browse/YUNIKORN-647
>             Project: Apache YuniKorn
>          Issue Type: Sub-task
>          Components: core - common
>            Reporter: Chenya Zhang
>            Assignee: Chenya Zhang
>            Priority: Major
>
> Based on our observation, if any application stays pending for longer than a threshold (e.g. 10 minutes), the scheduler is likely down.
> We would like to capture this in a metric for more timely alerting.
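
For illustration only, here is a minimal sketch of how the proposed "long_pending_app" value could be computed. It assumes the scheduler can report when each application entered the pending state. The function name, the data layout, and the 10-minute default are hypothetical and are not taken from the YuniKorn code base.

```python
from datetime import datetime, timedelta

# Hypothetical threshold, taken from the "e.g. 10 minutes" in the description.
PENDING_THRESHOLD = timedelta(minutes=10)


def count_long_pending(pending_since, now, threshold=PENDING_THRESHOLD):
    """Count applications that have been pending for longer than threshold.

    pending_since maps an application ID to the time at which that
    application entered the pending state (an assumed data layout).
    """
    return sum(1 for since in pending_since.values() if now - since > threshold)


# Example data: one application pending for 12 minutes, one for 3 minutes.
now = datetime(2021, 5, 25, 19, 29)
pending_since = {
    "app-1": now - timedelta(minutes=12),
    "app-2": now - timedelta(minutes=3),
}

# Value that a "long_pending_app" gauge would report under these assumptions.
long_pending_app = count_long_pending(pending_since, now)
```

An alert would fire whenever this value is greater than zero. As the comment above notes, such an alert turned out not to be a reliable signal of scheduler failure.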


