Posted to dev@yunikorn.apache.org by "Weiwei Yang (Jira)" <ji...@apache.org> on 2022/02/07 22:15:00 UTC

[jira] [Created] (YUNIKORN-1070) Potential scheduler memory leak

Weiwei Yang created YUNIKORN-1070:
-------------------------------------

             Summary: Potential scheduler memory leak
                 Key: YUNIKORN-1070
                 URL: https://issues.apache.org/jira/browse/YUNIKORN-1070
             Project: Apache YuniKorn
          Issue Type: Bug
            Reporter: Weiwei Yang


Ben reported this on Slack. He runs 0.12.2 on EKS, and the scheduler hits periodic OOM conditions after a few days. His report:

"Currently, the scheduler is configured with 10GB of memory and eventually always runs out of memory. In my environment, I have a lot of nodes coming in and out of the cluster due to autoscaling. I am wondering whether this could be the reason, or whether you have any other ideas. Let me know what kind of troubleshooting information would be useful here; there is just a continuous growth of memory consumption that ends with OOMKilled."



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@yunikorn.apache.org
For additional commands, e-mail: dev-help@yunikorn.apache.org