Posted to dev@yunikorn.apache.org by "Kinga Marton (Jira)" <ji...@apache.org> on 2021/02/05 20:09:00 UTC

[jira] [Closed] (YUNIKORN-540) Possible deadlock when recovering or deleting an allocation ask

     [ https://issues.apache.org/jira/browse/YUNIKORN-540?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Kinga Marton closed YUNIKORN-540.
---------------------------------
    Resolution: Not A Bug

There is no deadlock in production.

The deadlock I observed was caused by a watch I had set on sa.GetAllocatedResource while debugging.
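
A minimal sketch of how a watch on a getter can hang a debug session. It assumes the getter takes a read lock on the same mutex that the paused method already holds for writing. The type app and the methods GetAllocated, RecoverAsk and watchWouldBlock are hypothetical stand-ins, not the YuniKorn code; TryRLock (Go 1.18+) is used so the demo reports the blocking condition instead of hanging.

```go
package main

import (
	"fmt"
	"sync"
)

// app is a hypothetical stand-in for a scheduler application object.
type app struct {
	sync.RWMutex
	allocated int
}

// GetAllocated mimics a getter that takes the read lock (an assumption
// about how such a getter is typically written).
func (a *app) GetAllocated() int {
	a.RLock()
	defer a.RUnlock()
	return a.allocated
}

// watchWouldBlock reports whether calling the getter right now would block.
// It uses TryRLock so the demo never hangs.
func (a *app) watchWouldBlock() bool {
	if a.TryRLock() {
		a.RUnlock()
		return false
	}
	return true
}

// RecoverAsk mimics a method that holds the write lock while it runs.
// probe is called at the point where a breakpoint would pause execution.
func (a *app) RecoverAsk(probe func()) {
	a.Lock()
	defer a.Unlock()
	a.allocated++
	probe()
}

func main() {
	a := &app{}
	a.RecoverAsk(func() {
		// A watch evaluated here would call the getter while the
		// write lock is held, so it could never acquire the read lock.
		fmt.Println("inside lock, getter would block:", a.watchWouldBlock())
	})
	fmt.Println("outside lock, getter would block:", a.watchWouldBlock())
	fmt.Println("allocated:", a.GetAllocated())
}
```

In normal operation nothing calls the getter from inside the locked section, which would be consistent with the hang only appearing in debug mode.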

> Possible deadlock when recovering or deleting an allocation ask
> ---------------------------------------------------------------
>
>                 Key: YUNIKORN-540
>                 URL: https://issues.apache.org/jira/browse/YUNIKORN-540
>             Project: Apache YuniKorn
>          Issue Type: Bug
>            Reporter: Kinga Marton
>            Priority: Critical
>         Attachments: stacktrace.txt
>
>
> Steps to reproduce the deadlock locally during recovery:
>  # modify the sleep example to have a bigger sleep time (for example 300s), to make sure that the pods are still running after recovery
>  # once the pods are running, stop the scheduler
>  # start the scheduler in debug mode and add a breakpoint in the application#RecoverAllocationAsk(ask *AllocationAsk) method: [https://github.com/apache/incubator-yunikorn-core/blob/master/pkg/scheduler/objects/application.go#L400]. I think the breakpoint is needed to make this run a little slower than usual. I tried to reproduce it in normal running mode by adding a sleep, but could not; it only showed up in debug mode. Also, when I commented out the lock, the deadlock disappeared.
>  # once the program stops at the breakpoint, let it continue
>  # after this step it will hang until the node recovery times out


