You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@yunikorn.apache.org by "Brandon Grams (Jira)" <ji...@apache.org> on 2023/02/03 03:32:00 UTC

[jira] [Created] (YUNIKORN-1555) Completed Spark applications are recovered/remain as New

Brandon Grams created YUNIKORN-1555:
---------------------------------------

             Summary: Completed Spark applications are recovered/remain as New
                 Key: YUNIKORN-1555
                 URL: https://issues.apache.org/jira/browse/YUNIKORN-1555
             Project: Apache YuniKorn
          Issue Type: Bug
          Components: shim - kubernetes
    Affects Versions: 1.1.0, 1.2.0
            Reporter: Brandon Grams
         Attachments: CEEB85D7-D0D5-4D3D-8C0A-730CBA48B150.jpeg

The k8s Spark plugin is [documented|https://github.com/apache/yunikorn-k8shim/blob/master/pkg/appmgmt/sparkoperator/spark.go#L33] as implementing the Recoverable interface, however it does not. This leads to the following recovery outcome when driver pods in a terminal state remain in the cluster pending garbage collection:

!CEEB85D7-D0D5-4D3D-8C0A-730CBA48B150.jpeg|width=1063,height=157!

Instead, we should implement the Recoverable interface so that completed applications have their proper state propagated in such a scenario.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@yunikorn.apache.org
For additional commands, e-mail: dev-help@yunikorn.apache.org