You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Shiqi Sun (Jira)" <ji...@apache.org> on 2023/02/11 02:42:00 UTC
[jira] [Created] (SPARK-42404) Spark driver pod should not create executor pods when there is no driver service
Shiqi Sun created SPARK-42404:
---------------------------------
Summary: Spark driver pod should not create executor pods when there is no driver service
Key: SPARK-42404
URL: https://issues.apache.org/jira/browse/SPARK-42404
Project: Spark
Issue Type: Improvement
Components: Kubernetes
Affects Versions: 3.3.1
Reporter: Shiqi Sun
Currently, the driver pod assumes the driver headless service exists when creating the executor pods. However, when this assumption doesn't hold, the driver would still spin up executor pods, and the executor pods would fail, and then the driver would try to create more pods, and so on. With this, the spark job doesn't make any progress, while it eats a lot of computational resource, and it won't reach to a terminal state until manual intervention (e.g. deleting the job or recreate the driver service).
This Jira Issue is to address this problem, by having the driver check the driver service before creating the executor pods.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org