You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Jagadeeswara Rao (Jira)" <ji...@apache.org> on 2023/02/16 19:03:00 UTC

[jira] [Created] (SPARK-42466) spark.kubernetes.file.upload.path not deleting files under HDFS after job completes

Jagadeeswara Rao created SPARK-42466:
----------------------------------------

             Summary: spark.kubernetes.file.upload.path not deleting files under HDFS after job completes
                 Key: SPARK-42466
                 URL: https://issues.apache.org/jira/browse/SPARK-42466
             Project: Spark
          Issue Type: Bug
          Components: Kubernetes
    Affects Versions: 3.2.0
            Reporter: Jagadeeswara Rao


In cluster mode after uploading files to HDFS location using spark.kubernetes.file.upload.path property files are not getting cleared . 

File is successfully uploaded to hdfs location in this format spark-upload-[randomUUID] using {{KubernetesUtils}} is requested to [ https://github.com/apache/spark/blob/76a134ade60a9f354aca01eaca0b2e2477c6bd43/resource-managers/kubernetes/core/src/main/scala/org/apache/spark/deploy/k8s/KubernetesUtils.scala#L310|https://jaceklaskowski.github.io/spark-kubernetes-book/KubernetesUtils/#uploadFileUri]

spark driver is completed successfully and shutdownhook is not cleared the hdfs files.

 

 
{code:java}
23/02/16 18:06:56 INFO KubernetesClusterSchedulerBackend: Shutting down all executors
23/02/16 18:06:56 INFO KubernetesClusterSchedulerBackend$KubernetesDriverEndpoint: Asking each executor to shut down
23/02/16 18:06:56 WARN ExecutorPodsWatchSnapshotSource: Kubernetes client has been closed.
23/02/16 18:06:57 INFO MapOutputTrackerMasterEndpoint: MapOutputTrackerMasterEndpoint stopped!
23/02/16 18:06:57 INFO MemoryStore: MemoryStore cleared
23/02/16 18:06:57 INFO BlockManager: BlockManager stopped
23/02/16 18:06:57 INFO BlockManagerMaster: BlockManagerMaster stopped
23/02/16 18:06:57 INFO OutputCommitCoordinator$OutputCommitCoordinatorEndpoint: OutputCommitCoordinator stopped!
23/02/16 18:06:57 INFO SparkContext: Successfully stopped SparkContext
23/02/16 18:06:57 INFO ShutdownHookManager: Shutdown hook called
23/02/16 18:06:57 INFO ShutdownHookManager: Deleting directory /tmp/spark-efb8f725-4ead-4729-a8e0-f478280121b7
23/02/16 18:06:57 INFO ShutdownHookManager: Deleting directory /spark-local2/spark-66dbf7e6-fe7e-4655-8724-69d76d93fc1f
23/02/16 18:06:57 INFO ShutdownHookManager: Deleting directory /spark-local1/spark-53aefaee-58a5-4fce-b5b0-5e29f42e337f{code}
 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org