You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@flink.apache.org by "Till Rohrmann (Jira)" <ji...@apache.org> on 2021/12/23 16:13:00 UTC
[jira] [Created] (FLINK-25436) Allow BlobServer/BlobCache to clean up unused blobs after recovering from working directory
Till Rohrmann created FLINK-25436:
-------------------------------------
Summary: Allow BlobServer/BlobCache to clean up unused blobs after recovering from working directory
Key: FLINK-25436
URL: https://issues.apache.org/jira/browse/FLINK-25436
Project: Flink
Issue Type: Sub-task
Components: Runtime / Coordination
Affects Versions: 1.15.0
Reporter: Till Rohrmann
Assignee: Till Rohrmann
Fix For: 1.15.0
In order to let the {{BlobServer}} and the {{BlobCache}} properly clean up unused blobs that are recovered from the working directory, we have to register them for clean up and offer hooks to delete irrelevant job artifacts.
I propose to scan the blobStorage directory at startup and to register for transient blobs the expiry timeouts. Moreover, for the {{BlobServer}} we need to add a {{retainJobs}} method that deletes all jobs that are not in the given list of {{JobIDs}}. Last but not least we also need to register the permanent blobs in the {{PermanentBlobCacheService}} so that they get timed out if not used anymore.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)