You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@flink.apache.org by "Till Rohrmann (JIRA)" <ji...@apache.org> on 2017/05/10 12:08:05 UTC
[jira] [Created] (FLINK-6525) Transferred TM log/stdout files are
never removed from BlobStore
Till Rohrmann created FLINK-6525:
------------------------------------
Summary: Transferred TM log/stdout files are never removed from BlobStore
Key: FLINK-6525
URL: https://issues.apache.org/jira/browse/FLINK-6525
Project: Flink
Issue Type: Bug
Components: Distributed Coordination
Affects Versions: 1.3.0, 1.4.0
Reporter: Till Rohrmann
The {{TaskManager}} uses the {{BlobClient}} to upload its stdout/log file to the {{BlobServer}}. If HA mode is enabled, then these files will also be uploaded to the {{BlobStore}}. Since the {{TaskManagerLogHandler}} only cleans up files from a TM in case it has already received another file from this TM and additionally does this in a non thread safe manner, it can easily happen that files won't get cleaned up from the {{BlobStore}}.
I think we should not upload these kind of files to the persistent/HA {{BlobStore}}. We could do this by introducing a storage mode when uploading files to the {{BlobServer}} (e.g. {{HA_STORAGE}} vs. {{LOCAL_STORAGE}}). Additionally, we should also register a timeout for only locally stored files or at least store them under its {{JobID}} such that these files are also cleaned up once the job is being cleaned up.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)