You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@flink.apache.org by "Till Rohrmann (JIRA)" <ji...@apache.org> on 2017/05/10 12:08:05 UTC

[jira] [Created] (FLINK-6525) Transferred TM log/stdout files are never removed from BlobStore

Till Rohrmann created FLINK-6525:
------------------------------------

             Summary: Transferred TM log/stdout files are never removed from BlobStore
                 Key: FLINK-6525
                 URL: https://issues.apache.org/jira/browse/FLINK-6525
             Project: Flink
          Issue Type: Bug
          Components: Distributed Coordination
    Affects Versions: 1.3.0, 1.4.0
            Reporter: Till Rohrmann


The {{TaskManager}} uses the {{BlobClient}} to upload its stdout/log file to the {{BlobServer}}. If HA mode is enabled, then these files will also be uploaded to the {{BlobStore}}. Since the {{TaskManagerLogHandler}} only cleans up files from a TM in case it has already received another file from this TM and additionally does this in a non thread safe manner, it can easily happen that files won't get cleaned up from the {{BlobStore}}.

I think we should not upload these kind of files to the persistent/HA {{BlobStore}}. We could do this by introducing a storage mode when uploading files to the {{BlobServer}} (e.g. {{HA_STORAGE}} vs. {{LOCAL_STORAGE}}). Additionally, we should also register a timeout for only locally stored files or at least store them under its {{JobID}} such that these files are also cleaned up once the job is being cleaned up.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)