Posted to yarn-issues@hadoop.apache.org by "Maciej Smoleński (Jira)" <ji...@apache.org> on 2022/12/16 12:12:00 UTC

[jira] [Created] (YARN-11397) Memory leak when reading aggregated logs from s3 (LogAggregationTFileController::readAggregatedLogs)

Maciej Smoleński created YARN-11397:
----------------------------------------

             Summary: Memory leak when reading aggregated logs from s3 (LogAggregationTFileController::readAggregatedLogs)
                 Key: YARN-11397
                 URL: https://issues.apache.org/jira/browse/YARN-11397
             Project: Hadoop YARN
          Issue Type: Bug
          Components: log-aggregation
    Affects Versions: 3.2.2
         Environment: Remote logs dir on s3.
            Reporter: Maciej Smoleński
         Attachments: YarnLogsS3Issue.scala

Reproduction code in the attachment.

When collecting aggregated logs from S3 in a loop (see reproduction code), we can easily see that the number of 'S3AInstrumentation' instances keeps increasing while the number of 'S3AFileSystem' instances does not. This means that 'S3AInstrumentation' is not released together with its 'S3AFileSystem', as it should be. The root cause seems to be a missing close() call on the S3AFileSystem.
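
To make the suspected mechanism concrete, here is a minimal, self-contained sketch. It does not use the real Hadoop classes: 'ToyFileSystem', 'Instrumentation' and 'REGISTRY' are hypothetical stand-ins, and the sketch assumes that the instrumentation object is held by some long-lived, process-wide registry until close() is called on the owning file system. Under that assumption, an unclosed file system can itself be garbage collected while its instrumentation stays reachable, which would match the instance counts described above.

```java
import java.io.Closeable;
import java.util.ArrayList;
import java.util.List;

public class LeakSketch {
    // Hypothetical stand-in for a long-lived, process-wide registry.
    // Anything still referenced from here cannot be garbage collected.
    static final List<Object> REGISTRY = new ArrayList<>();

    // Hypothetical stand-in for S3AInstrumentation (assumption: it registers
    // itself on creation and is only deregistered on close()).
    static final class Instrumentation implements Closeable {
        Instrumentation() { REGISTRY.add(this); }
        @Override public void close() { REGISTRY.remove(this); }
    }

    // Hypothetical stand-in for S3AFileSystem: owns one Instrumentation
    // and releases it only when close() is called.
    static final class ToyFileSystem implements Closeable {
        private final Instrumentation instrumentation = new Instrumentation();
        String read(String path) { return "contents of " + path; }
        @Override public void close() { instrumentation.close(); }
    }

    // Reads in a loop and returns how many Instrumentation objects are
    // still registered (i.e. leaked) once the loop has finished.
    public static int readInLoop(int iterations, boolean closeFs) {
        REGISTRY.clear();
        for (int i = 0; i < iterations; i++) {
            if (closeFs) {
                // try-with-resources guarantees close(), even on exceptions.
                try (ToyFileSystem fs = new ToyFileSystem()) {
                    fs.read("logs/app_" + i);
                }
            } else {
                // No close(): the file system becomes unreachable, but its
                // Instrumentation is still referenced from REGISTRY.
                ToyFileSystem fs = new ToyFileSystem();
                fs.read("logs/app_" + i);
            }
        }
        return REGISTRY.size();
    }

    public static void main(String[] args) {
        System.out.println("without close: " + readInLoop(1000, false));
        System.out.println("with close:    " + readInLoop(1000, true));
    }
}
```

In this toy model the count of leaked objects equals the number of iterations when close() is skipped and drops to zero when the file system is closed. Whether closing the file system inside readAggregatedLogs is the right fix for the real code is not confirmed here.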

 

Related issues:
https://issues.apache.org/jira/browse/YARN-11039
https://issues.apache.org/jira/browse/HADOOP-18476
The scenario of this bug is different ('thread leak' in those issues vs. 'memory leak' here) and the affected versions differ (3.2.2 in our case); however, the root cause might be the same.



