Posted to yarn-dev@hadoop.apache.org by "Shuyi Zhang (Jira)" <ji...@apache.org> on 2023/01/04 05:37:00 UTC
[jira] [Created] (YARN-11407) short-live job log aggregation blocked on LRA
Shuyi Zhang created YARN-11407:
----------------------------------
Summary: short-live job log aggregation blocked on LRA
Key: YARN-11407
URL: https://issues.apache.org/jira/browse/YARN-11407
Project: Hadoop YARN
Issue Type: Improvement
Reporter: Shuyi Zhang
Currently, log aggregation threads are scheduled based on application start time. As a result, an application that starts later but finishes earlier is blocked on log aggregation until an earlier, still-running job finishes and releases a log aggregation thread for it to use. To resolve this bottleneck, we can decouple the initialization of the AppLogAggregator from its thread scheduling:
1. Initialize an AppLogAggregator on application start.
2. Schedule it in the thread pool only on application finish.
This schedules log aggregation threads based on application finish time instead of start time, which prevents a pileup of jobs waiting for log aggregation, especially when they would otherwise wait for long-running jobs to finish.
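
The proposal above can be illustrated with a minimal, self-contained Java sketch. It is not the NodeManager code: DeferredAggregationSketch, SketchAggregator, onApplicationStart and onApplicationFinish are hypothetical names invented for this example, and the "upload" is reduced to recording the application id. It assumes a pool with a single thread, to show that a short-lived application is aggregated without waiting for a long-running one that started earlier.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

// Hypothetical illustration only; none of these types are real YARN classes.
class DeferredAggregationSketch {

    /** Stand-in for an AppLogAggregator: "aggregating" just records the app id. */
    static final class SketchAggregator implements Runnable {
        private final String appId;
        private final List<String> uploaded;

        SketchAggregator(String appId, List<String> uploaded) {
            this.appId = appId;
            this.uploaded = uploaded;
        }

        @Override
        public void run() {
            uploaded.add(appId);
        }
    }

    private final ExecutorService pool;
    // Aggregators that are initialized but not yet occupying a pool thread.
    private final Map<String, SketchAggregator> initialized = new ConcurrentHashMap<>();
    private final List<String> uploaded = Collections.synchronizedList(new ArrayList<>());

    DeferredAggregationSketch(int threads) {
        this.pool = Executors.newFixedThreadPool(threads);
    }

    /** Step 1: initialize on application start, without taking a thread. */
    void onApplicationStart(String appId) {
        initialized.put(appId, new SketchAggregator(appId, uploaded));
    }

    /** Step 2: hand the aggregator to the thread pool only on application finish. */
    void onApplicationFinish(String appId) {
        SketchAggregator aggregator = initialized.remove(appId);
        if (aggregator != null) {
            pool.execute(aggregator);
        }
    }

    /** Waits for submitted aggregators and returns the order in which they ran. */
    List<String> shutdownAndGetUploadOrder() {
        pool.shutdown();
        try {
            pool.awaitTermination(5, TimeUnit.SECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
        return new ArrayList<>(uploaded);
    }

    /** The long-running app starts first; the short-lived app finishes first. */
    static List<String> demo() {
        DeferredAggregationSketch sketch = new DeferredAggregationSketch(1);
        sketch.onApplicationStart("long-running-app");
        sketch.onApplicationStart("short-lived-app");
        sketch.onApplicationFinish("short-lived-app");
        sketch.onApplicationFinish("long-running-app");
        return sketch.shutdownAndGetUploadOrder();
    }

    public static void main(String[] args) {
        System.out.println(demo());
    }
}
```

In this sketch the single pool thread is never held by an application that is still running, so aggregation order follows finish order. A real change would also have to handle whatever the aggregator does while the application is still running, which this sketch leaves out.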
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-dev-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-dev-help@hadoop.apache.org