You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-dev@hadoop.apache.org by "Deegue (Jira)" <ji...@apache.org> on 2022/03/23 14:08:00 UTC

[jira] [Created] (YARN-11095) [Umbrella] Node load based scheduler

Deegue created YARN-11095:
-----------------------------

             Summary: [Umbrella] Node load based scheduler
                 Key: YARN-11095
                 URL: https://issues.apache.org/jira/browse/YARN-11095
             Project: Hadoop YARN
          Issue Type: Improvement
            Reporter: Deegue


Node load based scheduler is quite effective for cluster stability, epecially when we deploy NodeManager, DataNode and use Auxservices like mapreduce shuffle or spark shuffle.

We can set up threshold and auto skip the nodes with high load when scheduling.
Node load should mainly focus on CPU, Memory and DiskIO.

Keeping CPU and Memory under a healthy threshold makes container and task time more stable, reduces the possibility of OOM kill by OS. As for DiskIO, 
high disk load will more likely cause slow DataNode and fetch failure when shuffling data.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-dev-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-dev-help@hadoop.apache.org