Posted to dev@myriad.apache.org by "Swapnil Daingade (JIRA)" <ji...@apache.org> on 2015/08/21 22:11:45 UTC
[jira] [Created] (MYRIAD-127) Recommend nodes to launch NodeManagers optimized for locality based on input data for Yarn jobs
Swapnil Daingade created MYRIAD-127:
---------------------------------------
Summary: Recommend nodes to launch NodeManagers optimized for locality based on input data for Yarn jobs
Key: MYRIAD-127
URL: https://issues.apache.org/jira/browse/MYRIAD-127
Project: Myriad
Issue Type: New Feature
Reporter: Swapnil Daingade
Assignee: Swapnil Daingade
Hadoop/YARN works on the principle of taking computation to the data, so data locality is important for optimal performance. When a YARN job is launched, the user specifies the DFS location of the data being operated on. By looking at all the locations being operated on by the running YARN jobs, we can try to predict the best nodes on which to launch NodeManagers optimized for locality.
For example, suppose we have a 20-node Mesos cluster and a user has a 5-node YARN cluster running a few jobs. By looking at the data being operated on by the YARN jobs, we can recommend which 5 nodes to launch NodeManagers on so that locality is optimized.
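One possible way to produce such a recommendation is sketched below. It is only an illustration, not a proposed design: it assumes the block-to-host mapping for the job inputs has already been collected (for instance from the DFS block locations), and it uses a simple greedy heuristic that repeatedly picks the host holding the most blocks not yet covered by a chosen host. The function name `recommend_nodes` and the input shape are hypothetical.

```python
from collections import defaultdict


def recommend_nodes(block_hosts, k):
    """Greedily pick up to k hosts that give the most blocks a local replica.

    block_hosts: dict mapping a block id to the list of hosts holding a
                 replica of that block (hypothetical input shape).
    k:           number of NodeManagers the user wants to run.
    """
    # Invert the mapping: host -> set of blocks stored on that host.
    blocks_on = defaultdict(set)
    for block, hosts in block_hosts.items():
        for host in hosts:
            blocks_on[host].add(block)

    chosen, covered = [], set()
    for _ in range(k):
        candidates = sorted(h for h in blocks_on if h not in chosen)
        if not candidates:
            break
        # Host adding the most not-yet-covered blocks; ties go to the
        # lexicographically smallest host name.
        best = max(candidates, key=lambda h: len(blocks_on[h] - covered))
        if not blocks_on[best] - covered:
            break  # no remaining host improves locality any further
        chosen.append(best)
        covered |= blocks_on[best]
    return chosen
```

If the function returns fewer than `k` hosts, every input block already has a local replica on a chosen host, and the remaining NodeManagers could be placed on any node with free resources.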
In a more autonomous mode, we could flex down some of the existing NMs and flex up new ones on new nodes.
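As a rough sketch of what the autonomous mode would need to compute, the set of NodeManagers to flex down and the set of nodes to flex up can be derived by comparing the nodes currently running NMs with a recommended set. The function name `flex_plan` and both inputs are hypothetical; how the flex operations are actually issued is left out.

```python
def flex_plan(current_nodes, recommended_nodes):
    """Return (to_flex_down, to_flex_up) as sorted lists of host names.

    current_nodes:     hosts currently running a NodeManager.
    recommended_nodes: hosts recommended for better data locality.
    """
    current, recommended = set(current_nodes), set(recommended_nodes)
    to_flex_down = sorted(current - recommended)  # NMs no longer wanted
    to_flex_up = sorted(recommended - current)    # new hosts that need an NM
    return to_flex_down, to_flex_up
```

Nodes that appear in both sets keep their NodeManagers untouched, so running containers on them are not disturbed.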
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)