You are viewing a plain text version of this content. The canonical link for it is here.
Posted to reviews@spark.apache.org by GitBox <gi...@apache.org> on 2019/04/18 15:48:25 UTC

[GitHub] [spark] tgravescs opened a new pull request #24406: Gpu sched executor cl[SPARK-27024] Executor interface for cluster managers to support GPU and other resourcesean

tgravescs opened a new pull request #24406: Gpu sched executor cl[SPARK-27024] Executor interface for cluster managers to support GPU and other resourcesean
URL: https://github.com/apache/spark/pull/24406
 
 
   ## What changes were proposed in this pull request?
   
   Add in GPU and generic resource type allocation to the executors.
   
   Note this is part of a bigger feature for gpu-aware scheduling and is just how the executor find the resources. The general flow :
   
      - users ask for a certain set of resources, for instance number of gpus - each cluster manager has a specific way to do this.
     -  cluster manager allocates a container or set of resources (standalone mode)
   -    When spark launches the executor in that container, the executor either has to be told what resources it has or it has to auto discover them.
     -  Executor has to register with Driver and tell the driver the set of resources it has so the scheduler can use that to schedule tasks that requires a certain amount of each of those resources
   
   In this pr I added configs and arguments to the executor to be able discover resources. The argument to the executor is intended to be used by standalone mode or other cluster managers that don't have isolation so that it can assign specific resources to specific executors in case there are multiple executors on a node. The argument is a file contains JSON Array of ResourceInformation objects.
   
   The discovery script is meant to be used in an isolated environment where the executor only sees the resources it should use.
   
   Note that there will be follow on PRs to add other parts like the scheduler part. See the epic high level jira: https://issues.apache.org/jira/browse/SPARK-24615
   
   ## How was this patch tested?
   
   Added unit tests and manually tested.
   
   Please review http://spark.apache.org/contributing.html before opening a pull request.
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org