Posted to issues@spark.apache.org by "Alexander Bessonov (Jira)" <ji...@apache.org> on 2019/10/02 14:46:00 UTC

[jira] [Updated] (SPARK-29330) Allow users to choose the name of Spark Shuffle service

     [ https://issues.apache.org/jira/browse/SPARK-29330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Alexander Bessonov updated SPARK-29330:
---------------------------------------
    Description: 
Spark currently hardcodes {{spark_shuffle}} as the name of the shuffle service.

The HDP distribution of Spark, on the other hand, uses [{{spark2_shuffle}}|https://github.com/hortonworks/spark2-release/blob/HDP-3.1.0.0-78-tag/resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/ExecutorRunnable.scala#L117]. This allows Spark 1.6 and Spark 2.x to run on the same Hadoop cluster.

As a result, vanilla Spark cannot run on an HDP cluster where only the Spark 2.x shuffle service (HDP flavor) is running, because the shuffle service names do not match.

  was:
As of now, Spark uses hardcoded value {{spark_shuffle}} as the name of the Shuffle Service.

HDP distribution of Spark, on the other hand, uses [{{spark2_shuffle}}|#L117]]. This is done to be able to run both Spark 1.6 and Spark 2.x on the same Hadoop cluster.

Running vanilla Spark on HDP cluster with only Spark 2.x shuffle service (HDP favor) running becomes impossible due to the shuffle service name mismatch.


> Allow users to choose the name of Spark Shuffle service
> -------------------------------------------------------
>
>                 Key: SPARK-29330
>                 URL: https://issues.apache.org/jira/browse/SPARK-29330
>             Project: Spark
>          Issue Type: Improvement
>          Components: Spark Core, YARN
>    Affects Versions: 2.4.4
>            Reporter: Alexander Bessonov
>            Priority: Minor
>
> Spark currently hardcodes {{spark_shuffle}} as the name of the shuffle service.
> The HDP distribution of Spark, on the other hand, uses [{{spark2_shuffle}}|https://github.com/hortonworks/spark2-release/blob/HDP-3.1.0.0-78-tag/resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/ExecutorRunnable.scala#L117]. This allows Spark 1.6 and Spark 2.x to run on the same Hadoop cluster.
> As a result, vanilla Spark cannot run on an HDP cluster where only the Spark 2.x shuffle service (HDP flavor) is running, because the shuffle service names do not match.
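
The mismatch described in the issue can be illustrated with a small, self-contained model. This is only a sketch, not Spark or YARN code: the functions, the set of auxiliary services, and the config key {{spark.shuffle.service.name}} are all assumptions made for illustration. It shows why a hardcoded {{spark_shuffle}} fails against a NodeManager that only registers {{spark2_shuffle}}, and how a user-chosen name would avoid that.

```python
# Hypothetical model of the name mismatch; this is not Spark or YARN code.
DEFAULT_SHUFFLE_SERVICE_NAME = "spark_shuffle"  # the name vanilla Spark hardcodes


def resolve_shuffle_service_name(conf):
    """Return the auxiliary-service name an executor should ask YARN for.

    The key `spark.shuffle.service.name` is an assumption made for this sketch.
    """
    return conf.get("spark.shuffle.service.name", DEFAULT_SHUFFLE_SERVICE_NAME)


def start_executor(conf, node_manager_aux_services):
    """Simulate the lookup a NodeManager performs when a container starts."""
    name = resolve_shuffle_service_name(conf)
    if name not in node_manager_aux_services:
        raise LookupError(f"auxiliary service {name!r} does not exist")
    return name


# An HDP-style NodeManager that only runs the Spark 2.x shuffle service.
hdp_aux_services = {"spark2_shuffle"}

# Vanilla behaviour: the hardcoded name is not registered, so startup fails.
try:
    start_executor({}, hdp_aux_services)
    vanilla_ok = True
except LookupError:
    vanilla_ok = False

# With a user-chosen name, the executor finds the registered service.
chosen = start_executor(
    {"spark.shuffle.service.name": "spark2_shuffle"}, hdp_aux_services
)
```

In this model the only change needed on the Spark side is to read the name from configuration and fall back to the current default, so existing deployments that use {{spark_shuffle}} would be unaffected.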
