You are viewing a plain text version of this content. The canonical link for it is here.
Posted to reviews@spark.apache.org by mgummelt <gi...@git.apache.org> on 2016/08/05 21:48:59 UTC

[GitHub] spark pull request #14511: [SPARK-16927] [SPARK-16923] Override task propert...

GitHub user mgummelt opened a pull request:

    https://github.com/apache/spark/pull/14511

    [SPARK-16927] [SPARK-16923] Override task properties at dispatcher.

    ## What changes were proposed in this pull request?
    
    - enable setting default properties for all jobs submitted through the dispatcher [SPARK-16927]
    - remove duplication of conf vars on cluster submitted jobs [SPARK-16923] (this is a small fix, so I'm including in the same PR)
    
    ## How was this patch tested?
    
    mesos/spark integration test suite
    manual testing
    
    


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/mesosphere/spark override-props

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/14511.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #14511
    
----
commit f42ac25ed1301562b96efac802fd52671a9df794
Author: Timothy Chen <tn...@gmail.com>
Date:   2016-03-14T18:49:39Z

    Override task properties at dispatcher.

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14511: [SPARK-16927] [SPARK-16923] Override task properties at ...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/14511
  
    **[Test build #63464 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/63464/consoleFull)** for PR 14511 at commit [`81f2b1c`](https://github.com/apache/spark/commit/81f2b1c61e814dadaee6f17229cf4d1f46c2cbed).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14511: [SPARK-16927] [SPARK-16923] Override task properties at ...

Posted by mgummelt <gi...@git.apache.org>.
Github user mgummelt commented on the issue:

    https://github.com/apache/spark/pull/14511
  
    retest please


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14511: [SPARK-16927] [SPARK-16923] Override task properties at ...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/14511
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/63287/
    Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14511: [SPARK-16927] [SPARK-16923] Override task propert...

Posted by mgummelt <gi...@git.apache.org>.
Github user mgummelt commented on a diff in the pull request:

    https://github.com/apache/spark/pull/14511#discussion_r73947851
  
    --- Diff: core/src/main/scala/org/apache/spark/scheduler/cluster/mesos/MesosClusterScheduler.scala ---
    @@ -358,14 +358,12 @@ private[spark] class MesosClusterScheduler(
             .orElse(desc.command.environment.get("SPARK_EXECUTOR_URI"))
       }
     
    -  private def getDriverEnvironment(desc: MesosDriverDescription): Environment = {
    -    val env = {
    -      val executorOpts = desc.conf.getAll.map { case (k, v) => s"-D$k=$v" }.mkString(" ")
    -      val executorEnv = Map("SPARK_EXECUTOR_OPTS" -> executorOpts)
    --- End diff --
    
    I've reread this a few times, and still can't understand it.  Can you rephrase this comment?
    



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14511: [SPARK-16927] [SPARK-16923] Override task propert...

Posted by asfgit <gi...@git.apache.org>.
Github user asfgit closed the pull request at:

    https://github.com/apache/spark/pull/14511


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14511: [SPARK-16927] [SPARK-16923] Override task propert...

Posted by mgummelt <gi...@git.apache.org>.
Github user mgummelt commented on a diff in the pull request:

    https://github.com/apache/spark/pull/14511#discussion_r73947926
  
    --- Diff: core/src/main/scala/org/apache/spark/scheduler/cluster/mesos/MesosClusterScheduler.scala ---
    @@ -463,9 +455,20 @@ private[spark] class MesosClusterScheduler(
             .mkString(",")
           options ++= Seq("--py-files", formattedFiles)
         }
    -    desc.conf.getAll
    +
    +    // --conf
    +    val replicatedOptionsBlacklist = Set(
    +      "spark.jars", // Avoids duplicate classes in classpath
    +      "spark.submit.deployMode", // this would be set to `cluster`, but we need client
    +      "spark.master" // this contains the address of the dispatcher, not master
    +    )
    +    val defaultConf = conf.getAllWithPrefix("spark.mesos.dispatcher.driverProperty.").toMap
    --- End diff --
    
    That's just as verbose


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14511: [SPARK-16927] [SPARK-16923] Override task propert...

Posted by skonto <gi...@git.apache.org>.
Github user skonto commented on a diff in the pull request:

    https://github.com/apache/spark/pull/14511#discussion_r73978967
  
    --- Diff: core/src/main/scala/org/apache/spark/scheduler/cluster/mesos/MesosClusterScheduler.scala ---
    @@ -463,9 +455,20 @@ private[spark] class MesosClusterScheduler(
             .mkString(",")
           options ++= Seq("--py-files", formattedFiles)
         }
    -    desc.conf.getAll
    +
    +    // --conf
    +    val replicatedOptionsBlacklist = Set(
    +      "spark.jars", // Avoids duplicate classes in classpath
    +      "spark.submit.deployMode", // this would be set to `cluster`, but we need client
    +      "spark.master" // this contains the address of the dispatcher, not master
    +    )
    +    val defaultConf = conf.getAllWithPrefix("spark.mesos.dispatcher.driverProperty.").toMap
    --- End diff --
    
    I know just a bit more explicit about the semantics


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14511: [SPARK-16927] [SPARK-16923] Override task propert...

Posted by skonto <gi...@git.apache.org>.
Github user skonto commented on a diff in the pull request:

    https://github.com/apache/spark/pull/14511#discussion_r73979218
  
    --- Diff: core/src/main/scala/org/apache/spark/scheduler/cluster/mesos/MesosClusterScheduler.scala ---
    @@ -358,14 +358,12 @@ private[spark] class MesosClusterScheduler(
             .orElse(desc.command.environment.get("SPARK_EXECUTOR_URI"))
       }
     
    -  private def getDriverEnvironment(desc: MesosDriverDescription): Environment = {
    -    val env = {
    -      val executorOpts = desc.conf.getAll.map { case (k, v) => s"-D$k=$v" }.mkString(" ")
    -      val executorEnv = Map("SPARK_EXECUTOR_OPTS" -> executorOpts)
    --- End diff --
    
    I rephrased it...


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14511: [SPARK-16927] [SPARK-16923] Override task propert...

Posted by skonto <gi...@git.apache.org>.
Github user skonto commented on a diff in the pull request:

    https://github.com/apache/spark/pull/14511#discussion_r73868759
  
    --- Diff: core/src/main/scala/org/apache/spark/scheduler/cluster/mesos/MesosClusterScheduler.scala ---
    @@ -463,9 +455,20 @@ private[spark] class MesosClusterScheduler(
             .mkString(",")
           options ++= Seq("--py-files", formattedFiles)
         }
    -    desc.conf.getAll
    +
    +    // --conf
    +    val replicatedOptionsBlacklist = Set(
    +      "spark.jars", // Avoids duplicate classes in classpath
    +      "spark.submit.deployMode", // this would be set to `cluster`, but we need client
    +      "spark.master" // this contains the address of the dispatcher, not master
    +    )
    +    val defaultConf = conf.getAllWithPrefix("spark.mesos.dispatcher.driverProperty.").toMap
    +    val driverConf = desc.conf.getAll
           .filter { case (key, _) => !replicatedOptionsBlacklist.contains(key) }
    -      .foreach { case (key, value) => options ++= Seq("--conf", s"$key=${shellEscape(value)}") }
    +      .toMap
    +    (defaultConf ++ driverConf).foreach { case (key, value) =>
    --- End diff --
    
    what if I have the same property assigned both at driverConf and defaultConf should not be the case that the former takes precedence?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14511: [SPARK-16927] [SPARK-16923] Override task properties at ...

Posted by mgummelt <gi...@git.apache.org>.
Github user mgummelt commented on the issue:

    https://github.com/apache/spark/pull/14511
  
    @skonto Can you take a look at this?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14511: [SPARK-16927] [SPARK-16923] Override task properties at ...

Posted by skonto <gi...@git.apache.org>.
Github user skonto commented on the issue:

    https://github.com/apache/spark/pull/14511
  
    I will have a look shortly


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14511: [SPARK-16927] [SPARK-16923] Override task properties at ...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/14511
  
    **[Test build #63396 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/63396/consoleFull)** for PR 14511 at commit [`d6c25f0`](https://github.com/apache/spark/commit/d6c25f0fa6723e07fe83bb16a4baa7c8523d4182).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14511: [SPARK-16927] [SPARK-16923] Override task properties at ...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/14511
  
    Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14511: [SPARK-16927] [SPARK-16923] Override task propert...

Posted by mgummelt <gi...@git.apache.org>.
Github user mgummelt commented on a diff in the pull request:

    https://github.com/apache/spark/pull/14511#discussion_r73980410
  
    --- Diff: core/src/main/scala/org/apache/spark/scheduler/cluster/mesos/MesosClusterScheduler.scala ---
    @@ -358,14 +358,12 @@ private[spark] class MesosClusterScheduler(
             .orElse(desc.command.environment.get("SPARK_EXECUTOR_URI"))
       }
     
    -  private def getDriverEnvironment(desc: MesosDriverDescription): Environment = {
    -    val env = {
    -      val executorOpts = desc.conf.getAll.map { case (k, v) => s"-D$k=$v" }.mkString(" ")
    -      val executorEnv = Map("SPARK_EXECUTOR_OPTS" -> executorOpts)
    --- End diff --
    
    Thanks.
    
    The one way to pass configs through.  The only thing I would change in your comment is "to all executors", because most spark properties aren't set on executors.  They're set on drivers, and often influence how the drivers launch the executors.
    
    That single way is now in `generateCmdOption`, which translates `desc.conf` into `--conf` parameters for the driver.  Removing this is removing a second, redundant way of setting the same configs.
    
    There is another way users can pass configs through, which is through the deprecated `SPARK_SUBMIT_OPTS` env var.  That's passed through just as all env vars are passed through, which is in this method (`desc.command.environment`)
    
    



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14511: [SPARK-16927] [SPARK-16923] Override task properties at ...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/14511
  
    **[Test build #63396 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/63396/consoleFull)** for PR 14511 at commit [`d6c25f0`](https://github.com/apache/spark/commit/d6c25f0fa6723e07fe83bb16a4baa7c8523d4182).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14511: [SPARK-16927] [SPARK-16923] Override task properties at ...

Posted by srowen <gi...@git.apache.org>.
Github user srowen commented on the issue:

    https://github.com/apache/spark/pull/14511
  
    Merged to master


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14511: [SPARK-16927] [SPARK-16923] Override task propert...

Posted by skonto <gi...@git.apache.org>.
Github user skonto commented on a diff in the pull request:

    https://github.com/apache/spark/pull/14511#discussion_r73864552
  
    --- Diff: core/src/main/scala/org/apache/spark/scheduler/cluster/mesos/MesosClusterScheduler.scala ---
    @@ -463,9 +455,20 @@ private[spark] class MesosClusterScheduler(
             .mkString(",")
           options ++= Seq("--py-files", formattedFiles)
         }
    -    desc.conf.getAll
    +
    +    // --conf
    +    val replicatedOptionsBlacklist = Set(
    +      "spark.jars", // Avoids duplicate classes in classpath
    +      "spark.submit.deployMode", // this would be set to `cluster`, but we need client
    +      "spark.master" // this contains the address of the dispatcher, not master
    +    )
    +    val defaultConf = conf.getAllWithPrefix("spark.mesos.dispatcher.driverProperty.").toMap
    --- End diff --
    
    Property suffix seems verbose to me.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14511: [SPARK-16927] [SPARK-16923] Override task propert...

Posted by mgummelt <gi...@git.apache.org>.
Github user mgummelt commented on a diff in the pull request:

    https://github.com/apache/spark/pull/14511#discussion_r73764731
  
    --- Diff: core/src/main/scala/org/apache/spark/scheduler/cluster/mesos/MesosClusterScheduler.scala ---
    @@ -358,14 +358,12 @@ private[spark] class MesosClusterScheduler(
             .orElse(desc.command.environment.get("SPARK_EXECUTOR_URI"))
       }
     
    -  private def getDriverEnvironment(desc: MesosDriverDescription): Environment = {
    -    val env = {
    -      val executorOpts = desc.conf.getAll.map { case (k, v) => s"-D$k=$v" }.mkString(" ")
    -      val executorEnv = Map("SPARK_EXECUTOR_OPTS" -> executorOpts)
    --- End diff --
    
    Not only are these variables already set in `generateCmdOption`, but SPARK_EXECUTOR_OPTS has no effect on the driver process 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14511: [SPARK-16927] [SPARK-16923] Override task properties at ...

Posted by skonto <gi...@git.apache.org>.
Github user skonto commented on the issue:

    https://github.com/apache/spark/pull/14511
  
    Ok LGTM other than the name of the setting for which I dont have a much better alternative.  
    @srowen could you check and merge pls?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14511: [SPARK-16927] [SPARK-16923] Override task propert...

Posted by skonto <gi...@git.apache.org>.
Github user skonto commented on a diff in the pull request:

    https://github.com/apache/spark/pull/14511#discussion_r73864247
  
    --- Diff: core/src/main/scala/org/apache/spark/scheduler/cluster/mesos/MesosClusterScheduler.scala ---
    @@ -358,14 +358,12 @@ private[spark] class MesosClusterScheduler(
             .orElse(desc.command.environment.get("SPARK_EXECUTOR_URI"))
       }
     
    -  private def getDriverEnvironment(desc: MesosDriverDescription): Environment = {
    -    val env = {
    -      val executorOpts = desc.conf.getAll.map { case (k, v) => s"-D$k=$v" }.mkString(" ")
    -      val executorEnv = Map("SPARK_EXECUTOR_OPTS" -> executorOpts)
    --- End diff --
    
    From what I understand SPARK_EXECUTOR_OPTS is not used because it is actually reconstructed at the driver side
    [here:](https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/scheduler/cluster/mesos/MesosCoarseGrainedSchedulerBackend.scala#L180)  from spark.executor.extraJavaOptions.
    I dont see that property set at the Dispacther side.
    
    Two things here:
    I would prefer the process of reconstructing the env to be transparent to the user, the same for client and cluster mode, no need for special prefixes. 
    Secondly, we need to make sure there is continuity with all properties we pass from process to process. 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14511: [SPARK-16927] [SPARK-16923] Override task properties at ...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/14511
  
    **[Test build #63287 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/63287/consoleFull)** for PR 14511 at commit [`f42ac25`](https://github.com/apache/spark/commit/f42ac25ed1301562b96efac802fd52671a9df794).
     * This patch **fails PySpark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14511: [SPARK-16927] [SPARK-16923] Override task propert...

Posted by mgummelt <gi...@git.apache.org>.
Github user mgummelt commented on a diff in the pull request:

    https://github.com/apache/spark/pull/14511#discussion_r73948127
  
    --- Diff: core/src/main/scala/org/apache/spark/scheduler/cluster/mesos/MesosClusterScheduler.scala ---
    @@ -463,9 +455,20 @@ private[spark] class MesosClusterScheduler(
             .mkString(",")
           options ++= Seq("--py-files", formattedFiles)
         }
    -    desc.conf.getAll
    +
    +    // --conf
    +    val replicatedOptionsBlacklist = Set(
    +      "spark.jars", // Avoids duplicate classes in classpath
    +      "spark.submit.deployMode", // this would be set to `cluster`, but we need client
    +      "spark.master" // this contains the address of the dispatcher, not master
    +    )
    +    val defaultConf = conf.getAllWithPrefix("spark.mesos.dispatcher.driverProperty.").toMap
    +    val driverConf = desc.conf.getAll
           .filter { case (key, _) => !replicatedOptionsBlacklist.contains(key) }
    -      .foreach { case (key, value) => options ++= Seq("--conf", s"$key=${shellEscape(value)}") }
    +      .toMap
    +    (defaultConf ++ driverConf).foreach { case (key, value) =>
    --- End diff --
    
    It does: http://stackoverflow.com/questions/11742451/concatenate-two-immutable-maps-which-elements-are-preferred
    



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14511: [SPARK-16927] [SPARK-16923] Override task properties at ...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/14511
  
    **[Test build #63287 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/63287/consoleFull)** for PR 14511 at commit [`f42ac25`](https://github.com/apache/spark/commit/f42ac25ed1301562b96efac802fd52671a9df794).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14511: [SPARK-16927] [SPARK-16923] Override task propert...

Posted by skonto <gi...@git.apache.org>.
Github user skonto commented on a diff in the pull request:

    https://github.com/apache/spark/pull/14511#discussion_r73978909
  
    --- Diff: core/src/main/scala/org/apache/spark/scheduler/cluster/mesos/MesosClusterScheduler.scala ---
    @@ -463,9 +455,20 @@ private[spark] class MesosClusterScheduler(
             .mkString(",")
           options ++= Seq("--py-files", formattedFiles)
         }
    -    desc.conf.getAll
    +
    +    // --conf
    +    val replicatedOptionsBlacklist = Set(
    +      "spark.jars", // Avoids duplicate classes in classpath
    +      "spark.submit.deployMode", // this would be set to `cluster`, but we need client
    +      "spark.master" // this contains the address of the dispatcher, not master
    +    )
    +    val defaultConf = conf.getAllWithPrefix("spark.mesos.dispatcher.driverProperty.").toMap
    +    val driverConf = desc.conf.getAll
           .filter { case (key, _) => !replicatedOptionsBlacklist.contains(key) }
    -      .foreach { case (key, value) => options ++= Seq("--conf", s"$key=${shellEscape(value)}") }
    +      .toMap
    +    (defaultConf ++ driverConf).foreach { case (key, value) =>
    --- End diff --
    
    The answer there does not have a clear reference assume that the behavior is consistent.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14511: [SPARK-16927] [SPARK-16923] Override task propert...

Posted by mgummelt <gi...@git.apache.org>.
Github user mgummelt commented on a diff in the pull request:

    https://github.com/apache/spark/pull/14511#discussion_r73981151
  
    --- Diff: core/src/main/scala/org/apache/spark/scheduler/cluster/mesos/MesosClusterScheduler.scala ---
    @@ -463,9 +455,20 @@ private[spark] class MesosClusterScheduler(
             .mkString(",")
           options ++= Seq("--py-files", formattedFiles)
         }
    -    desc.conf.getAll
    +
    +    // --conf
    +    val replicatedOptionsBlacklist = Set(
    +      "spark.jars", // Avoids duplicate classes in classpath
    +      "spark.submit.deployMode", // this would be set to `cluster`, but we need client
    +      "spark.master" // this contains the address of the dispatcher, not master
    +    )
    +    val defaultConf = conf.getAllWithPrefix("spark.mesos.dispatcher.driverProperty.").toMap
    --- End diff --
    
    I agree, except I don't think there's any need to make a separate `driver` level in the hierarchy.  I changed it to `driverDefault`.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14511: [SPARK-16927] [SPARK-16923] Override task properties at ...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/14511
  
    Merged build finished. Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14511: [SPARK-16927] [SPARK-16923] Override task propert...

Posted by skonto <gi...@git.apache.org>.
Github user skonto commented on a diff in the pull request:

    https://github.com/apache/spark/pull/14511#discussion_r73864717
  
    --- Diff: docs/running-on-mesos.md ---
    @@ -467,7 +467,19 @@ See the [configuration page](configuration.html) for information on Spark config
         Set the Spark Mesos dispatcher webui_url for interacting with the framework.
         If unset it will point to Spark's internal web UI.
       </td>
    +  </tr>
    +<tr>
    +  <td><code>spark.mesos.dispatcher.driverProperty.[PropertyName]</code></td>
    +  <td><code>(none)</code></td>
    +  <td>
    +    Set default properties for drivers submitted through the
    +    dispatcher.  For example,
    +    spark.mesos.dispatcher.driverProperty.spark.executor.memory=32g
    --- End diff --
    
    From a UX perspective I would like to set spark.executor.memory=32g without caring about where I am running on.... test env, real cluster client or cluster mode...


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14511: [SPARK-16927] [SPARK-16923] Override task properties at ...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/14511
  
    Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14511: [SPARK-16927] [SPARK-16923] Override task properties at ...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/14511
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/63396/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14511: [SPARK-16927] [SPARK-16923] Override task properties at ...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/14511
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/63464/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14511: [SPARK-16927] [SPARK-16923] Override task properties at ...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/14511
  
    **[Test build #63464 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/63464/consoleFull)** for PR 14511 at commit [`81f2b1c`](https://github.com/apache/spark/commit/81f2b1c61e814dadaee6f17229cf4d1f46c2cbed).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org