You are viewing a plain text version of this content. The canonical link for it is here.
Posted to reviews@spark.apache.org by nchammas <gi...@git.apache.org> on 2014/10/29 04:14:42 UTC

[GitHub] spark pull request: [EC2] Don't change working dir on user

GitHub user nchammas opened a pull request:

    https://github.com/apache/spark/pull/2988

    [EC2] Don't change working dir on user

    The issue was uncovered after [this discussion](https://issues.apache.org/jira/browse/SPARK-3398?focusedCommentId=14187471&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14187471).
    
    Don't change the working directory on the user. This breaks relative paths the user may pass in, e.g., for the SSH identity file.
    
    ```
    ./ec2/spark-ec2 -i ../my.pem
    ```
    
    This patch will preserve the user's current working directory and allow calls like the one above to work.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/nchammas/spark spark-ec2-cwd

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/2988.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #2988
    
----
commit ce071fca807caf232e6a6287b90ac59ed4bcd605
Author: Nicholas Chammas <ni...@gmail.com>
Date:   2014-10-29T03:05:49Z

    don't change working dir

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by shivaram <gi...@git.apache.org>.
Github user shivaram commented on the pull request:

    https://github.com/apache/spark/pull/2988#issuecomment-61927044
  
    Thanks @nchammas -- Merged this and backported to 1.2


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2988#issuecomment-61006242
  
      [Test build #22464 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22464/consoleFull) for   PR 2988 at commit [`77871a2`](https://github.com/apache/spark/commit/77871a2036c71f06b1ce431f81af002dbf234433).
     * This patch **passes all tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by nchammas <gi...@git.apache.org>.
Github user nchammas commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2988#discussion_r19587131
  
    --- Diff: ec2/spark_ec2.py ---
    @@ -718,12 +726,16 @@ def get_num_disks(instance_type):
             return 1
     
     
    -# Deploy the configuration file templates in a given local directory to
    -# a cluster, filling in any template parameters with information about the
    -# cluster (e.g. lists of masters and slaves). Files are only deployed to
    -# the first master instance in the cluster, and we expect the setup
    -# script to be run on that instance to copy them to other nodes.
     def deploy_files(conn, root_dir, opts, master_nodes, slave_nodes, modules):
    +    """
    +    Deploy the configuration file templates in a given local directory to
    --- End diff --
    
    Yeah, I thought I'd make this the first change toward having all the function descriptions be in docstrings, but for consistency's sake you're right--it should be a comment on top.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by nchammas <gi...@git.apache.org>.
Github user nchammas commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2988#discussion_r19923159
  
    --- Diff: ec2/spark_ec2.py ---
    @@ -718,12 +726,16 @@ def get_num_disks(instance_type):
             return 1
     
     
    -# Deploy the configuration file templates in a given local directory to
    -# a cluster, filling in any template parameters with information about the
    -# cluster (e.g. lists of masters and slaves). Files are only deployed to
    -# the first master instance in the cluster, and we expect the setup
    -# script to be run on that instance to copy them to other nodes.
     def deploy_files(conn, root_dir, opts, master_nodes, slave_nodes, modules):
    +    """
    +    Deploy the configuration file templates in a given local directory to
    --- End diff --
    
    No problem. Will do.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by nchammas <gi...@git.apache.org>.
Github user nchammas commented on the pull request:

    https://github.com/apache/spark/pull/2988#issuecomment-61013749
  
    @shivaram Oh. Glad I pinged you. :)
    
    In my brief testing I didn't allow `launch` to go all the way through. This is obviously broken.
    
    Will update this PR with an alternate approach that lets the CWD be `ec2/` as it was before.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by asfgit <gi...@git.apache.org>.
Github user asfgit closed the pull request at:

    https://github.com/apache/spark/pull/2988


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by shivaram <gi...@git.apache.org>.
Github user shivaram commented on the pull request:

    https://github.com/apache/spark/pull/2988#issuecomment-61043164
  
    Functionality LGTM. I left a minor style question for @JoshRosen 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/2988#issuecomment-61926208
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/22981/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2988#issuecomment-61919739
  
      [Test build #22977 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22977/consoleFull) for   PR 2988 at commit [`fbc20c7`](https://github.com/apache/spark/commit/fbc20c7a92bf0e75d16613c7e56532740257990b).
     * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [EC2] Don't change working dir on user

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/2988#issuecomment-60873366
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/22420/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by nchammas <gi...@git.apache.org>.
Github user nchammas commented on the pull request:

    https://github.com/apache/spark/pull/2988#issuecomment-61007294
  
    cc @shivaram @JoshRosen 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by nchammas <gi...@git.apache.org>.
Github user nchammas commented on the pull request:

    https://github.com/apache/spark/pull/2988#issuecomment-61023615
  
    So basically you're saying go with option 2, right?
    
    From what I can see, `deploy.generic` may be the only file we need to fix the path for. Is that right? Maybe it's not such a big deal then.
    
    In any case, I'll give it a shot and report back.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by nchammas <gi...@git.apache.org>.
Github user nchammas commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2988#discussion_r19923233
  
    --- Diff: ec2/spark_ec2.py ---
    @@ -718,12 +726,16 @@ def get_num_disks(instance_type):
             return 1
     
     
    -# Deploy the configuration file templates in a given local directory to
    -# a cluster, filling in any template parameters with information about the
    -# cluster (e.g. lists of masters and slaves). Files are only deployed to
    -# the first master instance in the cluster, and we expect the setup
    -# script to be run on that instance to copy them to other nodes.
     def deploy_files(conn, root_dir, opts, master_nodes, slave_nodes, modules):
    +    """
    +    Deploy the configuration file templates in a given local directory to
    --- End diff --
    
    @shivaram Done.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [EC2] Don't change working dir on user

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2988#issuecomment-60873361
  
      [Test build #22420 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22420/consoleFull) for   PR 2988 at commit [`ce071fc`](https://github.com/apache/spark/commit/ce071fca807caf232e6a6287b90ac59ed4bcd605).
     * This patch **passes all tests**.
     * This patch merges cleanly.
     * This patch adds the following public classes _(experimental)_:
      * `  class DeferredObjectAdapter(oi: ObjectInspector) extends DeferredObject `



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by nchammas <gi...@git.apache.org>.
Github user nchammas commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2988#discussion_r19746328
  
    --- Diff: ec2/spark_ec2.py ---
    @@ -718,12 +726,16 @@ def get_num_disks(instance_type):
             return 1
     
     
    -# Deploy the configuration file templates in a given local directory to
    -# a cluster, filling in any template parameters with information about the
    -# cluster (e.g. lists of masters and slaves). Files are only deployed to
    -# the first master instance in the cluster, and we expect the setup
    -# script to be run on that instance to copy them to other nodes.
     def deploy_files(conn, root_dir, opts, master_nodes, slave_nodes, modules):
    +    """
    +    Deploy the configuration file templates in a given local directory to
    --- End diff --
    
    @JoshRosen Any comments on this? I'd be more than happy to convert it back to being a comment.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by shivaram <gi...@git.apache.org>.
Github user shivaram commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2988#discussion_r19922838
  
    --- Diff: ec2/spark_ec2.py ---
    @@ -718,12 +726,16 @@ def get_num_disks(instance_type):
             return 1
     
     
    -# Deploy the configuration file templates in a given local directory to
    -# a cluster, filling in any template parameters with information about the
    -# cluster (e.g. lists of masters and slaves). Files are only deployed to
    -# the first master instance in the cluster, and we expect the setup
    -# script to be run on that instance to copy them to other nodes.
     def deploy_files(conn, root_dir, opts, master_nodes, slave_nodes, modules):
    +    """
    +    Deploy the configuration file templates in a given local directory to
    --- End diff --
    
    My suggestion would be to revert to the existing style for now and open a separate patch for the style fix (similar to the PEP-8 patches ?). 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by shivaram <gi...@git.apache.org>.
Github user shivaram commented on the pull request:

    https://github.com/apache/spark/pull/2988#issuecomment-61021907
  
    Thanks for taking a closer look ! I don't know much python, but can't we get the directory that the script is in using something like `__file__` and prefix that to "deploy.generic" ?  
    
    From what I can see there are two kinds of paths we have -- the first are paths supplied by the user (like the SSH identity file) where we can preserve the working directory and not change the path
    
    The second are files relative to spark_ec2.py 's location (like deploy.generic). Making these absolute paths seems like a good idea to me.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2988#issuecomment-61009928
  
      [Test build #22465 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22465/consoleFull) for   PR 2988 at commit [`bcdf6a5`](https://github.com/apache/spark/commit/bcdf6a5da70d1ee0067e72497ccc7fc6a71c7c8b).
     * This patch **passes all tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by shivaram <gi...@git.apache.org>.
Github user shivaram commented on the pull request:

    https://github.com/apache/spark/pull/2988#issuecomment-61008323
  
    @nchammas does the template replacement code still work correctly  ? I am referring to the "deploy.generic" dir that we pass from https://github.com/apache/spark/blob/master/ec2/spark_ec2.py#L589 to https://github.com/apache/spark/blob/master/ec2/spark_ec2.py#L726


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2988#issuecomment-61919824
  
      [Test build #22977 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22977/consoleFull) for   PR 2988 at commit [`fbc20c7`](https://github.com/apache/spark/commit/fbc20c7a92bf0e75d16613c7e56532740257990b).
     * This patch **fails Python style tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/2988#issuecomment-61035013
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/22501/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [EC2] Don't change working dir on user

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2988#issuecomment-60998143
  
      [Test build #22465 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22465/consoleFull) for   PR 2988 at commit [`bcdf6a5`](https://github.com/apache/spark/commit/bcdf6a5da70d1ee0067e72497ccc7fc6a71c7c8b).
     * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2988#issuecomment-61920803
  
      [Test build #22981 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22981/consoleFull) for   PR 2988 at commit [`f3850b5`](https://github.com/apache/spark/commit/f3850b55ceae060a1a0f4c40a85da447a446803c).
     * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by nchammas <gi...@git.apache.org>.
Github user nchammas commented on the pull request:

    https://github.com/apache/spark/pull/2988#issuecomment-61041277
  
    @shivaram I took your suggestion and tested to make sure `spark-ec2` still creates a functioning EC2 cluster.
    
    This is ready for another review.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [EC2] Don't change working dir on user

Posted by nchammas <gi...@git.apache.org>.
Github user nchammas commented on the pull request:

    https://github.com/apache/spark/pull/2988#issuecomment-60874638
  
    Phantom new classes _again_? :angry: I'll look into this tomorrow. Thought it was a resolved issue...


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by nchammas <gi...@git.apache.org>.
Github user nchammas commented on the pull request:

    https://github.com/apache/spark/pull/2988#issuecomment-61016125
  
    Actually, we have a few options here for `spark-ec2` and the underlying `spark_ec2.py` script.
    
    1. We leave the script as-is and just document the fact that relative paths won't work.
    2. We update the script to preserve the user's working directory and qualify any paths relative to that directory.
    3. We update the script so that all relative paths passed in are translated to absolute paths.
    
    Comments:
    * Option 1 is the simplest but is a bit user-unfriendly.
    * Option 3 is tricky since we'd have to do the translation before the Python script is invoked, and this is annoying to do in a cross-platform manner; alternately, we can pass in both the user's CWD and the untranslated paths so that Python can do the translation in a cross-platform manner.
    * Option 2 seems feasible, but we'd have to track down all the places where we handle files.
    
    So far, I see that any code that handles the following files would be affected if we go with either Option 2 or 3:
    * [SSH identity file](https://github.com/apache/spark/blob/e7fd80413d531e23b6c4def0ee32e52a39da36fa/ec2/spark_ec2.py#L71)
    * [user data file](https://github.com/apache/spark/blob/e7fd80413d531e23b6c4def0ee32e52a39da36fa/ec2/spark_ec2.py#L151)
    * [deploy.generic file](https://github.com/apache/spark/blob/e7fd80413d531e23b6c4def0ee32e52a39da36fa/ec2/spark_ec2.py#L589)
    
    This turned out to be a bit more involved than I expected...
    
    @shivaram What would you recommend?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/2988#issuecomment-61009937
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/22465/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [EC2] Don't change working dir on user

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2988#issuecomment-60994321
  
      [Test build #22464 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22464/consoleFull) for   PR 2988 at commit [`77871a2`](https://github.com/apache/spark/commit/77871a2036c71f06b1ce431f81af002dbf234433).
     * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2988#issuecomment-61035007
  
      [Test build #22501 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22501/consoleFull) for   PR 2988 at commit [`752f958`](https://github.com/apache/spark/commit/752f958e49b4f84bde745db8b8f45f50e5117f6a).
     * This patch **passes all tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by shivaram <gi...@git.apache.org>.
Github user shivaram commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2988#discussion_r19586967
  
    --- Diff: ec2/spark_ec2.py ---
    @@ -718,12 +726,16 @@ def get_num_disks(instance_type):
             return 1
     
     
    -# Deploy the configuration file templates in a given local directory to
    -# a cluster, filling in any template parameters with information about the
    -# cluster (e.g. lists of masters and slaves). Files are only deployed to
    -# the first master instance in the cluster, and we expect the setup
    -# script to be run on that instance to copy them to other nodes.
     def deploy_files(conn, root_dir, opts, master_nodes, slave_nodes, modules):
    +    """
    +    Deploy the configuration file templates in a given local directory to
    --- End diff --
    
    Should we change this style given that other functions in this file have comments on top ? Any thoughts @JoshRosen ?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2988#issuecomment-61029892
  
      [Test build #22501 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22501/consoleFull) for   PR 2988 at commit [`752f958`](https://github.com/apache/spark/commit/752f958e49b4f84bde745db8b8f45f50e5117f6a).
     * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [EC2] Don't change working dir on user

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2988#issuecomment-60868345
  
      [Test build #22420 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22420/consoleFull) for   PR 2988 at commit [`ce071fc`](https://github.com/apache/spark/commit/ce071fca807caf232e6a6287b90ac59ed4bcd605).
     * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/2988#issuecomment-61919827
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/22977/
    Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by nchammas <gi...@git.apache.org>.
Github user nchammas commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2988#discussion_r19922788
  
    --- Diff: ec2/spark_ec2.py ---
    @@ -718,12 +726,16 @@ def get_num_disks(instance_type):
             return 1
     
     
    -# Deploy the configuration file templates in a given local directory to
    -# a cluster, filling in any template parameters with information about the
    -# cluster (e.g. lists of masters and slaves). Files are only deployed to
    -# the first master instance in the cluster, and we expect the setup
    -# script to be run on that instance to copy them to other nodes.
     def deploy_files(conn, root_dir, opts, master_nodes, slave_nodes, modules):
    +    """
    +    Deploy the configuration file templates in a given local directory to
    --- End diff --
    
    @pwendell Perhaps you can comment on this. It's a minor issue, and I think we should be able to get this PR merged in for 1.2.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2988#issuecomment-61926204
  
      [Test build #22981 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22981/consoleFull) for   PR 2988 at commit [`f3850b5`](https://github.com/apache/spark/commit/f3850b55ceae060a1a0f4c40a85da447a446803c).
     * This patch **passes all tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-4137] [EC2] Don't change working dir on...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/2988#issuecomment-61006247
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/22464/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org