Posted to reviews@spark.apache.org by hlin09 <gi...@git.apache.org> on 2015/04/13 17:58:50 UTC

[GitHub] spark pull request: [SPARK-6881][SparkR] Changes the checkpoint di...

GitHub user hlin09 opened a pull request:

    https://github.com/apache/spark/pull/5493

    [SPARK-6881][SparkR] Changes the checkpoint directory name.

    

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/hlin09/spark fixCheckpointDir

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/5493.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #5493
    
----
commit 1f7ed9e35391c8435c45886959d87e3935631e9b
Author: hlin09 <hl...@gmail.com>
Date:   2015-04-13T15:52:12Z

    Change the checkpoint dir name.

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-6881][SparkR] Changes the checkpoint di...

Posted by hlin09 <gi...@git.apache.org>.
Github user hlin09 commented on a diff in the pull request:

    https://github.com/apache/spark/pull/5493#discussion_r28260573
  
    --- Diff: R/pkg/inst/tests/test_rdd.R ---
    @@ -141,7 +141,7 @@ test_that("PipelinedRDD support actions: cache(), persist(), unpersist(), checkp
       unpersist(rdd2)
       expect_false(rdd2@env$isCached)
     
    -  setCheckpointDir(sc, "checkpoints")
    +  setCheckpointDir(sc, "checkpoint")
    --- End diff --
    
    Done.




[GitHub] spark pull request: [SPARK-6881][SparkR] Changes the checkpoint di...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/5493#issuecomment-92538479
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30201/




[GitHub] spark pull request: [SPARK-6881][SparkR] Changes the checkpoint di...

Posted by shivaram <gi...@git.apache.org>.
Github user shivaram commented on the pull request:

    https://github.com/apache/spark/pull/5493#issuecomment-92515144
  
    Jenkins, ok to test




[GitHub] spark pull request: [SPARK-6881][SparkR] Changes the checkpoint di...

Posted by hlin09 <gi...@git.apache.org>.
Github user hlin09 commented on a diff in the pull request:

    https://github.com/apache/spark/pull/5493#discussion_r28263584
  
    --- Diff: R/pkg/inst/tests/test_rdd.R ---
    @@ -141,7 +141,8 @@ test_that("PipelinedRDD support actions: cache(), persist(), unpersist(), checkp
       unpersist(rdd2)
       expect_false(rdd2@env$isCached)
     
    -  setCheckpointDir(sc, "checkpoints")
    +  tempDir <- tempfile(pattern = "checkpoint")
    --- End diff --
    
    It creates only a path name, and Spark will create the directory according to the specific file system. In my local test, checkpointing seems to work, as I can see the checkpoint directory with data in it. Still, I could create the directory in R and pass it to setCheckpointDir().
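
A minimal base-R sketch of the behavior discussed in this comment, under the assumption that only base R is available: `tempfile()` returns a unique path name without creating anything on disk, so the directory still has to be created, either by the caller or by Spark once the path is passed to setCheckpointDir(). The variable names are illustrative and are not taken from the patch, and the SparkR call is left commented out because it needs a live SparkContext.

```r
# tempfile() only generates a unique path name; nothing is created on disk yet.
checkpointPath <- tempfile(pattern = "checkpoint")
pathExistsBefore <- file.exists(checkpointPath)

# Optionally create the directory from R before handing the path to Spark.
dir.create(checkpointPath)
isDirectory <- file.info(checkpointPath)$isdir

# With a SparkContext `sc` available, the path could then be passed on:
# setCheckpointDir(sc, checkpointPath)

# Remove the directory again so the sketch leaves nothing behind.
unlink(checkpointPath, recursive = TRUE)
```

Creating the directory up front is optional in this sketch; it only shows that the path can be turned into a real directory from R if relying on Spark to create it is undesirable.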




[GitHub] spark pull request: [SPARK-6881][SparkR] Changes the checkpoint di...

Posted by shivaram <gi...@git.apache.org>.
Github user shivaram commented on the pull request:

    https://github.com/apache/spark/pull/5493#issuecomment-92539716
  
    LGTM. Thanks @hlin09 - Merging this




[GitHub] spark pull request: [SPARK-6881][SparkR] Changes the checkpoint di...

Posted by asfgit <gi...@git.apache.org>.
Github user asfgit closed the pull request at:

    https://github.com/apache/spark/pull/5493




[GitHub] spark pull request: [SPARK-6881][SparkR] Changes the checkpoint di...

Posted by srowen <gi...@git.apache.org>.
Github user srowen commented on a diff in the pull request:

    https://github.com/apache/spark/pull/5493#discussion_r28261741
  
    --- Diff: R/pkg/inst/tests/test_rdd.R ---
    @@ -141,7 +141,8 @@ test_that("PipelinedRDD support actions: cache(), persist(), unpersist(), checkp
       unpersist(rdd2)
       expect_false(rdd2@env$isCached)
     
    -  setCheckpointDir(sc, "checkpoints")
    +  tempDir <- tempfile(pattern = "checkpoint")
    --- End diff --
    
    I am not sure if this is an issue, but does this create a file rather than a directory? If so, the checkpoint may not work. Can you unlink the result and make a directory out of it? Or can `tempdir()` make a unique subdirectory?




[GitHub] spark pull request: [SPARK-6881][SparkR] Changes the checkpoint di...

Posted by srowen <gi...@git.apache.org>.
Github user srowen commented on a diff in the pull request:

    https://github.com/apache/spark/pull/5493#discussion_r28270144
  
    --- Diff: R/pkg/inst/tests/test_rdd.R ---
    @@ -141,7 +141,8 @@ test_that("PipelinedRDD support actions: cache(), persist(), unpersist(), checkp
       unpersist(rdd2)
       expect_false(rdd2@env$isCached)
     
    -  setCheckpointDir(sc, "checkpoints")
    +  tempDir <- tempfile(pattern = "checkpoint")
    --- End diff --
    
    OK thanks. FWIW this seems like a good change to me.




[GitHub] spark pull request: [SPARK-6881][SparkR] Changes the checkpoint di...

Posted by hlin09 <gi...@git.apache.org>.
Github user hlin09 commented on a diff in the pull request:

    https://github.com/apache/spark/pull/5493#discussion_r28253099
  
    --- Diff: R/pkg/R/RDD.R ---
    @@ -279,7 +279,7 @@ setMethod("unpersist",
     #' @examples
     #'\dontrun{
     #' sc <- sparkR.init()
    -#' setCheckpointDir(sc, "checkpoints")
    +#' setCheckpointDir(sc, "checkpoint")
    --- End diff --
    
    You are right. They won't be executed.




[GitHub] spark pull request: [SPARK-6881][SparkR] Changes the checkpoint di...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/5493#issuecomment-92411267
  
    Can one of the admins verify this patch?




[GitHub] spark pull request: [SPARK-6881][SparkR] Changes the checkpoint di...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/5493#issuecomment-92538465
  
      [Test build #30201 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30201/consoleFull) for   PR 5493 at commit [`e67fc40`](https://github.com/apache/spark/commit/e67fc4003569669645057570964077ba740fc530).
     * This patch **passes all tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.
     * This patch does not change any dependencies.




[GitHub] spark pull request: [SPARK-6881][SparkR] Changes the checkpoint di...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/5493#issuecomment-92515980
  
      [Test build #30201 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30201/consoleFull) for   PR 5493 at commit [`e67fc40`](https://github.com/apache/spark/commit/e67fc4003569669645057570964077ba740fc530).




[GitHub] spark pull request: [SPARK-6881][SparkR] Changes the checkpoint di...

Posted by srowen <gi...@git.apache.org>.
Github user srowen commented on a diff in the pull request:

    https://github.com/apache/spark/pull/5493#discussion_r28251580
  
    --- Diff: R/pkg/inst/tests/test_rdd.R ---
    @@ -141,7 +141,7 @@ test_that("PipelinedRDD support actions: cache(), persist(), unpersist(), checkp
       unpersist(rdd2)
       expect_false(rdd2@env$isCached)
     
    -  setCheckpointDir(sc, "checkpoints")
    +  setCheckpointDir(sc, "checkpoint")
    --- End diff --
    
    This shouldn't be a directory in the project root to begin with. Make this a temp directory?




[GitHub] spark pull request: [SPARK-6881][SparkR] Changes the checkpoint di...

Posted by shivaram <gi...@git.apache.org>.
Github user shivaram commented on the pull request:

    https://github.com/apache/spark/pull/5493#issuecomment-92515235
  
    Jenkins, add to whitelist




[GitHub] spark pull request: [SPARK-6881][SparkR] Changes the checkpoint di...

Posted by srowen <gi...@git.apache.org>.
Github user srowen commented on a diff in the pull request:

    https://github.com/apache/spark/pull/5493#discussion_r28251542
  
    --- Diff: R/pkg/R/RDD.R ---
    @@ -279,7 +279,7 @@ setMethod("unpersist",
     #' @examples
     #'\dontrun{
     #' sc <- sparkR.init()
    -#' setCheckpointDir(sc, "checkpoints")
    +#' setCheckpointDir(sc, "checkpoint")
    --- End diff --
    
    These two changes are good for consistency, although they don't affect what git ignores, since this isn't executed, right?

