You are viewing a plain text version of this content. The canonical link for it is here.
Posted to reviews@spark.apache.org by zero323 <gi...@git.apache.org> on 2017/05/12 16:39:01 UTC

[GitHub] spark pull request #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

GitHub user zero323 opened a pull request:

    https://github.com/apache/spark/pull/17965

     [SPARK-20726][SPARKR] wrapper for SQL broadcast

    ## What changes were proposed in this pull request?
    
    Adds R wrapper for `o.a.s.sql.functions.broadcast`.
    
    ## How was this patch tested?
    
    Unit tests, check `check-cran.sh`.


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/zero323/spark SPARK-20726

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/17965.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #17965
    
----
commit f190d62460829dcfb84ff1a8e6dd6fe9cbd25719
Author: zero323 <ze...@users.noreply.github.com>
Date:   2017-05-12T15:54:46Z

    Initial implementation

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by felixcheung <gi...@git.apache.org>.
Github user felixcheung commented on a diff in the pull request:

    https://github.com/apache/spark/pull/17965#discussion_r116350427
  
    --- Diff: R/pkg/R/context.R ---
    @@ -258,15 +258,15 @@ includePackage <- function(sc, pkg) {
     #'
     #' # Large Matrix object that we want to broadcast
     #' randomMat <- matrix(nrow=100, ncol=10, data=rnorm(1000))
    -#' randomMatBr <- broadcast(sc, randomMat)
    +#' randomMatBr <- broadcast_(sc, randomMat)
     #'
     #' # Use the broadcast variable inside the function
     #' useBroadcast <- function(x) {
     #'   sum(value(randomMatBr) * x)
     #' }
     #' sumRDD <- lapply(rdd, useBroadcast)
     #'}
    -broadcast <- function(sc, object) {
    +broadcast_ <- function(sc, object) {
    --- End diff --
    
    right, generally this is how we have handled name conflict with an existing RDD method.
    we should be removing the internal only RDD methods at some point


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by zero323 <gi...@git.apache.org>.
Github user zero323 closed the pull request at:

    https://github.com/apache/spark/pull/17965


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by zero323 <gi...@git.apache.org>.
GitHub user zero323 reopened a pull request:

    https://github.com/apache/spark/pull/17965

     [SPARK-20726][SPARKR] wrapper for SQL broadcast

    ## What changes were proposed in this pull request?
    
    - Adds R wrapper for `o.a.s.sql.functions.broadcast`.
    - Renames `broadcast` to `broadcast_`.
    
    ## How was this patch tested?
    
    Unit tests, check `check-cran.sh`.


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/zero323/spark SPARK-20726

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/17965.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #17965
    
----
commit f190d62460829dcfb84ff1a8e6dd6fe9cbd25719
Author: zero323 <ze...@users.noreply.github.com>
Date:   2017-05-12T15:54:46Z

    Initial implementation

commit 397ab1f7b4b4e2b9e51b697c92e3be197fed4554
Author: zero323 <ze...@users.noreply.github.com>
Date:   2017-05-12T17:38:31Z

    Fix style

commit 246b91f8af84115af8f6283fb783000c9cc613ec
Author: zero323 <ze...@users.noreply.github.com>
Date:   2017-05-13T10:08:08Z

    Style

commit 1530785f7469830446cd95717d524eb42d88e4ab
Author: zero323 <ze...@users.noreply.github.com>
Date:   2017-05-13T10:38:50Z

    Rename broadcast_ to broadcastRDD

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/17965
  
    **[Test build #76912 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76912/testReport)** for PR 17965 at commit [`d6c3435`](https://github.com/apache/spark/commit/d6c343538a731a45353bec5a7c290a201756039b).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by zero323 <gi...@git.apache.org>.
Github user zero323 commented on the issue:

    https://github.com/apache/spark/pull/17965
  
    Points to discuss:
    
    - Do we really need this. It gives us full API parity but is not strictly necessary. `hint(df, "broadcast")` should be equivalent.
    - Is this the best implementation? Some alternatives:
    
        - Use generics for both and `signature(x = "SparkDataFrame", "missing")` for `DataFrame` version and `signature(x = "jobj", object = "Any")` for general version. This would keep internal API intact, but is hard to document without leaking internal details.
    
        - Use different name for `DataFrame` version, for example `broadcast_table`.  This is a bit verbose, and slightly harder to port for users.
    
    - Is `dataframe.R` the best location? It is generic on `SparkDataFrame` so `functions.R` don't feel like a right choice. 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by felixcheung <gi...@git.apache.org>.
Github user felixcheung commented on a diff in the pull request:

    https://github.com/apache/spark/pull/17965#discussion_r116350189
  
    --- Diff: R/pkg/R/DataFrame.R ---
    @@ -3769,3 +3769,33 @@ setMethod("alias",
                 sdf <- callJMethod(object@sdf, "alias", data)
                 dataFrame(sdf)
               })
    +
    +
    +#' broadcast
    +#' 
    +#' Return a new SparkDataFrame marked as small enough for use in broadcast joins. 
    +#' 
    +#' Equivalent to hint(x, "broadcast).
    --- End diff --
    
    `"broadcast"`


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/17965
  
    **[Test build #76875 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76875/testReport)** for PR 17965 at commit [`397ab1f`](https://github.com/apache/spark/commit/397ab1f7b4b4e2b9e51b697c92e3be197fed4554).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/17965
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76898/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by felixcheung <gi...@git.apache.org>.
Github user felixcheung commented on a diff in the pull request:

    https://github.com/apache/spark/pull/17965#discussion_r116386883
  
    --- Diff: R/pkg/R/generics.R ---
    @@ -799,6 +799,10 @@ setGeneric("write.df", function(df, path = NULL, ...) { standardGeneric("write.d
     #' @export
     setGeneric("randomSplit", function(x, weights, seed) { standardGeneric("randomSplit") })
     
    +#' @rdname broadcast
    +#' @export
    +setGeneric("broadcast", function(x) { standardGeneric("broadcast") })
    --- End diff --
    
    ouch it is `#` and not `#'`
    
    let's leave the sorting for now. we really need to stick with one method


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by zero323 <gi...@git.apache.org>.
Github user zero323 closed the pull request at:

    https://github.com/apache/spark/pull/17965


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/17965
  
    Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by felixcheung <gi...@git.apache.org>.
Github user felixcheung commented on a diff in the pull request:

    https://github.com/apache/spark/pull/17965#discussion_r116350368
  
    --- Diff: R/pkg/R/context.R ---
    @@ -258,15 +258,15 @@ includePackage <- function(sc, pkg) {
     #'
     #' # Large Matrix object that we want to broadcast
     #' randomMat <- matrix(nrow=100, ncol=10, data=rnorm(1000))
    -#' randomMatBr <- broadcast(sc, randomMat)
    +#' randomMatBr <- broadcast_(sc, randomMat)
     #'
     #' # Use the broadcast variable inside the function
     #' useBroadcast <- function(x) {
     #'   sum(value(randomMatBr) * x)
     #' }
     #' sumRDD <- lapply(rdd, useBroadcast)
     #'}
    -broadcast <- function(sc, object) {
    +broadcast_ <- function(sc, object) {
    --- End diff --
    
    please change this to `broadcastRDD` like other functions


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by zero323 <gi...@git.apache.org>.
Github user zero323 commented on a diff in the pull request:

    https://github.com/apache/spark/pull/17965#discussion_r116355102
  
    --- Diff: R/pkg/R/DataFrame.R ---
    @@ -3769,3 +3769,33 @@ setMethod("alias",
                 sdf <- callJMethod(object@sdf, "alias", data)
                 dataFrame(sdf)
               })
    +
    +
    +#' broadcast
    +#' 
    +#' Return a new SparkDataFrame marked as small enough for use in broadcast joins. 
    +#' 
    +#' Equivalent to hint(x, "broadcast).
    --- End diff --
    
    I double check this but for some reason `\code` here made `roxygen` unhappy when I tried it last time.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/17965
  
    Merged build finished. Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/17965
  
    **[Test build #76874 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76874/testReport)** for PR 17965 at commit [`f190d62`](https://github.com/apache/spark/commit/f190d62460829dcfb84ff1a8e6dd6fe9cbd25719).
     * This patch **fails R style tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by felixcheung <gi...@git.apache.org>.
Github user felixcheung commented on a diff in the pull request:

    https://github.com/apache/spark/pull/17965#discussion_r116350400
  
    --- Diff: R/pkg/R/generics.R ---
    @@ -799,6 +799,10 @@ setGeneric("write.df", function(df, path = NULL, ...) { standardGeneric("write.d
     #' @export
     setGeneric("randomSplit", function(x, weights, seed) { standardGeneric("randomSplit") })
     
    +#' @rdname broadcast
    +#' @export
    +setGeneric("broadcast", function(x) { standardGeneric("broadcast") })
    --- End diff --
    
    there is a rd for `broadcast` already though https://github.com/zero323/spark/blob/397ab1f7b4b4e2b9e51b697c92e3be197fed4554/R/pkg/R/generics.R#L376
    we probably need to remove that one


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/17965
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76875/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by felixcheung <gi...@git.apache.org>.
Github user felixcheung commented on a diff in the pull request:

    https://github.com/apache/spark/pull/17965#discussion_r116350178
  
    --- Diff: R/pkg/R/DataFrame.R ---
    @@ -3769,3 +3769,33 @@ setMethod("alias",
                 sdf <- callJMethod(object@sdf, "alias", data)
                 dataFrame(sdf)
               })
    +
    --- End diff --
    
    nit:  one empty line instead of two


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/17965
  
    **[Test build #76898 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76898/testReport)** for PR 17965 at commit [`1530785`](https://github.com/apache/spark/commit/1530785f7469830446cd95717d524eb42d88e4ab).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/17965
  
    **[Test build #76874 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76874/testReport)** for PR 17965 at commit [`f190d62`](https://github.com/apache/spark/commit/f190d62460829dcfb84ff1a8e6dd6fe9cbd25719).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by asfgit <gi...@git.apache.org>.
Github user asfgit closed the pull request at:

    https://github.com/apache/spark/pull/17965


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/17965
  
    **[Test build #76875 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76875/testReport)** for PR 17965 at commit [`397ab1f`](https://github.com/apache/spark/commit/397ab1f7b4b4e2b9e51b697c92e3be197fed4554).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by zero323 <gi...@git.apache.org>.
Github user zero323 commented on a diff in the pull request:

    https://github.com/apache/spark/pull/17965#discussion_r116355839
  
    --- Diff: R/pkg/R/DataFrame.R ---
    @@ -3769,3 +3769,33 @@ setMethod("alias",
                 sdf <- callJMethod(object@sdf, "alias", data)
                 dataFrame(sdf)
               })
    +
    --- End diff --
    
    Done.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/17965
  
    **[Test build #76912 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76912/testReport)** for PR 17965 at commit [`d6c3435`](https://github.com/apache/spark/commit/d6c343538a731a45353bec5a7c290a201756039b).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/17965
  
    Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/17965
  
    Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by felixcheung <gi...@git.apache.org>.
Github user felixcheung commented on a diff in the pull request:

    https://github.com/apache/spark/pull/17965#discussion_r116350376
  
    --- Diff: R/pkg/R/generics.R ---
    @@ -799,6 +799,10 @@ setGeneric("write.df", function(df, path = NULL, ...) { standardGeneric("write.d
     #' @export
     setGeneric("randomSplit", function(x, weights, seed) { standardGeneric("randomSplit") })
     
    +#' @rdname broadcast
    +#' @export
    +setGeneric("broadcast", function(x) { standardGeneric("broadcast") })
    --- End diff --
    
    this list is sorted alphabetically within this section


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/17965
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76912/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by zero323 <gi...@git.apache.org>.
GitHub user zero323 reopened a pull request:

    https://github.com/apache/spark/pull/17965

     [SPARK-20726][SPARKR] wrapper for SQL broadcast

    ## What changes were proposed in this pull request?
    
    - Adds R wrapper for `o.a.s.sql.functions.broadcast`.
    - Renames `broadcast` to `broadcast_`.
    
    ## How was this patch tested?
    
    Unit tests, check `check-cran.sh`.


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/zero323/spark SPARK-20726

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/17965.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #17965
    
----
commit f190d62460829dcfb84ff1a8e6dd6fe9cbd25719
Author: zero323 <ze...@users.noreply.github.com>
Date:   2017-05-12T15:54:46Z

    Initial implementation

commit 397ab1f7b4b4e2b9e51b697c92e3be197fed4554
Author: zero323 <ze...@users.noreply.github.com>
Date:   2017-05-12T17:38:31Z

    Fix style

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by felixcheung <gi...@git.apache.org>.
Github user felixcheung commented on a diff in the pull request:

    https://github.com/apache/spark/pull/17965#discussion_r116350359
  
    --- Diff: R/pkg/R/DataFrame.R ---
    @@ -3769,3 +3769,33 @@ setMethod("alias",
                 sdf <- callJMethod(object@sdf, "alias", data)
                 dataFrame(sdf)
               })
    +
    +
    +#' broadcast
    +#' 
    +#' Return a new SparkDataFrame marked as small enough for use in broadcast joins. 
    +#' 
    +#' Equivalent to hint(x, "broadcast).
    --- End diff --
    
    `\code{hint(x, "broadcast")}`


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by felixcheung <gi...@git.apache.org>.
Github user felixcheung commented on a diff in the pull request:

    https://github.com/apache/spark/pull/17965#discussion_r116387913
  
    --- Diff: R/pkg/R/generics.R ---
    @@ -799,6 +799,10 @@ setGeneric("write.df", function(df, path = NULL, ...) { standardGeneric("write.d
     #' @export
     setGeneric("randomSplit", function(x, weights, seed) { standardGeneric("randomSplit") })
     
    +#' @rdname broadcast
    +#' @export
    +setGeneric("broadcast", function(x) { standardGeneric("broadcast") })
    --- End diff --
    
    let's fix up the sorting when 2.2.0 is released - it would help to minimize major changes for now to make it easier to merge fixes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by zero323 <gi...@git.apache.org>.
Github user zero323 commented on a diff in the pull request:

    https://github.com/apache/spark/pull/17965#discussion_r116355836
  
    --- Diff: R/pkg/R/generics.R ---
    @@ -799,6 +799,10 @@ setGeneric("write.df", function(df, path = NULL, ...) { standardGeneric("write.d
     #' @export
     setGeneric("randomSplit", function(x, weights, seed) { standardGeneric("randomSplit") })
     
    +#' @rdname broadcast
    +#' @export
    +setGeneric("broadcast", function(x) { standardGeneric("broadcast") })
    --- End diff --
    
    It doesn't seem to affect the docs so I don't think we have to touch this for now:
    
    ![image](https://cloud.githubusercontent.com/assets/1554276/26024791/88a39940-37d9-11e7-9f11-ac1510b59215.png)



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by zero323 <gi...@git.apache.org>.
Github user zero323 commented on a diff in the pull request:

    https://github.com/apache/spark/pull/17965#discussion_r116366145
  
    --- Diff: R/pkg/R/generics.R ---
    @@ -799,6 +799,10 @@ setGeneric("write.df", function(df, path = NULL, ...) { standardGeneric("write.d
     #' @export
     setGeneric("randomSplit", function(x, weights, seed) { standardGeneric("randomSplit") })
     
    +#' @rdname broadcast
    +#' @export
    +setGeneric("broadcast", function(x) { standardGeneric("broadcast") })
    --- End diff --
    
    > this list is sorted alphabetically within this section
    
    Looks like it used to be at some point, but these days are long gone. I can reorder it right now, but this means rearranging a whole section. 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/17965
  
    **[Test build #76898 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76898/testReport)** for PR 17965 at commit [`1530785`](https://github.com/apache/spark/commit/1530785f7469830446cd95717d524eb42d88e4ab).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/17965
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76874/
    Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #17965: [SPARK-20726][SPARKR] wrapper for SQL broadcast

Posted by felixcheung <gi...@git.apache.org>.
Github user felixcheung commented on the issue:

    https://github.com/apache/spark/pull/17965
  
    merged to master


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org