You are viewing a plain text version of this content. The canonical link for it is here.

Posted to reviews@spark.apache.org by felixcheungu <gi...@git.apache.org> on 2018/08/15 00:43:43 UTC

[GitHub] spark pull request #22107: [SPARK-25117][R] Add EXEPT ALL and INTERSECT ALL ...

Github user felixcheungu commented on a diff in the pull request:

    https://github.com/apache/spark/pull/22107#discussion_r210145398
  
    --- Diff: R/pkg/R/DataFrame.R ---
    @@ -2848,6 +2848,35 @@ setMethod("intersect",
                 dataFrame(intersected)
               })
     
    +#' intersectAll
    +#'
    +#' Return a new SparkDataFrame containing rows in both this SparkDataFrame
    +#' and another SparkDataFrame while preserving the duplicates.
    +#' This is equivalent to \code{INTERSECT ALL} in SQL. Also as standard in
    +#' SQL, this function resolves columns by position (not by name).
    +#'
    +#' @param x a SparkDataFrame.
    +#' @param y a SparkDataFrame.
    +#' @return A SparkDataFrame containing the result of the intersect all operation.
    +#' @family SparkDataFrame functions
    +#' @aliases intersectAll,SparkDataFrame,SparkDataFrame-method
    +#' @rdname intersectAll
    +#' @name intersectAll
    +#' @examples
    +#'\dontrun{
    +#' sparkR.session()
    +#' df1 <- read.json(path)
    +#' df2 <- read.json(path2)
    +#' intersectAllDF <- intersectAll(df1, df2)
    +#' }
    +#' @rdname intersectAll
    +#' @note intersectAll since 2.4
    --- End diff --
    
    please put `2.4.0`


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org