You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@spark.apache.org by do...@apache.org on 2020/08/19 14:27:32 UTC

[spark] branch branch-3.0 updated: [SPARK-32451][R][3.0] Support Apache Arrow 1.0.0

This is an automated email from the ASF dual-hosted git repository.

dongjoon pushed a commit to branch branch-3.0
in repository https://gitbox.apache.org/repos/asf/spark.git


The following commit(s) were added to refs/heads/branch-3.0 by this push:
     new 2b41bc9  [SPARK-32451][R][3.0] Support Apache Arrow 1.0.0
2b41bc9 is described below

commit 2b41bc928199df6d546ef055e233b9c58b019b38
Author: HyukjinKwon <gu...@apache.org>
AuthorDate: Wed Aug 19 07:22:52 2020 -0700

    [SPARK-32451][R][3.0] Support Apache Arrow 1.0.0
    
    ### What changes were proposed in this pull request?
    
    This PR ports back https://github.com/apache/spark/pull/29252 to support Arrow 1.0.0.
    
    Currently, SparkR with Arrow tests fails with the latest Arrow version in branch-3.0, see https://github.com/apache/spark/pull/29460/checks?check_run_id=996972267
    
    ### Why are the changes needed?
    
    To support higher Arrow R version with SparkR.
    
    ### Does this PR introduce _any_ user-facing change?
    
    Yes, users will be able to use SparkR with Arrow 1.0.0+.
    
    ### How was this patch tested?
    
    Manually tested, GitHub Actions will test it.
    
    Closes #29462 from HyukjinKwon/SPARK-32451-3.0.
    
    Authored-by: HyukjinKwon <gu...@apache.org>
    Signed-off-by: Dongjoon Hyun <do...@apache.org>
---
 R/pkg/R/DataFrame.R | 7 ++++++-
 1 file changed, 6 insertions(+), 1 deletion(-)

diff --git a/R/pkg/R/DataFrame.R b/R/pkg/R/DataFrame.R
index f9a07f6..3a60d1c 100644
--- a/R/pkg/R/DataFrame.R
+++ b/R/pkg/R/DataFrame.R
@@ -1231,9 +1231,14 @@ setMethod("collect",
                 authSecret <- portAuth[[2]]
                 conn <- socketConnection(
                   port = port, blocking = TRUE, open = "wb", timeout = connectionTimeout)
+                version <- packageVersion("arrow")
                 output <- tryCatch({
                   doServerAuth(conn, authSecret)
-                  arrowTable <- arrow::read_arrow(readRaw(conn))
+                  if (version$minor >= 17 || version$major >= 1) {
+                    arrowTable <- arrow::read_ipc_stream(readRaw(conn))
+                  } else {
+                    arrowTable <- arrow::read_arrow(readRaw(conn))
+                  }
                   as.data.frame(arrowTable, stringsAsFactors = stringsAsFactors)
                 }, finally = {
                   close(conn)


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@spark.apache.org
For additional commands, e-mail: commits-help@spark.apache.org