Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2022/03/04 20:14:30 UTC

[GitHub] [arrow] paleolimbot commented on pull request #12564: ARROW-15818: [R] Implement initial Substrait consumer in the R bindings

paleolimbot commented on pull request #12564:
URL: https://github.com/apache/arrow/pull/12564#issuecomment-1059487075


   Whee, this works! (Until ARROW-15849 is merged, the arrow/engine/substrait/ headers have to be copied manually.) It currently just prints the output; before this is merged, I think it would be nice to return a `Table` so that we can test the output.
   
   ``` r
   # remotes::install_github("apache/arrow/r#12564")
   # remotes::install_github("voltrondata/substrait-r")
   library(arrow, warn.conflicts = FALSE)
   library(substrait)
   
   # currently works via the ReadRel object (not an Arrow source node),
   # so we need a parquet file to work with
   temp_parquet_file <- tempfile()
   write_parquet(mtcars, temp_parquet_file)
   
   plan <- substrait$Plan$create(
     relations = list(
       substrait$PlanRel$create(
         rel = substrait$Rel$create(
           read = substrait$ReadRel$create(
             base_schema = as_substrait(mtcars, "substrait.NamedStruct"),
             local_files = substrait$ReadRel$LocalFiles$create(
               items = list(
                 substrait$ReadRel$LocalFiles$FileOrFiles$create(
                   uri_file = sprintf("file://%s", temp_parquet_file),
                   format = substrait$ReadRel$LocalFiles$FileOrFiles$FileFormat$FILE_FORMAT_PARQUET
                 )
               )
             )
           )
         )
       )
     )
   )
   
   arrow:::do_exec_plan_substrait(as.raw(plan))
   #> <tag: 0 rows: 32>
   #> ExecBatch
   #>     # Rows: 32
   #>     0: Array[21,21,22.8,21.4,18.7,18.1,14.3,24.4,22.8,19.2,...,15.2,13.3,19.2,27.3,26,30.4,15.8,19.7,15,21.4]
   #>     1: Array[6,6,4,6,8,6,8,4,4,6,...,8,8,8,4,4,4,8,6,8,4]
   #>     2: Array[160,160,108,258,360,225,360,146.7,140.8,167.6,...,304,350,400,79,120.3,95.1,351,145,301,121]
   #>     3: Array[110,110,93,110,175,105,245,62,95,123,...,150,245,175,66,91,113,264,175,335,109]
   #>     4: Array[3.9,3.9,3.85,3.08,3.15,2.76,3.21,3.69,3.92,3.92,...,3.15,3.73,3.08,4.08,4.43,3.77,4.22,3.62,3.54,4.11]
   #>     5: Array[2.62,2.875,2.32,3.215,3.44,3.46,3.57,3.19,3.15,3.44,...,3.435,3.84,3.845,1.935,2.14,1.513,3.17,2.77,3.57,2.78]
   #>     6: Array[16.46,17.02,18.61,19.44,17.02,20.22,15.84,20,22.9,18.3,...,17.3,15.41,17.05,18.9,16.7,16.9,14.5,15.5,14.6,18.6]
   #>     7: Array[0,0,1,1,0,1,0,1,1,1,...,0,0,0,1,0,1,0,0,0,1]
   #>     8: Array[1,1,1,0,0,0,0,0,0,0,...,0,0,0,1,1,1,1,1,1,1]
   #>     9: Array[4,4,4,3,3,3,3,4,4,4,...,3,3,3,4,5,5,5,5,5,4]
   #>     10: Array[4,4,1,1,2,1,4,2,2,4,...,2,4,2,1,2,2,4,6,8,2]
   #>     11: Scalar[0]
   #>     12: Scalar[0]
   #>     13: Scalar[true]
   #> 
   #> <tag: 0 finished>
   ```
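   
   For testing against a `Table`, one possible reference value is the same Parquet file read back directly with `read_parquet(..., as_data_frame = FALSE)`. This is only a sketch of what an expected value could look like, assuming the consumer eventually returns a `Table`; the name `expected` is made up for illustration and nothing here is part of this PR.
   
   ``` r
   library(arrow, warn.conflicts = FALSE)
   
   # Write the same example data used above to a temporary Parquet file
   temp_parquet_file <- tempfile()
   write_parquet(mtcars, temp_parquet_file)
   
   # Read it back as an Arrow Table instead of a data.frame;
   # `expected` is a hypothetical name for the reference result
   expected <- read_parquet(temp_parquet_file, as_data_frame = FALSE)
   
   # Basic shape and column names that a returned Table could be checked against
   expected$num_rows
   expected$num_columns
   names(expected)
   ```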
   
   <sup>Created on 2022-03-04 by the [reprex package](https://reprex.tidyverse.org) (v2.0.1)</sup>

