Posted to jira@arrow.apache.org by "Neal Richardson (Jira)" <ji...@apache.org> on 2022/06/07 13:49:00 UTC

[jira] [Updated] (ARROW-16777) [R] printing data in Table/RecordBatch print method

     [ https://issues.apache.org/jira/browse/ARROW-16777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Neal Richardson updated ARROW-16777:
------------------------------------
    Summary: [R] printing data in Table/RecordBatch print method  (was: printing data in Table/RecordBatch print method)

> [R] printing data in Table/RecordBatch print method
> ---------------------------------------------------
>
>                 Key: ARROW-16777
>                 URL: https://issues.apache.org/jira/browse/ARROW-16777
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Python, R
>            Reporter: Thomas Mock
>            Priority: Minor
>             Fix For: 9.0.0
>
>
> This is related to ARROW-16776; after a brief discussion, Neal Richardson asked me to split the improvement request into separate issues.
> When working with Arrow datasets/tables, I often want to interactively print or "see" the results of a query, or the first few rows of the data, without having to fully collect it into memory.
> It would be ideal for the Table/RecordBatch print methods to lazily print some data. Currently, the print methods return the schema without any data.
> For example:
> ``` r
> library(dplyr)
> library(arrow)
> mtcars %>% arrow::write_parquet("mtcars.parquet")
> car_ds <- arrow::open_dataset("mtcars.parquet")
> car_ds
> #> FileSystemDataset with 1 Parquet file
> #> mpg: double
> #> cyl: double
> #> disp: double
> #> hp: double
> #> drat: double
> #> wt: double
> #> qsec: double
> #> vs: double
> #> am: double
> #> gear: double
> #> carb: double
> #> 
> #> See $metadata for additional Schema metadata
> car_ds %>%
>   compute()
> #> Table
> #> 32 rows x 11 columns
> #> $mpg <double>
> #> $cyl <double>
> #> $disp <double>
> #> $hp <double>
> #> $drat <double>
> #> $wt <double>
> #> $qsec <double>
> #> $vs <double>
> #> $am <double>
> #> $gear <double>
> #> $carb <double>
> #> 
> #> See $metadata for additional Schema metadata
> ```
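
For comparison with the output above, here is a minimal sketch of how a few rows can be previewed by hand today. It is not part of the proposed print method. It assumes that `head()` on a Dataset restricts the query before `collect()` pulls the result into R, and the temporary file path and the `preview` variable are illustrative names only.

``` r
library(dplyr)
library(arrow)

# Write the example data to a temporary Parquet file (path is illustrative).
path <- tempfile(fileext = ".parquet")
write_parquet(mtcars, path)
car_ds <- open_dataset(path)

# Limit the query to five rows, then collect just that result
# into an R data frame for printing.
preview <- car_ds %>%
  head(5) %>%
  collect()

print(preview)
```

This yields a regular data frame with five rows and all eleven columns, which prints with data, unlike the schema-only output of the Dataset and Table print methods.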



--
This message was sent by Atlassian Jira
(v8.20.7#820007)