Posted to jira@arrow.apache.org by "Hideaki Hayashi (Jira)" <ji...@apache.org> on 2021/09/02 07:33:00 UTC
[jira] [Created] (ARROW-13860) [R] arrow 5.0.0 write_parquet throws error writing grouped data.frame
Hideaki Hayashi created ARROW-13860:
---------------------------------------
Summary: [R] arrow 5.0.0 write_parquet throws error writing grouped data.frame
Key: ARROW-13860
URL: https://issues.apache.org/jira/browse/ARROW-13860
Project: Apache Arrow
Issue Type: Bug
Environment: macOS 11.1 Big Sur
Reporter: Hideaki Hayashi
In arrow 5.0.0, write_parquet throws an error when it is given a grouped data.frame.
The following code reproduces it.
{{library(dplyr)}}
{{arrow::write_parquet(mtcars %>% group_by(am),"/tmp/mtcars_test.parquet")}}
{{# Error: x must be an object of class 'data.frame', 'RecordBatch', or 'Table', not 'arrow_dplyr_query'.}}
With arrow 4.0.1, the same call worked, and the file read back as a grouped tibble.
{{library(dplyr)}}
{{arrow::write_parquet(mtcars %>% group_by(am),"/tmp/mtcars_test.parquet")}}
{{x <- arrow::read_parquet("/tmp/mtcars_test.parquet")}}
{{x}}
{{# A tibble: 32 x 11}}
{{# Groups: am [2]}}
{{#     mpg   cyl  disp    hp  drat    wt  qsec    vs    am  gear  carb}}
{{# * <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl>}}
{{# 1  21       6   160   110  3.9   2.62  16.5     0     1     4     4}}
{{# 2  21       6   160   110  3.9   2.88  17.0     0     1     4     4}}
{{# 3  22.8     4   108    93  3.85  2.32  18.6     1     1     4     1}}
{{# 4  21.4     6   258   110  3.08  3.22  19.4     1     0     3     1}}
{{# 5  18.7     8   360   175  3.15  3.44  17.0     0     0     3     2}}
{{# 6  18.1     6   225   105  2.76  3.46  20.2     1     0     3     1}}
{{# 7  14.3     8   360   245  3.21  3.57  15.8     0     0     3     4}}
{{# …}}
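A possible workaround, sketched here and not verified against arrow 5.0.0, is to drop the grouping before the write and reapply it after the read. It assumes the error comes from the grouped input being turned into an 'arrow_dplyr_query', and that losing the grouping in the file itself is acceptable. The output path and the variable names are hypothetical, chosen only for illustration.

```r
library(dplyr)

# Hypothetical output path, used only for this sketch.
path <- file.path(tempdir(), "mtcars_ungrouped.parquet")

grouped <- mtcars %>% group_by(am)

# ungroup() returns a plain tibble, so write_parquet() no longer
# receives a grouped data.frame.
arrow::write_parquet(ungroup(grouped), path)

# The grouping is not stored in the file in this sketch,
# so it is reapplied after reading.
x <- arrow::read_parquet(path) %>% group_by(am)
```

Unlike the 4.0.1 behaviour shown above, the grouping has to be restored by the caller, so this only helps when the grouping columns are known at read time.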
--
This message was sent by Atlassian Jira
(v8.3.4#803005)