Posted to jira@arrow.apache.org by "Nicola Crane (Jira)" <ji...@apache.org> on 2022/09/12 10:13:00 UTC

[jira] [Resolved] (ARROW-17639) [R] infer_type() fails for lists where the first element is NULL

     [ https://issues.apache.org/jira/browse/ARROW-17639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Nicola Crane resolved ARROW-17639.
----------------------------------
    Fix Version/s: 10.0.0
       Resolution: Fixed

Issue resolved by pull request 14062
[https://github.com/apache/arrow/pull/14062]

> [R] infer_type() fails for lists where the first element is NULL
> ----------------------------------------------------------------
>
>                 Key: ARROW-17639
>                 URL: https://issues.apache.org/jira/browse/ARROW-17639
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: R
>    Affects Versions: 9.0.0
>         Environment: Ubuntu 18.04; R 4.1.1; arrow 9.0
>            Reporter: David
>            Assignee: Nicola Crane
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 10.0.0
>
>          Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
> * Works
> reticulate::py_run_string("
> import pandas as pd
> df = pd.DataFrame({'col1': [[1,2], None, [3,4]]})
> df.to_parquet('/tmp/test1.parquet')
> ")
> df1 <- arrow::read_parquet("/tmp/test1.parquet")
> arrow::write_parquet(df1, tempfile(fileext = ".parquet"))
> * Fails in arrow 9.0; works in arrow 5.0
> reticulate::py_run_string("
> import pandas as pd
> df = pd.DataFrame({'col1': [None, [1,2], [3,4]]})
> df.to_parquet('/tmp/test2.parquet')
> ")
> df2 <- arrow::read_parquet("/tmp/test2.parquet")
> arrow::write_parquet(df2, tempfile(fileext = ".parquet"))



--
This message was sent by Atlassian Jira
(v8.20.10#820010)