Posted to jira@arrow.apache.org by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2020/10/08 10:51:00 UTC

[jira] [Closed] (ARROW-10232) FixedSizeListArray is incorrectly written/read to/from parquet

     [ https://issues.apache.org/jira/browse/ARROW-10232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Antoine Pitrou closed ARROW-10232.
----------------------------------
    Resolution: Duplicate

> FixedSizeListArray is incorrectly written/read to/from parquet
> --------------------------------------------------------------
>
>                 Key: ARROW-10232
>                 URL: https://issues.apache.org/jira/browse/ARROW-10232
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Python
>    Affects Versions: 1.0.1
>            Reporter: Simon Perkins
>            Priority: Major
>             Fix For: 2.0.0
>
>
> FixedSizeListArrays seem to be either written to or read from Parquet files incorrectly.
>  
> When the Parquet file is read back, nulls (None values) are returned where the original values should be.
>  
> {code:python}
> import pyarrow as pa
> import pyarrow.parquet as pq
> import numpy as np
> np_data = np.arange(20*4).reshape(20, 4).astype(np.float64)
> pa_data = pa.FixedSizeListArray.from_arrays(np_data.ravel(), 4)
> assert np_data.tolist() == pa_data.tolist()
> schema = pa.schema([pa.field("rectangle", pa_data.type)])
> table = pa.table({"rectangle": pa_data}, schema=schema)
> pq.write_table(table, "test.parquet")
> in_table = pq.read_table("test.parquet")
> # The assertion below fails: the "rectangle" column comes back filled with nulls
> assert in_table.column("rectangle").to_pylist() == pa_data.tolist()
> {code}
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)