Posted to issues@arrow.apache.org by "Wes McKinney (JIRA)" <ji...@apache.org> on 2019/02/11 19:00:00 UTC

[jira] [Updated] (ARROW-1599) [Python] Unable to read Parquet files with list inside struct

     [ https://issues.apache.org/jira/browse/ARROW-1599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Wes McKinney updated ARROW-1599:
--------------------------------
    Fix Version/s:     (was: 0.13.0)
                   0.14.0

> [Python] Unable to read Parquet files with list inside struct
> -------------------------------------------------------------
>
>                 Key: ARROW-1599
>                 URL: https://issues.apache.org/jira/browse/ARROW-1599
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Python
>    Affects Versions: 0.7.0
>         Environment: Ubuntu
>            Reporter: Jovann Kung
>            Assignee: Joshua Storck
>            Priority: Major
>              Labels: parquet
>             Fix For: 0.14.0
>
>
> Is PyArrow currently unable to read Parquet files that have a vector as a column? For example, the schema of one such file is shown below:
> {{<pyarrow._parquet.ParquetSchema object at 0x7f2d42493c88>
> mbc: FLOAT
> deltae: FLOAT
> labels: FLOAT
> features.type: INT32 INT_8
> features.size: INT32
> features.indices.list.element: INT32
> features.values.list.element: DOUBLE}}
> Using either pq.read_table() or pq.ParquetDataset('/path/to/parquet').read() yields the following error: ArrowNotImplementedError: Currently only nesting with Lists is supported.
> From the error, I assume that this may be implemented in a future release?



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)