You are viewing a plain text version of this content. The canonical link for it is here.

Posted to jira@arrow.apache.org by "Kouhei Sutou (Jira)" <ji...@apache.org> on 2022/10/19 03:11:00 UTC

[jira] [Updated] (ARROW-17636) [Python] Converting Table to pandas raises NotImplementedError (when table previously saved as partitioned parquet dataset)

     [ https://issues.apache.org/jira/browse/ARROW-17636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Kouhei Sutou updated ARROW-17636:
---------------------------------
    Fix Version/s: 11.0.0
                       (was: 10.0.0)

> [Python] Converting Table to pandas raises NotImplementedError (when table previously saved as partitioned parquet dataset)
> ---------------------------------------------------------------------------------------------------------------------------
>
>                 Key: ARROW-17636
>                 URL: https://issues.apache.org/jira/browse/ARROW-17636
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Python
>    Affects Versions: 9.0.0
>         Environment: Docker container, based on continuumio/anaconda3
> Python 3.9.12
> PyArrow 9.0.0
>            Reporter: Roberto Lobo
>            Assignee: Joris Van den Bossche
>            Priority: Major
>             Fix For: 11.0.0
>
>         Attachments: bug.py
>
>
> When converting a table in which one of the column's type is of DictionaryType (values=int32, indices=int32, ordered=0) the conversion to pandas DataFrame fails with:
> NotImplementedError: dictionary<values=int32, indices=int32, ordered=0>
> The dictionary has this conversion not implmented yet.
> This DictionaryType is used as type when using one of the columns (Int64) as one of the parquet's dataset partition columns.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)