You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@arrow.apache.org by "Roberto Lobo (Jira)" <ji...@apache.org> on 2022/09/06 18:16:00 UTC

[jira] [Created] (ARROW-17636) Converting Table to pandas raises NotImplementedError (when table previously saved as partitioned parquet dataset)

Roberto Lobo created ARROW-17636:
------------------------------------

             Summary: Converting Table to pandas raises NotImplementedError (when table previously saved as partitioned parquet dataset)
                 Key: ARROW-17636
                 URL: https://issues.apache.org/jira/browse/ARROW-17636
             Project: Apache Arrow
          Issue Type: Bug
          Components: Python
    Affects Versions: 9.0.0
         Environment: Docker container, based on continuumio/anaconda3
Python 3.9.12
PyArrow 9.0.0
            Reporter: Roberto Lobo


When converting a table in which one of the column's type is of DictionaryType (values=int32, indices=int32, ordered=0) the conversion to pandas DataFrame fails with:

NotImplementedError: dictionary<values=int32, indices=int32, ordered=0>

The dictionary has this conversion not implmented yet.

This DictionaryType is used as type when using one of the columns (Int64) as one of the parquet's dataset partition columns.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)