Posted to jira@arrow.apache.org by "Joris Van den Bossche (Jira)" <ji...@apache.org> on 2020/09/24 08:52:00 UTC

[jira] [Commented] (ARROW-9040) [Python][Parquet]"_ParquetDatasetV2" fail to read with columns and use_pandas_metadata=True

    [ https://issues.apache.org/jira/browse/ARROW-9040?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17201374#comment-17201374 ] 

Joris Van den Bossche commented on ARROW-9040:
----------------------------------------------

The mentioned PR was included in the 1.0.0 release, so closing this issue.

> [Python][Parquet]"_ParquetDatasetV2" fail to read with columns and use_pandas_metadata=True
> -------------------------------------------------------------------------------------------
>
>                 Key: ARROW-9040
>                 URL: https://issues.apache.org/jira/browse/ARROW-9040
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Python
>    Affects Versions: 0.17.1
>            Reporter: cmsxbc
>            Priority: Major
>             Fix For: 1.0.0
>
>
> A Parquet file was written by pandas with the default index.
> Calling _ParquetDatasetV2.read(columns=['column'], use_pandas_metadata=True) on it
> raises "TypeError: unhashable type: 'dict'" from
> {code:python}
> index_columns = set(_get_pandas_index_columns(metadata)){code}
> Is this caused by the pandas default index?
>  
>  
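For readers landing on this thread: the question in the description can be illustrated without pyarrow. pandas records a default RangeIndex in the file's "pandas" metadata as a dict descriptor rather than as a column name, and a dict cannot be placed in a set. The snippet below is only a sketch under that assumption. The metadata dict is hand-written for illustration, and named_index_columns is a hypothetical helper, not the change that went into 1.0.0.

```python
# Hand-written stand-in for the "pandas" schema metadata of a file written
# from a DataFrame with the default RangeIndex (illustrative values only).
pandas_metadata = {
    "index_columns": [
        {"kind": "range", "name": None, "start": 0, "stop": 3, "step": 1}
    ],
    "columns": [{"name": "column", "field_name": "column"}],
}


def named_index_columns(metadata):
    """Hypothetical helper: keep only index entries stored as real columns.

    Entries that are plain strings name a column in the file; dict entries
    describe an index (such as a RangeIndex) that is not stored as a column.
    """
    return {c for c in metadata["index_columns"] if isinstance(c, str)}


try:
    # Mirrors the failing line: building a set from a list that holds a dict.
    set(pandas_metadata["index_columns"])
except TypeError as exc:
    print(f"TypeError: {exc}")

# Filtering to string entries first avoids hashing the dict descriptor.
print(named_index_columns(pandas_metadata))
```

With a named index stored as a regular column, index_columns holds strings instead, and the original set(...) call works unchanged.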



--
This message was sent by Atlassian Jira
(v8.3.4#803005)