You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Wes McKinney (JIRA)" <ji...@apache.org> on 2017/11/14 03:48:00 UTC
[jira] [Resolved] (ARROW-1743) [Python] Table to_pandas fails when
index contains categorical column
[ https://issues.apache.org/jira/browse/ARROW-1743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Wes McKinney resolved ARROW-1743.
---------------------------------
Resolution: Fixed
Issue resolved by pull request 1260
[https://github.com/apache/arrow/pull/1260]
> [Python] Table to_pandas fails when index contains categorical column
> ---------------------------------------------------------------------
>
> Key: ARROW-1743
> URL: https://issues.apache.org/jira/browse/ARROW-1743
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Affects Versions: 0.7.1
> Reporter: Brian Pendleton
> Assignee: Licht Takeuchi
> Labels: pull-request-available
> Fix For: 0.8.0
>
>
> Categorical columns in the index of a dataframe are causing a roundtrip failure.
> {code}
> >>> df = pd.DataFrame({'a': [1, 2, 3], 'b': [1, 2, 3]})
> >>> df['a'] = df.a.astype('category')
> >>> df = df.set_index('a')
> >>> tbl = pa.Table.from_pandas(df)
> >>> tbl.to_pandas()
> Traceback (most recent call last):
> File "<stdin>", line 1, in <module>
> File "table.pxi", line 881, in pyarrow.lib.Table.to_pandas
> File "C:\Users\bpendlet\Miniconda3\envs\panpy3\lib\site-packages\pyarrow\pandas_compat.py", line 303, in table_to_blockmanager
> if not values.flags.writeable:
> AttributeError: 'Categorical' object has no attribute 'flags'
> {code}
> Works as expected when you don't change have the categorical:
> {code}
> >>> df = pd.DataFrame({'a': [1, 2, 3], 'b': [1, 2, 3]})
> >>> df = df.set_index('a')
> >>> tbl = pa.Table.from_pandas(df)
> >>> tbl.to_pandas()
> b
> a
> 1 1
> 2 2
> 3 3
> {code}
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)