You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Wes McKinney (JIRA)" <ji...@apache.org> on 2017/04/30 20:28:04 UTC

[jira] [Comment Edited] (ARROW-376) Python: Convert non-range Pandas indices (optionally) to Arrow

    [ https://issues.apache.org/jira/browse/ARROW-376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15990372#comment-15990372 ] 

Wes McKinney edited comment on ARROW-376 at 4/30/17 8:27 PM:
-------------------------------------------------------------

Removing this from release blocker. We can make 0.3.0.post artifacts if we want to get this out there before the 0.4 release


was (Author: wesmckinn):
Removing this from release blocker. We can release 0.3.0.post if we want to get this out there before the 0.4 release

> Python: Convert non-range Pandas indices (optionally) to Arrow
> --------------------------------------------------------------
>
>                 Key: ARROW-376
>                 URL: https://issues.apache.org/jira/browse/ARROW-376
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Python
>    Affects Versions: 0.2.0
>            Reporter: Uwe L. Korn
>            Assignee: Phillip Cloud
>            Priority: Minor
>             Fix For: 0.4.0
>
>
> Currently the indices of a Pandas DataFrame are totally ignored on the Pandas to Arrow conversion. We should add an option to also convert the index to an Arrow column if they are not a simple range index.
> The condition for a simple index should be {{isinstance(df.index, pd.RangeIndex) && (df.index._start == 0) && (df.index._stop == len(df.index)) && (df.index._step == 1)}}. In this case, we can always skip the index conversion. Otherwise, a new column in the Arrow table shall be created using the index' name as the name of the column. Additionally there should be some metadata annotation of that column that it is derived of an Pandas Index, so that for roundtrips, we'll use it again as the index of a DataFrame.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)