Posted to dev@arrow.apache.org by "Bryan Cutler (JIRA)" <ji...@apache.org> on 2018/02/06 19:25:00 UTC

[jira] [Created] (ARROW-2101) [Python] from_pandas reads 'str' types as binary Arrow data with Python 2

Bryan Cutler created ARROW-2101:
-----------------------------------

             Summary: [Python] from_pandas reads 'str' types as binary Arrow data with Python 2
                 Key: ARROW-2101
                 URL: https://issues.apache.org/jira/browse/ARROW-2101
             Project: Apache Arrow
          Issue Type: Bug
          Components: Python
    Affects Versions: 0.8.0
            Reporter: Bryan Cutler


Using Python 2, converting Pandas data of 'str' type to Arrow produces Arrow data of binary type, even if the user supplies type information.  Conversion of 'unicode' data works as expected and creates Arrow data of string type.  For example:

{code}
In [25]: pa.Array.from_pandas(pd.Series(['a'])).type
Out[25]: DataType(binary)

In [26]: pa.Array.from_pandas(pd.Series(['a']), type=pa.string()).type
Out[26]: DataType(binary)

In [27]: pa.Array.from_pandas(pd.Series([u'a'])).type
Out[27]: DataType(string)
{code}
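
Since the 'unicode' case above produces string-typed Arrow data, one possible interim workaround is to decode byte strings to text before the conversion. The following is only a sketch: the helper name to_text and the UTF-8 default are assumptions for illustration and are not part of pyarrow or pandas.

```python
def to_text(values, encoding="utf-8"):
    """Return a list in which every byte string is decoded to text.

    Hypothetical helper, not a pyarrow or pandas API. On Python 2,
    `bytes` is an alias for `str`, so plain 'a' literals are decoded
    to `unicode`. Values that are not byte strings (text, None, ...)
    are passed through unchanged.
    """
    return [v.decode(encoding) if isinstance(v, bytes) else v
            for v in values]


# Mixed input: a byte string, a text string, and a missing value.
decoded = to_text([b"a", u"b", None])
```

The decoded values could then be passed on, for example as pa.Array.from_pandas(pd.Series(to_text(values))), which, given the behaviour reported above for 'unicode' input, would be expected to yield a string-typed array. This does not address the ignored type=pa.string() argument itself.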



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)