You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Steve M. Kim (Jira)" <ji...@apache.org> on 2020/07/29 13:15:00 UTC
[jira] [Created] (ARROW-9594) [Python] DictionaryArray.to_numpy
does not correctly convert null indexes to null values
Steve M. Kim created ARROW-9594:
-----------------------------------
Summary: [Python] DictionaryArray.to_numpy does not correctly convert null indexes to null values
Key: ARROW-9594
URL: https://issues.apache.org/jira/browse/ARROW-9594
Project: Apache Arrow
Issue Type: Bug
Affects Versions: 1.0.0
Reporter: Steve M. Kim
Example
{code:java}
>>> a = pa.DictionaryArray.from_arrays(pa.array([0, 1, None, 0], type=pa.int32()), pa.array(['foo', 'bar']))
>>> a
<pyarrow.lib.DictionaryArray object at 0x7f12fc94ccf0>-- dictionary:
[
"foo",
"bar"
]
-- indices:
[
0,
1,
null,
0
]
>>> a.to_pandas() # this works
0 foo
1 bar
2 NaN
3 foo
dtype: category
Categories (2, object): [foo, bar]
>>> a.to_numpy(zero_copy_only=False) # this is broken
array(['foo', 'bar', 'bar', 'foo'], dtype=object)
{code}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)