You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@arrow.apache.org by "Marco Neumann (Jira)" <ji...@apache.org> on 2019/10/14 10:41:00 UTC

[jira] [Created] (ARROW-6872) [C++][Python] Empty table with dictionary-columns raises ArrowNotImplementedError

Marco Neumann created ARROW-6872:
------------------------------------

             Summary: [C++][Python] Empty table with dictionary-columns raises ArrowNotImplementedError
                 Key: ARROW-6872
                 URL: https://issues.apache.org/jira/browse/ARROW-6872
             Project: Apache Arrow
          Issue Type: Bug
          Components: C++, Python
    Affects Versions: 0.15.0
            Reporter: Marco Neumann


h2. Abstract
As a pyarrow user, I would expect that I can create an empty table out of every schema that I created via pandas. This does not work for dictionary types (e.g. {{"category"}} dtypes).

h2. Test Case
This code:

{code:python}
import pandas as pd
import pyarrow as pa

df = pd.DataFrame({"x": pd.Series(["x", "y"], dtype="category")})
table = pa.Table.from_pandas(df)
schema = table.schema
table_empty = schema.empty_table()  # boom
{code}

produces this exception:

{noformat}
Traceback (most recent call last):
  File "arrow_bug.py", line 8, in <module>
    table_empty = schema.empty_table()
  File "pyarrow/types.pxi", line 860, in __iter__
  File "pyarrow/array.pxi", line 211, in pyarrow.lib.array
  File "pyarrow/array.pxi", line 36, in pyarrow.lib._sequence_to_array
  File "pyarrow/error.pxi", line 86, in pyarrow.lib.check_status
pyarrow.lib.ArrowNotImplementedError: Sequence converter for type dictionary<values=string, indices=int8, ordered=0> not implemented
{noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)