You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@arrow.apache.org by "jacques (JIRA)" <ji...@apache.org> on 2018/05/17 09:02:00 UTC

[jira] [Created] (ARROW-2598) [Python] table.to_pandas segfault

jacques created ARROW-2598:
------------------------------

             Summary: [Python]  table.to_pandas segfault
                 Key: ARROW-2598
                 URL: https://issues.apache.org/jira/browse/ARROW-2598
             Project: Apache Arrow
          Issue Type: Bug
          Components: Python
            Reporter: jacques


Here is a small snippet which produce a segfault:

{noformat}

In [1]: import pyarrow as pa

In [2]: import pyarrow.parquet as pq

In [3]: pa_ar = pa.array([[], []])

In [4]: pq.write_table(
   ...:     table=pa.Table.from_arrays([pa_ar],["test"]),
   ...:     where="test5.parquet",
   ...:     compression="snappy",
   ...:     flavor="spark"
   ...: )

In [5]: pq.read_table("test5.parquet")
Out[5]: 
pyarrow.Table
test: list<item: null>
  child 0, item: null

In [6]: pq.read_table("test5.parquet").to_pydict()
Out[6]: OrderedDict([(u'test', [None, None])])

In [7]: pq.read_table("test5.parquet").to_pandas()
Segmentation fault

 

{noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)