You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Travis Brady (JIRA)" <ji...@apache.org> on 2018/04/14 01:14:00 UTC

[jira] [Created] (ARROW-2459) pyarrow: Segfault with pyarrow.deserialize_pandas

Travis Brady created ARROW-2459:
-----------------------------------

             Summary: pyarrow: Segfault with pyarrow.deserialize_pandas
                 Key: ARROW-2459
                 URL: https://issues.apache.org/jira/browse/ARROW-2459
             Project: Apache Arrow
          Issue Type: Bug
          Components: Python
         Environment: OS X, Linux
            Reporter: Travis Brady


Following up from [https://github.com/apache/arrow/issues/1884] wherein I found that calling deserialize_pandas in the linked app.py script in the repo linked below causes the app.py process to segfault.

I initially observed this on OS X, but have since confirmed that the behavior exists on Linux as well.

Repo containing example: [https://github.com/travisbrady/sanic-arrow] 

And more generally: what is the right way to get a Java-based HTTP microservice to talk to a Python-based HTTP microservice using Arrow as the serialization format? I'm exchanging DataFrame type objects (they are pandas.DataFrame's on the Python side) between the two services for real-time scoring in a few xgboost models implemented in Python.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)