You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Wes McKinney (Jira)" <ji...@apache.org> on 2020/04/30 14:17:00 UTC

[jira] [Closed] (ARROW-8638) Arrow Cython API Usage Gives an error when calling CTable API Endpoints

     [ https://issues.apache.org/jira/browse/ARROW-8638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Wes McKinney closed ARROW-8638.
-------------------------------
    Resolution: Information Provided

Closing since there isn't a bug to fix, further discussion can take place here or on the mailing list

> Arrow Cython API Usage Gives an error when calling CTable API Endpoints
> -----------------------------------------------------------------------
>
>                 Key: ARROW-8638
>                 URL: https://issues.apache.org/jira/browse/ARROW-8638
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: C++, Python
>    Affects Versions: 0.16.0
>         Environment: Ubuntu 20.04 with Python 3.8.2
> RHEL7 with Python 3.6.8
>            Reporter: Vibhatha Lakmal Abeykoon
>            Priority: Blocker
>             Fix For: 0.16.0
>
>
> I am working on using both Arrow C++ API and Cython API to support an application that I am developing. But here, I will add the issue I experienced when I am trying to follow the example, 
> [https://arrow.apache.org/docs/python/extending.html]
> I am testing on Ubuntu 20.04 LTS
> Python version 3.8.2
> These are the steps I followed.
>  # Create Virtualenv
> python3 -m venv ENVARROW
>  
> 2. Activate ENV
> source ENVARROW/bin/activate
>  
> 3. pip3 install pyarrow==0.16.0 cython numpy
>  
>  4. Code block and Tools,
>  
> +*example.pyx*+
>  
>  
> {code:java}
> from pyarrow.lib cimport *
> def get_array_length(obj):
>  # Just an example function accessing both the pyarrow Cython API
>  # and the Arrow C++ API
>  cdef shared_ptr[CArray] arr = pyarrow_unwrap_array(obj)
>  if arr.get() == NULL:
>  raise TypeError("not an array")
>  return arr.get().length()
> def get_table_info(obj):
>  cdef shared_ptr[CTable] table = pyarrow_unwrap_table(obj)
>  if table.get() == NULL:
>  raise TypeError("not an table")
>  
>  return table.get().num_columns() 
> {code}
>  
>  
> +*setup.py*+
>  
>  
> {code:java}
> from distutils.core import setup
> from Cython.Build import cythonize
> import os
> import numpy as np
> import pyarrow as pa
> ext_modules = cythonize("example.pyx")
> for ext in ext_modules:
>  # The Numpy C headers are currently required
>  ext.include_dirs.append(np.get_include())
>  ext.include_dirs.append(pa.get_include())
>  ext.libraries.extend(pa.get_libraries())
>  ext.library_dirs.extend(pa.get_library_dirs())
> if os.name == 'posix':
>  ext.extra_compile_args.append('-std=c++11')
> # Try uncommenting the following line on Linux
>  # if you get weird linker errors or runtime crashes
>  #ext.define_macros.append(("_GLIBCXX_USE_CXX11_ABI", "0"))
> setup(ext_modules=ext_modules)
> {code}
>  
>  
> +*arrow_array.py*+
>  
> {code:java}
> import example
> import pyarrow as pa
> import numpy as np
> arr = pa.array([1,2,3,4,5])
> len = example.get_array_length(arr)
> print("Array length {} ".format(len)) 
> {code}
>  
> +*arrow_table.py*+
>  
> {code:java}
> import example
> import pyarrow as pa
> import numpy as np
> from pyarrow import csv
> fn = 'data.csv'
> table = csv.read_csv(fn)
> print(table)
> cols = example.get_table_info(table)
> print(cols)
>  
> {code}
> +*data.csv*+
> {code:java}
> 1,2,3,4,5
> 6,7,8,9,10
> 11,12,13,14,15
> {code}
>  
> +*Makefile*+
>  
> {code:java}
> install: 
>         python3 setup.py build_ext --inplace
> clean: 
>         rm -R *.so build *.cpp 
> {code}
>  
> **When I try to run either of the python example scripts arrow_table.py or arrow_array.py, 
> I get the following error. 
>  
> {code:java}
> File "arrow_array.py", line 1, in <module>
>  import example
> ImportError: libarrow.so.16: cannot open shared object file: No such file or directory
> {code}
>  
>  
> *Note: I also checked this on RHEL7 with Python 3.6.8, I got a similar response.* 
>  
>  
>  
>  
>  
>  
>  
>  
>  
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)