You are viewing a plain text version of this content. The canonical link for it is here.
Posted to notifications@superset.apache.org by GitBox <gi...@apache.org> on 2020/03/08 20:54:19 UTC

[GitHub] [incubator-superset] uros-tm opened a new issue #8843: Unable to build on Alpine Linux with PyArrow 0.15.1

uros-tm opened a new issue #8843: Unable to build on Alpine Linux with PyArrow 0.15.1
URL: https://github.com/apache/incubator-superset/issues/8843
 
 
   PyArrow dep was [bumped to 0.15.1 recently](https://github.com/apache/incubator-superset/pull/8583). Unfortunately the installation of this version errors out on Alpine as below.
   
   ### Expected results
   
   Successful installation on Alpine of PyArrow, subsequently Superset..
   
   ### Actual results
   
   Have tried 2 routes: 1. a direct pip install, and 2. a manual build of PyArrow 0.15.1.
   
   1. Direct pip3 install of `pyarrow==0.15.1` fails as follows, even though numpy was installed:
   ```
     Configured for RELEASE build (set with cmake -DCMAKE_BUILD_TYPE={release,debug,...})
     -- Build Type: RELEASE
     -- Build output directory: /tmp/pip-install-qe4_90uh/pyarrow/build/temp.linux-x86_64-3.7/release
     -- Found PythonInterp: /usr/local/bin/python (found version "3.7.5")
     -- Searching for Python libs in /usr/local/lib64;/usr/local/lib;/usr/local/lib/python3.7/config-3.7m-x86_64-linux-gnu
     -- Looking for python3.7m
     -- Found Python lib /usr/local/lib/libpython3.7m.so
     -- Found PythonLibs: /usr/local/lib/libpython3.7m.so
     CMake Error at cmake_modules/FindNumPy.cmake:62 (message):
       NumPy import failure:
   
       Traceback (most recent call last):
   
         File "<string>", line 1, in <module>
   
       ModuleNotFoundError: No module named 'numpy'
   
     Call Stack (most recent call first):
       CMakeLists.txt:189 (find_package)
   
   
     -- Configuring incomplete, errors occurred!
     See also "/tmp/pip-install-qe4_90uh/pyarrow/build/temp.linux-x86_64-3.7/CMakeFiles/CMakeOutput.log".
     See also "/tmp/pip-install-qe4_90uh/pyarrow/build/temp.linux-x86_64-3.7/CMakeFiles/CMakeError.log".
     error: command 'cmake' failed with exit status 1
     ----------------------------------------
     ERROR: Failed building wheel for pyarrow
   ```
   
   2. Building from the 0.15.1 tarball fails reproducibly with a flatbuffers_ep compile error, as per [this Apache Arrow ticket](https://github.com/apache/arrow/issues/6028).
   
   *Note* that `flatbuffers_ep` is gone in Arrow since 0.15.1 was cut, and PyArrow builds fine on Alpine from the project's master.
   
   #### How to reproduce
   
   Simple build with pre-reqs: 
   
   ```Dockerfile
   FROM python:3.7-alpine
   
   # Configure environment
   RUN apk add autoconf \
       bash \
       bison \
       boost-dev \
       cmake \
       cython \
       flex \
       g++ \
       gcc \
       make
   
   RUN pip3 install --no-cache-dir \
       six \
       cython \
       numpy
   
   # confirm that numpy is installed
   RUN python -c "import numpy; print(numpy.__version__)"
   # outputs 1.17.4
   
   RUN pip3 install pyarrow==0.15.1
   ```
   
   ### Environment
   
   - superset version: current master
   - python version: 3.7.5
   - node.js version: N/A
   - npm version: N/A
   
   ### Checklist
   
   Make sure these boxes are checked before submitting your issue - thank you!
   
   - [x] I have checked the superset logs for python stacktraces and included it here as text if there are any.
   - [x] I have reproduced the issue with at least the latest released version of superset.
   - [x] I have checked the issue tracker for the same issue and I haven't found one similar.
   
   ### Additional context
   
   Best route for Alpine at this point seems to be
   - stick to 0.14.1, unless affected by the CVE's that triggered the PyArrow version bump
   - move to next PyArrow release once available
   
   Am I missing something? Any other thoughts / pointers?

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: notifications-unsubscribe@superset.apache.org
For additional commands, e-mail: notifications-help@superset.apache.org