You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by "abcbarryn (via GitHub)" <gi...@apache.org> on 2023/03/01 07:02:48 UTC

[GitHub] [arrow] abcbarryn commented on issue #34277: [Python] pyarrow does not disable SIMD CPU optimizations when set to do so.

abcbarryn commented on issue #34277:
URL: https://github.com/apache/arrow/issues/34277#issuecomment-1449455571

   The google drive link is to a folder that contains both the whl and the source code for the python folder. I used gcc 7.1.1 to build that source. I the main change I ended up making was to the python/CMakeLists.txt, changing both setting for SIMD level to "NONE". To build it portably, you will also need a Linux system running a fairly old distribution.
   
   This is the build script I used for the cpp folder...
   
   ```
   #!/bin/sh
   
   mkdir cpp/build
   pushd cpp/build
   cmake -DCMAKE_INSTALL_PREFIX=/usr/local \
   	-DCMAKE_INSTALL_LIBDIR=lib64 \
   	-DARROW_BUILD_SHARED=ON \
   	-DARROW_SIMD_LEVEL=NONE \
   	-DARROW_OPENSSL_USE_SHARED=OFF \
   	-DARROW_PYTHON=ON \
   	-DARROW_POSITION_INDEPENDENT_CODE=ON \
   	-DARROW_COMPUTE=ON \
   	-DARROW_JEMALLOC_USE_SHARED=OFF \
   	-DARROW_CSV=ON \
   	-DARROW_DATASET=ON \
   	-DARROW_FILESYSTEM=ON \
   	-DARROW_HDFS=ON \
   	-DARROW_JSON=ON \
   	-DARROW_PARQUET=ON \
   	-DARROW_WITH_BROTLI=ON \
   	-DARROW_WITH_BZ2=ON \
   	-DARROW_WITH_LZ4=ON \
   	-DARROW_WITH_SNAPPY=ON \
   	-DARROW_WITH_ZLIB=ON \
   	-DARROW_WITH_ZSTD=ON \
   	-DPARQUET_REQUIRE_ENCRYPTION=ON \
   	..
   make -j4
   make install
   popd
   ```
   
   Then I ran this script to build the python (pyarrow) part...
   
   ```
   #!/bin/sh
   
   export PYARROW_PARALLEL=4
   
   mkdir python/build
   pushd python/build
   cmake -DCMAKE_INSTALL_PREFIX=/usr/local \
   	-DCMAKE_BUILD_TYPE=RELEASE \
   	-DCMAKE_INSTALL_LIBDIR=lib64 \
   	-DARROW_SIMD_LEVEL=NONE \
   	-DARROW_OPENSSL_USE_SHARED=OFF \
   	-DARROW_PYTHON_INCLUDE_DIR=/usr/local/include \
   	-DARROW_PYTHON_LIB_DIR=/usr/local/lib64 \
   	-DARROW_BUILD_SHARED=ON \
   	-DARROW_POSITION_INDEPENDENT_CODE=ON \
   	-DARROW_COMPUTE=ON \
   	-DARROW_JEMALLOC_USE_SHARED=OFF \
   	-DARROW_CSV=ON \
   	-DARROW_DATASET=ON \
   	-DARROW_FILESYSTEM=ON \
   	-DARROW_HDFS=ON \
   	-DARROW_JSON=ON \
   	-DARROW_PARQUET=ON \
   	-DARROW_WITH_BROTLI=ON \
   	-DARROW_WITH_BZ2=ON \
   	-DARROW_WITH_LZ4=ON \
   	-DARROW_WITH_SNAPPY=ON \
   	-DARROW_WITH_ZLIB=ON \
   	-DARROW_WITH_ZSTD=ON \
   	-DPARQUET_REQUIRE_ENCRYPTION=ON \
   	..
   make -j4
   
   popd
   cd python
   python setup.py build_ext --bundle-arrow-cpp bdist_wheel
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org