You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "srinathtcs1993 (via GitHub)" <gi...@apache.org> on 2023/04/12 12:02:14 UTC
[GitHub] [arrow] srinathtcs1993 opened a new issue, #35072: Issue while try adding Pyarrow dependency to Docker file
srinathtcs1993 opened a new issue, #35072:
URL: https://github.com/apache/arrow/issues/35072
### Describe the bug, including details regarding any error messages, version, and platform.
I am trying to add pyarrow to my docker file as a dependency. I am getting an issue as below
CMake Error at thrift_ep-stamp/thrift_ep-download-RELEASE-impl.cmake:9 (message):
Command failed (1):
'/usr/bin/cmake' '-P' '/arrow/cpp/build/thrift_ep-prefix/src/thrift_ep-stamp/download-thrift_ep.cmake'
[91mmake[2]: *** [CMakeFiles/thrift_ep.dir/build.make:99: thrift_ep-prefix/src/thrift_ep-stamp/thrift_ep-download] Error 1
[0m[91mmake[1]: *** [CMakeFiles/Makefile2:828: CMakeFiles/thrift_ep.dir/all] Error 2
make[1]: *** Waiting for unfinished jobs....
[0m-- zstd_ep build command succeeded. See also /arrow/cpp/build/zstd_ep-prefix/src/zstd_ep-stamp/zstd_ep-build-*.log
[ 16%] Performing install step for 'zstd_ep'
-- zstd_ep install command succeeded. See also /arrow/cpp/build/zstd_ep-prefix/src/zstd_ep-stamp/zstd_ep-install-*.log
[ 16%] Completed 'zstd_ep'
[ 16%] Built target zstd_ep
[91mmake: *** [Makefile:146: all] Error 2
[0mThe command '/bin/sh -c mkdir /arrow && wget -q [https://github.com/apache/arrow/archive/apache-arrow-${ARROW_VERSION}.tar.gz](https://github.com/apache/arrow/archive/apache-arrow-$%7BARROW_VERSION%7D.tar.gz) -O /tmp/apache-arrow.tar.gz && echo "${ARROW_SHA1} *apache-arrow.tar.gz" | sha1sum /tmp/apache-arrow.tar.gz && tar -xvf /tmp/apache-arrow.tar.gz -C /arrow --strip-components 1 && mkdir -p /arrow/cpp/build && cd /arrow/cpp/build && cmake -DCMAKE_BUILD_TYPE=$ARROW_BUILD_TYPE -DOPENSSL_ROOT_DIR=/usr/local/ssl -DCMAKE_INSTALL_LIBDIR=lib -DCMAKE_INSTALL_PREFIX=$ARROW_HOME -DARROW_WITH_BZ2=ON -DARROW_WITH_ZLIB=ON -DARROW_WITH_ZSTD=ON -DARROW_WITH_LZ4=ON -DARROW_WITH_SNAPPY=ON -DARROW_PARQUET=ON -DARROW_PYTHON=ON -DARROW_PLASMA=ON -DARROW_BUILD_TESTS=OFF .. && make -j$(nproc) && make install && cd /arrow/python && python setup.py build_e
xt --build-type=$ARROW_BUILD_TYPE --with-parquet && python setup.py install && rm -rf /arrow /tmp/apache-arrow.tar.gz' returned a non-zero code: 2
This is my docker build file code
FROM python:3.8-alpine3.16
RUN apk update \
&& apk upgrade \
&& apk add --no-cache build-base \
autoconf \
bash \
bison \
boost-dev \
cmake \
flex \
libressl-dev \
zlib-dev
RUN apk upgrade expat
RUN pip install --no-cache-dir six numpy cython
ARG ARROW_VERSION=3.0.0
ARG ARROW_SHA1=c1fed962cddfab1966a0e03461376ebb28cf17d3
ARG ARROW_BUILD_TYPE=release
ENV ARROW_HOME=/usr/local \
PARQUET_HOME=/usr/local
#Download and build apache-arrow
RUN mkdir /arrow \
&& wget -q https://github.com/apache/arrow/archive/apache-arrow-${ARROW_VERSION}.tar.gz -O /tmp/apache-arrow.tar.gz \
&& echo "${ARROW_SHA1} *apache-arrow.tar.gz" | sha1sum /tmp/apache-arrow.tar.gz \
&& tar -xvf /tmp/apache-arrow.tar.gz -C /arrow --strip-components 1 \
&& mkdir -p /arrow/cpp/build \
&& cd /arrow/cpp/build \
&& cmake -DCMAKE_BUILD_TYPE=$ARROW_BUILD_TYPE \
-DOPENSSL_ROOT_DIR=/usr/local/ssl \
-DCMAKE_INSTALL_LIBDIR=lib \
-DCMAKE_INSTALL_PREFIX=$ARROW_HOME \
-DARROW_WITH_BZ2=ON \
-DARROW_WITH_ZLIB=ON \
-DARROW_WITH_ZSTD=ON \
-DARROW_WITH_LZ4=ON \
-DARROW_WITH_SNAPPY=ON \
-DARROW_PARQUET=ON \
-DARROW_PYTHON=ON \
-DARROW_PLASMA=ON \
-DARROW_BUILD_TESTS=OFF \
.. \
&& make -j$(nproc) \
&& make install \
&& cd /arrow/python \
&& python setup.py build_ext --build-type=$ARROW_BUILD_TYPE --with-parquet \
&& python setup.py install \
&& rm -rf /arrow /tmp/apache-arrow.tar.gz
kindly let me know what changes should I do to add pyarrow as a dependency to docker file
### Component(s)
Python
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: issues-unsubscribe@arrow.apache.org.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] srinathtcs1993 commented on issue #35072: Issue while try adding Pyarrow dependency to Docker file
Posted by "srinathtcs1993 (via GitHub)" <gi...@apache.org>.
srinathtcs1993 commented on issue #35072:
URL: https://github.com/apache/arrow/issues/35072#issuecomment-1511052361
Yes, thanks for your support
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] srinathtcs1993 commented on issue #35072: Issue while try adding Pyarrow dependency to Docker file
Posted by "srinathtcs1993 (via GitHub)" <gi...@apache.org>.
srinathtcs1993 commented on issue #35072:
URL: https://github.com/apache/arrow/issues/35072#issuecomment-1506357769
[Pyarrow_installation_full_log.txt](https://github.com/apache/arrow/files/11218371/Pyarrow_installation_full_log.txt)
Attached the full log here.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] kou commented on issue #35072: Issue while try adding Pyarrow dependency to Docker file
Posted by "kou (via GitHub)" <gi...@apache.org>.
kou commented on issue #35072:
URL: https://github.com/apache/arrow/issues/35072#issuecomment-1506227062
Or how about installing the `thrift` package by `apk`?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] srinathtcs1993 commented on issue #35072: Issue while try adding Pyarrow dependency to Docker file
Posted by "srinathtcs1993 (via GitHub)" <gi...@apache.org>.
srinathtcs1993 commented on issue #35072:
URL: https://github.com/apache/arrow/issues/35072#issuecomment-1506412083
Yes this is correct
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] srinathtcs1993 commented on issue #35072: Issue while try adding Pyarrow dependency to Docker file
Posted by "srinathtcs1993 (via GitHub)" <gi...@apache.org>.
srinathtcs1993 commented on issue #35072:
URL: https://github.com/apache/arrow/issues/35072#issuecomment-1510843342
Thanks for your suggestion. thrift-dev is working fine, but need to understand why the hash comparison got failed for thrift, since that was working fine couple of months ago,
I have a doubt whether thrift-dev may also end up having the same hash comparison failure in near future.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] kou commented on issue #35072: Issue while try adding Pyarrow dependency to Docker file
Posted by "kou (via GitHub)" <gi...@apache.org>.
kou commented on issue #35072:
URL: https://github.com/apache/arrow/issues/35072#issuecomment-1510955221
> need to understand why the hash comparison got failed for thrift, since that was working fine couple of months ago,
I think that we can't help you because I think that it's a your environment problem.
> I have a doubt whether thrift-dev may also end up having the same hash comparison failure in near future.
It must not happen. If you install `thrift-dev`, Apache Arrow C++ doesn't use bundled Apache Thrift. It means that you don't need to download Apache Thrift source archive and validate it by yourself.
Can we close this issue?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] kou commented on issue #35072: Issue while try adding Pyarrow dependency to Docker file
Posted by "kou (via GitHub)" <gi...@apache.org>.
kou commented on issue #35072:
URL: https://github.com/apache/arrow/issues/35072#issuecomment-1505835380
3.0.0 is too old. Could you try recent version?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] kou commented on issue #35072: Issue while try adding Pyarrow dependency to Docker file
Posted by "kou (via GitHub)" <gi...@apache.org>.
kou commented on issue #35072:
URL: https://github.com/apache/arrow/issues/35072#issuecomment-1506414934
Then why did `thrift-0.12.0.tar.gz` downloaded from your network have wrong MD5 hash (`ff9af01fec424b5a279fa8a3c9e95c0c` not `3deebbb4d1ca77dd9c9e009a1ea02183`)?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] kou commented on issue #35072: Issue while try adding Pyarrow dependency to Docker file
Posted by "kou (via GitHub)" <gi...@apache.org>.
kou commented on issue #35072:
URL: https://github.com/apache/arrow/issues/35072#issuecomment-1506225612
Could you attach full error log?
It seems that Apache Thrift download error is happen.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] srinathtcs1993 commented on issue #35072: Issue while try adding Pyarrow dependency to Docker file
Posted by "srinathtcs1993 (via GitHub)" <gi...@apache.org>.
srinathtcs1993 commented on issue #35072:
URL: https://github.com/apache/arrow/issues/35072#issuecomment-1506362338
Apache Thrift is downloaded, but after that it is trying compare the hash value and due to mismatch it is failing
-- Using src='http://www.apache.org/dyn/closer.cgi?action=download&filename=/thrift/0.12.0/thrift-0.12.0.tar.gz'
-- Using src='https://downloads.apache.org/thrift/0.12.0/thrift-0.12.0.tar.gz'
-- Using src='https://github.com/apache/thrift/archive/v0.12.0.tar.gz'
-- verifying file...
file='/arrow/cpp/build/thrift_ep-prefix/src/thrift-0.12.0.tar.gz'
-- MD5 hash of
/arrow/cpp/build/thrift_ep-prefix/src/thrift-0.12.0.tar.gz
does not match expected value
expected: '3deebbb4d1ca77dd9c9e009a1ea02183'
actual: 'ff9af01fec424b5a279fa8a3c9e95c0c'
-- Hash mismatch, removing...
-- Using src='https://apache.claz.org/thrift/0.12.0/thrift-0.12.0.tar.gz'
-- Using src='https://github.com/apache/thrift/archive/v0.12.0.tar.gz'
-- verifying file...
file='/arrow/cpp/build/thrift_ep-prefix/src/thrift-0.12.0.tar.gz'
-- MD5 hash of
/arrow/cpp/build/thrift_ep-prefix/src/thrift-0.12.0.tar.gz
does not match expected value
expected: '3deebbb4d1ca77dd9c9e009a1ea02183'
actual: 'ff9af01fec424b5a279fa8a3c9e95c0c'
-- Hash mismatch, removing...
-- Using src='https://apache.cs.utah.edu/thrift/0.12.0/thrift-0.12.0.tar.gz'
-- Using src='https://github.com/apache/thrift/archive/v0.12.0.tar.gz'
CMAKE is taking the hash value from this attribute, DARROW_THRIFT_BUILD_MD5_CHECKSUM, not sure how to skip this hash value comparison or override a value which we require to make that hash comparison successful
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] kou commented on issue #35072: Issue while try adding Pyarrow dependency to Docker file
Posted by "kou (via GitHub)" <gi...@apache.org>.
kou commented on issue #35072:
URL: https://github.com/apache/arrow/issues/35072#issuecomment-1506538743
Ah, sorry. `thrift-dev` not `thrift`.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] srinathtcs1993 commented on issue #35072: Issue while try adding Pyarrow dependency to Docker file
Posted by "srinathtcs1993 (via GitHub)" <gi...@apache.org>.
srinathtcs1993 commented on issue #35072:
URL: https://github.com/apache/arrow/issues/35072#issuecomment-1506502111
I have tried installing thrift using apk
![image](https://user-images.githubusercontent.com/34440654/231689109-60ddcacd-885c-45fb-9c9b-acb1a2bf402d.png)
but getting error again
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] kou commented on issue #35072: Issue while try adding Pyarrow dependency to Docker file
Posted by "kou (via GitHub)" <gi...@apache.org>.
kou commented on issue #35072:
URL: https://github.com/apache/arrow/issues/35072#issuecomment-1506535243
I can't help you without log.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] srinathtcs1993 commented on issue #35072: Issue while try adding Pyarrow dependency to Docker file
Posted by "srinathtcs1993 (via GitHub)" <gi...@apache.org>.
srinathtcs1993 commented on issue #35072:
URL: https://github.com/apache/arrow/issues/35072#issuecomment-1506222236
I tried with 4.0.0 till 11.0.0 version, but all throws the same error.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] kou commented on issue #35072: Issue while try adding Pyarrow dependency to Docker file
Posted by "kou (via GitHub)" <gi...@apache.org>.
kou commented on issue #35072:
URL: https://github.com/apache/arrow/issues/35072#issuecomment-1506383391
Is your network correct?
`3deebbb4d1ca77dd9c9e009a1ea02183` is the correct MD5 hash: https://archive.apache.org/dist//thrift/0.12.0/thrift-0.12.0.tar.gz.md5
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] kou commented on issue #35072: Issue while try adding Pyarrow dependency to Docker file
Posted by "kou (via GitHub)" <gi...@apache.org>.
kou commented on issue #35072:
URL: https://github.com/apache/arrow/issues/35072#issuecomment-1506463907
> Or how about installing the `thrift` package by `apk`?
may help you.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] srinathtcs1993 commented on issue #35072: Issue while try adding Pyarrow dependency to Docker file
Posted by "srinathtcs1993 (via GitHub)" <gi...@apache.org>.
srinathtcs1993 commented on issue #35072:
URL: https://github.com/apache/arrow/issues/35072#issuecomment-1506438946
Not sure of that
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [arrow] raulcd closed issue #35072: Issue while try adding Pyarrow dependency to Docker file
Posted by "raulcd (via GitHub)" <gi...@apache.org>.
raulcd closed issue #35072: Issue while try adding Pyarrow dependency to Docker file
URL: https://github.com/apache/arrow/issues/35072
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: issues-unsubscribe@arrow.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org