You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@mxnet.apache.org by GitBox <gi...@apache.org> on 2021/04/27 01:40:01 UTC

[GitHub] [incubator-mxnet] cyrusbehr opened a new issue #20222: Unable to cross compile for aarch64 cuda with USE_CPP_PACKAGE=ON

cyrusbehr opened a new issue #20222:
URL: https://github.com/apache/incubator-mxnet/issues/20222


   This issue is building off of a previous issue [here](https://github.com/apache/incubator-mxnet/issues/20211).
   
   My goal is to cross compile on an x86 host for a aarch64 target, and build mxnet with cuda + cpp support.
   
   As per the suggestion in that previous issue, I am compiling the library in the mxnet `Dockerfile.build.jetson` which can be found [here](https://github.com/apache/incubator-mxnet/blob/e8ad36aeaa5de61ca073acbe598d853f63262721/ci/docker/Dockerfile.build.jetson).
   
   When I use the build commands linked in that example, the compilation succeeds:
   ```
       cmake \
           -DCMAKE_TOOLCHAIN_FILE=${CMAKE_TOOLCHAIN_FILE} \
           -DUSE_CUDA=ON \
           -DMXNET_CUDA_ARCH="5.2" \
           -DUSE_OPENCV=OFF \
           -DUSE_OPENMP=ON \
           -DUSE_LAPACK=OFF \
           -DUSE_BLAS=Open \
           -DCMAKE_BUILD_TYPE=Release \
           -G Ninja /work/mxnet
   ```
   
   However, I need to build the CPP package.
   I am therefore using the following cmake command:
   
   ```
     cmake\
    -DCMAKE_TOOLCHAIN_FILE=${CMAKE_TOOLCHAIN_FILE} \
    -DUSE_OPENMP=ON \
    -DUSE_BLAS=Open \
    -DUSE_CUDA=ON\
    -DUSE_CUDNN=ON\
    -DMXNET_CUDA_ARCH="5.3;6.2;7.2"\
    -DENABLE_CUDA_RTC=OFF\
    -DCMAKE_BUILD_TYPE=Release\
    -DUSE_F16C=OFF\
    -GNinja\
    -DUSE_LAPACK=OFF\
    -DUSE_JEMALLOC=OFF\
    -DUSE_CPP_PACKAGE=ON\
    -DUSE_SIGNAL_HANDLER=OFF\
    -DUSE_OPENCV=OFF\
    -DUSE_MKL_IF_AVAILABLE=OFF\
    -DUSE_MKLDNN=OFF\
    -DBUILD_CPP_EXAMPLES=OFF\
    -DCMAKE_INSTALL_PREFIX=./packaged\
    ..
   
   ```
   
   The build gets to step 579/601 then fails with the following message:
   ```
   [579/601] cd /mxnet/cpp-package/scripts && echo Running:\ OpWrapperGenerator.py && python OpWrapperGenerator.py /mxnet/build_aarch64_cuda/libmxnet.so
   FAILED: cpp-package/CMakeFiles/cpp_package_op_h ../cpp-package/include/mxnet-cpp/op.h cpp-package/MAIN_DEPENDENCY cpp-package/mxnet 
   cd /mxnet/cpp-package/scripts && echo Running:\ OpWrapperGenerator.py && python OpWrapperGenerator.py /mxnet/build_aarch64_cuda/libmxnet.so
   Running: OpWrapperGenerator.py
   Traceback (most recent call last):
     File "OpWrapperGenerator.py", line 433, in <module>
       raise(e)
   OSError: /mxnet/build_aarch64_cuda/libmxnet.so: cannot open shared object file: No such file or directory
   ninja: build stopped: subcommand failed.
   ```
   
   Note, if I run `ls -la` on the build directory, it shows that `libmxnet.so is indeed there`:
   ```
   root@aa10ef4d3442:/mxnet/build_aarch64_cuda# ls -la
   total 1348340
   drwxr-xr-x 10 root root      4096 Apr 27 01:29 .
   drwxr-xr-x 31 root root      4096 Apr 27 00:54 ..
   -rw-r--r--  1 root root   1760312 Apr 27 01:29 .ninja_deps
   -rw-r--r--  1 root root     62173 Apr 27 01:29 .ninja_log
   drwxr-xr-x  5 root root      4096 Apr 27 00:54 3rdparty
   -rw-r--r--  1 root root     43678 Apr 27 00:54 CMakeCache.txt
   drwxr-xr-x 14 root root      4096 Apr 27 00:54 CMakeFiles
   -rw-r--r--  1 root root       384 Apr 27 00:54 CTestTestfile.cmake
   -rw-r--r--  1 root root      2677 Apr 27 00:54 DartConfiguration.tcl
   drwxr-xr-x  3 root root      4096 Apr 27 00:54 Testing
   drwxr-xr-x  2 root root      4096 Apr 27 00:54 bin
   -rw-r--r--  1 root root    925363 Apr 27 00:54 build.ninja
   drwxr-xr-x  2 root root      4096 Apr 27 00:54 cmake
   -rw-r--r--  1 root root      4581 Apr 27 00:54 cmake_install.cmake
   drwxr-xr-x  3 root root      4096 Apr 27 00:54 cpp-package
   -rw-r--r--  1 root root         0 Apr 27 00:54 dummy.c
   drwxr-xr-x  2 root root      4096 Apr 27 01:24 lib
   -rwxr-xr-x  1 root root    918400 Apr 27 01:23 libcustomop_gpu_lib.so
   -rwxr-xr-x  1 root root    229752 Apr 27 00:54 libcustomop_lib.so
   -rw-r--r--  1 root root 919883526 Apr 27 01:29 libmxnet.a
   -rwxr-xr-x  1 root root 455852192 Apr 27 01:29 libmxnet.so
   -rwxr-xr-x  1 root root    207504 Apr 27 00:54 libpass_lib.so
   -rwxr-xr-x  1 root root    257408 Apr 27 00:54 libsubgraph_lib.so
   -rwxr-xr-x  1 root root    233936 Apr 27 01:24 libtransposecsr_lib.so
   -rwxr-xr-x  1 root root    234272 Apr 27 00:54 libtransposerowsp_lib.so
   drwxr-xr-x  3 root root      4096 Apr 27 00:54 tests
   
   ```
   
   This is using version 1.8.0 of mxnet. This looks like a bug to me. Thoughts? 
   
   @leezu perhaps you can comment, as you were very helpful with the previous issue. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org


[GitHub] [incubator-mxnet] cyrusbehr commented on issue #20222: Unable to cross compile for aarch64 cuda with USE_CPP_PACKAGE=ON

Posted by GitBox <gi...@apache.org>.
cyrusbehr commented on issue #20222:
URL: https://github.com/apache/incubator-mxnet/issues/20222#issuecomment-827996968


   Ok sounds good, let me give that a shot. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org


[GitHub] [incubator-mxnet] cyrusbehr commented on issue #20222: Unable to cross compile for aarch64 cuda with USE_CPP_PACKAGE=ON

Posted by GitBox <gi...@apache.org>.
cyrusbehr commented on issue #20222:
URL: https://github.com/apache/incubator-mxnet/issues/20222#issuecomment-828682569


   This did the trick, thanks. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org


[GitHub] [incubator-mxnet] barry-jin commented on issue #20222: Unable to cross compile for aarch64 cuda with USE_CPP_PACKAGE=ON

Posted by GitBox <gi...@apache.org>.
barry-jin commented on issue #20222:
URL: https://github.com/apache/incubator-mxnet/issues/20222#issuecomment-827996254


   Hi @cyrusbehr , the cross-compilation issue for CPP-PACKAGE exists since https://github.com/apache/incubator-mxnet/issues/13303, because OpWrapperGenerator.py will try to dlopen libmxnet.so(aarch64 target) on an x86 host. The work around may be compiling twice: Firstly to compile a x86 target and use OpWrapperGenerator.py to generate cpp-package required op.h. Then command out https://github.com/apache/incubator-mxnet/blob/2d9c3b5e0a12bcafeb2f23eb25f5ef40fd7977b9/cpp-package/CMakeLists.txt#L15
   and cross-compile an aarch64 target. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org


[GitHub] [incubator-mxnet] cyrusbehr closed issue #20222: Unable to cross compile for aarch64 cuda with USE_CPP_PACKAGE=ON

Posted by GitBox <gi...@apache.org>.
cyrusbehr closed issue #20222:
URL: https://github.com/apache/incubator-mxnet/issues/20222


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org