Posted to commits@mxnet.apache.org by GitBox <gi...@apache.org> on 2019/01/27 20:17:31 UTC

[GitHub] mahmoodn opened a new issue #14002: error: ‘__cpuid’ was not declared in this scope

URL: https://github.com/apache/incubator-mxnet/issues/14002
 
 
   Is the latest version compatible with CUDA 10? I get the following error during the build:
   
   ```
   [ 27%] Building CXX object src/CMakeFiles/mkldnn.dir/cpu/cpu_barrier.cpp.o
   cd /home/mahmood/mx/mxnet/3rdparty/mkldnn/build/src && /usr/bin/c++  -DMKLDNN_DLL -DMKLDNN_DLL_EXPORTS -DMKLDNN_THR=MKLDNN_THR_OMP -DUSE_CBLAS -DUSE_MKL -D__STDC_CONSTANT_MACROS -D__STDC_LIMIT_MACROS -Dmkldnn_EXPORTS -I/home/mahmood/mx/mxnet/3rdparty/mkldnn/external/mklml_lnx_2019.0.1.20180928/include -I/home/mahmood/mx/mxnet/3rdparty/mkldnn/include -I/home/mahmood/mx/mxnet/3rdparty/mkldnn/src -I/home/mahmood/mx/mxnet/3rdparty/mkldnn/src/common -I/home/mahmood/mx/mxnet/3rdparty/mkldnn/src/cpu/xbyak  -fopenmp -std=c++11 -fvisibility-inlines-hidden  -Wall -Werror -Wno-unknown-pragmas -fvisibility=internal -mtune=generic -fPIC -Wformat -Wformat-security -fstack-protector-strong  -Wmissing-field-initializers  -Wno-strict-overflow  -O3 -DNDEBUG -D_FORTIFY_SOURCE=2 -fPIC   -std=gnu++11 -o CMakeFiles/mkldnn.dir/cpu/cpu_barrier.cpp.o -c /home/mahmood/mx/mxnet/3rdparty/mkldnn/src/cpu/cpu_barrier.cpp
   In file included from /home/mahmood/mx/mxnet/3rdparty/mkldnn/src/cpu/cpu_isa_traits.hpp:35:0,
                    from /home/mahmood/mx/mxnet/3rdparty/mkldnn/src/cpu/jit_generator.hpp:21,
                    from /home/mahmood/mx/mxnet/3rdparty/mkldnn/src/cpu/cpu_barrier.hpp:22,
                    from /home/mahmood/mx/mxnet/3rdparty/mkldnn/src/cpu/cpu_barrier.cpp:19:
   /home/mahmood/mx/mxnet/3rdparty/mkldnn/src/cpu/xbyak/xbyak_util.h: In static member function ‘static void Xbyak::util::Cpu::getCpuid(unsigned int, unsigned int*)’:
   /home/mahmood/mx/mxnet/3rdparty/mkldnn/src/cpu/xbyak/xbyak_util.h:227:3: error: ‘__cpuid’ was not declared in this scope
      __cpuid(eaxIn, data[0], data[1], data[2], data[3]);
      ^~~~~~~
   
   ```
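
   For reference, the five-argument call in the log has the same form as the `__cpuid` macro that GCC and Clang define in their own `<cpuid.h>`. A minimal sketch for checking, outside the MXNet build, whether that macro is visible to the compiler; it assumes GCC or Clang targeting x86 or x86-64, and the helper name `cpu_vendor` is hypothetical (it is not part of Xbyak or MKL-DNN):

   ```cpp
   // Minimal sketch: checks that the compiler's <cpuid.h> supplies __cpuid.
   // Assumption: GCC or Clang on x86/x86-64. cpu_vendor() is a hypothetical
   // helper written for this example, not an Xbyak or MKL-DNN function.
   #include <cpuid.h>   // defines the macro __cpuid(level, a, b, c, d)
   #include <cstring>
   #include <string>

   // Returns the CPU vendor string reported by CPUID leaf 0.
   std::string cpu_vendor() {
       unsigned int eax = 0, ebx = 0, ecx = 0, edx = 0;
       __cpuid(0, eax, ebx, ecx, edx);   // same call shape as in xbyak_util.h

       // Leaf 0 stores the vendor string in EBX, EDX, ECX (in that order).
       char vendor[13] = {0};
       std::memcpy(vendor + 0, &ebx, 4);
       std::memcpy(vendor + 4, &edx, 4);
       std::memcpy(vendor + 8, &ecx, 4);
       return std::string(vendor);
   }
   ```

   If a file containing only this function compiles with the same `/usr/bin/c++`, the system header itself provides `__cpuid`, and the include paths and preprocessor definitions of the failing build command would be the next thing to compare. Calling `cpu_vendor()` from a small `main` and printing the result is enough to exercise it.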
   
   
   Some information about the system specs is presented below:
   
   ```
   $ lsb_release -a
   No LSB modules are available.
   Distributor ID:	Ubuntu
   Description:	Ubuntu 18.04.1 LTS
   Release:	18.04
   Codename:	bionic
   $ gcc -v
   Using built-in specs.
   COLLECT_GCC=gcc
   COLLECT_LTO_WRAPPER=/usr/lib/gcc/x86_64-linux-gnu/7/lto-wrapper
   OFFLOAD_TARGET_NAMES=nvptx-none
   OFFLOAD_TARGET_DEFAULT=1
   Target: x86_64-linux-gnu
   Configured with: ../src/configure -v --with-pkgversion='Ubuntu 7.3.0-27ubuntu1~18.04' --with-bugurl=file:///usr/share/doc/gcc-7/README.Bugs --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --prefix=/usr --with-gcc-major-version-only --program-suffix=-7 --program-prefix=x86_64-linux-gnu- --enable-shared --enable-linker-build-id --libexecdir=/usr/lib --without-included-gettext --enable-threads=posix --libdir=/usr/lib --enable-nls --with-sysroot=/ --enable-clocale=gnu --enable-libstdcxx-debug --enable-libstdcxx-time=yes --with-default-libstdcxx-abi=new --enable-gnu-unique-object --disable-vtable-verify --enable-libmpx --enable-plugin --enable-default-pie --with-system-zlib --with-target-system-zlib --enable-objc-gc=auto --enable-multiarch --disable-werror --with-arch-32=i686 --with-abi=m64 --with-multilib-list=m32,m64,mx32 --enable-multilib --with-tune=generic --enable-offload-targets=nvptx-none --without-cuda-driver --enable-checking=release --build=x86_64-linux-gnu --host=x86_64-linux-gnu --target=x86_64-linux-gnu
   Thread model: posix
   gcc version 7.3.0 (Ubuntu 7.3.0-27ubuntu1~18.04) 
   $ ~/NVIDIA_CUDA-10.0_Samples/1_Utilities/deviceQuery/deviceQuery 
   /home/mahmood/NVIDIA_CUDA-10.0_Samples/1_Utilities/deviceQuery/deviceQuery Starting...
   
    CUDA Device Query (Runtime API) version (CUDART static linking)
   
   Detected 1 CUDA Capable device(s)
   
   Device 0: "Quadro M2000"
     CUDA Driver Version / Runtime Version          10.0 / 10.0
     CUDA Capability Major/Minor version number:    5.2
     Total amount of global memory:                 4041 MBytes (4236902400 bytes)
     ( 6) Multiprocessors, (128) CUDA Cores/MP:     768 CUDA Cores
     GPU Max Clock rate:                            1162 MHz (1.16 GHz)
     Memory Clock rate:                             3303 Mhz
     Memory Bus Width:                              128-bit
     L2 Cache Size:                                 786432 bytes
     Maximum Texture Dimension Size (x,y,z)         1D=(65536), 2D=(65536, 65536), 3D=(4096, 4096, 4096)
     Maximum Layered 1D Texture Size, (num) layers  1D=(16384), 2048 layers
     Maximum Layered 2D Texture Size, (num) layers  2D=(16384, 16384), 2048 layers
     Total amount of constant memory:               65536 bytes
     Total amount of shared memory per block:       49152 bytes
     Total number of registers available per block: 65536
     Warp size:                                     32
     Maximum number of threads per multiprocessor:  2048
     Maximum number of threads per block:           1024
     Max dimension size of a thread block (x,y,z): (1024, 1024, 64)
     Max dimension size of a grid size    (x,y,z): (2147483647, 65535, 65535)
     Maximum memory pitch:                          2147483647 bytes
     Texture alignment:                             512 bytes
     Concurrent copy and kernel execution:          Yes with 1 copy engine(s)
     Run time limit on kernels:                     Yes
     Integrated GPU sharing Host Memory:            No
     Support host page-locked memory mapping:       Yes
     Alignment requirement for Surfaces:            Yes
     Device has ECC support:                        Disabled
     Device supports Unified Addressing (UVA):      Yes
     Device supports Compute Preemption:            No
     Supports Cooperative Kernel Launch:            No
     Supports MultiDevice Co-op Kernel Launch:      No
     Device PCI Domain ID / Bus ID / location ID:   0 / 38 / 0
     Compute Mode:
        < Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >
   
   deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 10.0, CUDA Runtime Version = 10.0, NumDevs = 1
   Result = PASS
   
   
   
   ```
   Any idea what is causing this?
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services