You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@mxnet.apache.org by GitBox <gi...@apache.org> on 2017/12/17 13:19:14 UTC

[GitHub] jinhuang415 opened a new issue #9112: prepare_mkl.sh script issue report

jinhuang415 opened a new issue #9112: prepare_mkl.sh script issue report
URL: https://github.com/apache/incubator-mxnet/issues/9112
 
 
   Note: Providing complete information in the most concise form is the best way to get help. This issue template serves as the checklist for essential information to most of the technical issues and bug reports. For non-technical issues and feature requests, feel free to present the information in what you believe is the best form.
   
   For Q & A and discussion, please start a discussion thread at https://discuss.mxnet.io 
   
   ## Description
   Build MXNet with Intel MKLML under Linux will link with incorrect MKL library instead of MKLML library due to a bug in prepare_mkl.sh.
   
   ## Environment info (Required)
   ----------Python Info----------
   ('Version      :', '2.7.5')
   ('Compiler     :', 'GCC 4.8.5 20150623 (Red Hat 4.8.5-16)')
   ('Build        :', ('default', 'Aug  4 2017 00:39:18'))
   ('Arch         :', ('64bit', 'ELF'))
   ------------Pip Info-----------
   ('Version      :', '8.1.2')
   ('Directory    :', '/usr/lib/python2.7/site-packages/pip')
   ----------MXNet Info-----------
   ('Version      :', '0.12.1')
   ('Directory    :', '/home/jinhuang/.local/lib/python2.7/site-packages/mxnet-0.12.1-py2.7.egg/mxnet')
   Traceback (most recent call last):
     File "diagnose.py", line 108, in check_mxnet
       with open(commit_hash, 'r') as f:
   IOError: [Errno 2] No such file or directory: '/home/jinhuang/.local/lib/python2.7/site-packages/mxnet-0.12.1-py2.7.egg/mxnet/COMMIT_HASH'
   
   ----------System Info----------
   ('Platform     :', 'Linux-3.10.0-693.el7.x86_64-x86_64-with-centos-7.4.1708-Core')
   ('system       :', 'Linux')
   ('node         :', 'mlt-ace')
   ('release      :', '3.10.0-693.el7.x86_64')
   ('version      :', '#1 SMP Tue Aug 22 21:09:27 UTC 2017')
   ----------Hardware Info----------
   ('machine      :', 'x86_64')
   ('processor    :', 'x86_64')
   Architecture:          x86_64
   CPU op-mode(s):        32-bit, 64-bit
   Byte Order:            Little Endian
   CPU(s):                88
   On-line CPU(s) list:   0-87
   Thread(s) per core:    2
   Core(s) per socket:    22
   Socket(s):             2
   NUMA node(s):          2
   Vendor ID:             GenuineIntel
   CPU family:            6
   Model:                 79
   Model name:            Intel(R) Xeon(R) CPU E5-2699 v4 @ 2.20GHz
   Stepping:              1
   CPU MHz:               2199.914
   CPU max MHz:           3600.0000
   CPU min MHz:           1200.0000
   BogoMIPS:              4389.83
   Virtualization:        VT-x
   ...
   ----------Network Test----------
   Setting timeout: 10
   Timing for MXNet: https://github.com/apache/incubator-mxnet, DNS: 0.0017 sec, LOAD: 1.3231 sec.
   Timing for PYPI: https://pypi.python.org/pypi/pip, DNS: 0.0051 sec, LOAD: 1.7166 sec.
   Timing for FashionMNIST: https://apache-mxnet.s3-accelerate.dualstack.amazonaws.com/gluon/dataset/fashion-mnist/train-labels-idx1-ubyte.gz, DNS: 0.0051 sec, LOAD: 1.0842 sec.
   Timing for Conda: https://repo.continuum.io/pkgs/free/, DNS: 0.0068 sec, LOAD: 1.4465 sec.
   Timing for Gluon Tutorial(en): http://gluon.mxnet.io, DNS: 0.0067 sec, LOAD: 0.5077 sec.
   Timing for Gluon Tutorial(cn): https://zh.gluon.ai, DNS: 0.0054 sec, LOAD: 0.4266 sec.
   
   
   Package used (Python/R/Scala/Julia):
   (I'm using Python)
   
   ## Build info (Required if built from source)
   
   Compiler (gcc/clang/mingw/visual studio):    GCC
   
   MXNet commit hash:
   ae0fe77c25c51f2afecb178c610cfcd0b8a18aac
   
   Build config:
   make -j 12 USE_OPENCV=1 USE_MKL2017=1 USE_MKL2017_EXPERIMENTAL=1 USE_BLAS=mkl USE_PROFILER=1 MKLML_ROOT=/home/jinhuang/downloads/mklml_lnx_2018.0.1.20171007
   
   ## Error Message:
   No error message generated, but the generated libmxnet.so will link with unexpected MKL library instead of MKLML library and result in unexpected behavior. The root cause is in prepare_mkl.sh USE_MKLML flag will be wrongly set to 0 and in sub-module makefile this flag will be used to determine the linked library, MKLML library will only be linked when USE_MKLML is set to 1.
   
   ## Minimum reproducible example
   Just follow the build command in "build config" section to build MXNet with MKLML and issue will occur.
   
   ## Steps to reproduce
   1. Follow the command in "build config" section to build MXNet with MKLML.
   2. "ldd lib/libmxnet.so | grep libmkl" will see the library linked with unexpected MKL library instead of MKLML library.
   
   ## What have you tried to solve it?
   1.  Applied a fix in prepare_mkl.sh to fix this issue.
   2.  Added a test script to check if MKLML is installed correctly (linked with expected MKLML library) after 
    build/install.
   
   Other info:  this issue is also reported in another thread https://github.com/apache/incubator-mxnet/issues/8881

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services