You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@mxnet.apache.org by GitBox <gi...@apache.org> on 2020/04/24 00:25:20 UTC

[GitHub] [incubator-mxnet-ci] ChaiBapchya commented on pull request #23: [DO NOT MERGE] Test g3

ChaiBapchya commented on pull request #23:
URL: https://github.com/apache/incubator-mxnet-ci/pull/23#issuecomment-618736828


   Created an AMI using the above mentioned scripts & following the steps as mentioned in the updated Readme.
   
   Launched a G3.8xl instance with the created AMI.
   ```
   sudo python3 ci/build.py --platform ubuntu_build_cuda /work/runtime_functions.sh build_ubuntu_gpu_cuda101_cudnn7
   sudo python3 ci/build.py --platform ubuntu_gpu_cu101 /work/runtime_functions.sh integrationtest_ubuntu_gpu_python
   ```
   Fail due to libnvidia-ml.so.1 not found
   ```
   2020-04-23 23:35:41,571 - root - INFO - Started container: fe0a2fbcd8eb
   + CI_CUDA_COMPUTE_CAPABILITIES='-gencode=arch=compute_52,code=sm_52 -gencode=arch=compute_70,code=sm_70'
   + CI_CMAKE_CUDA_ARCH='5.2 7.0'
   + set +x
   + export PYTHONPATH=./python/
   + PYTHONPATH=./python/
   + export MXNET_STORAGE_FALLBACK_LOG_VERBOSE=0
   + MXNET_STORAGE_FALLBACK_LOG_VERBOSE=0
   + export MXNET_SUBGRAPH_VERBOSE=0
   + MXNET_SUBGRAPH_VERBOSE=0
   + export DMLC_LOG_STACK_TRACE_DEPTH=10
   + DMLC_LOG_STACK_TRACE_DEPTH=10
   + pytest example/image-classification/test_score.py
   WARNING:root:Unable to import numpy/mxnet. Skipping conftest.
   ============================= test session starts ==============================
   platform linux -- Python 3.5.2, pytest-5.3.5, py-1.8.1, pluggy-0.13.1
   rootdir: /work/mxnet, inifile: pytest.ini
   plugins: timeout-1.3.4, xdist-1.31.0, forked-1.1.3, env-0.6.2, cov-2.8.1
   timeout: 1200.0s
   timeout method: signal
   timeout func_only: False
   collected 0 items / 1 error
   
   ==================================== ERRORS ====================================
   _________ ERROR collecting example/image-classification/test_score.py __________
   example/image-classification/test_score.py:25: in <module>
       import mxnet as mx
   python/mxnet/__init__.py:23: in <module>
       from .context import Context, current_context, cpu, gpu, cpu_pinned
   python/mxnet/context.py:23: in <module>
       from .base import classproperty, with_metaclass, _MXClassPropertyMetaClass
   python/mxnet/base.py:351: in <module>
       _LIB = _load_lib()
   python/mxnet/base.py:342: in _load_lib
       lib = ctypes.CDLL(lib_path[0], ctypes.RTLD_LOCAL)
   /usr/lib/python3.5/ctypes/__init__.py:347: in __init__
       self._handle = _dlopen(self._name, mode)
   E   OSError: libnvidia-ml.so.1: cannot open shared object file: No such file or directory
   !!!!!!!!!!!!!!!!!!!! Interrupted: 1 error during collection !!!!!!!!!!!!!!!!!!!!
   =============================== 1 error in 0.13s ===============================
   2020-04-23 23:35:42,274 - root - INFO - Waiting for status of container fe0a2fbcd8eb for 600 s.
   2020-04-23 23:35:42,445 - root - INFO - Container exit status: {'StatusCode': 2, 'Error': None}
   2020-04-23 23:35:42,445 - root - ERROR - Container exited with an error 😞
   2020-04-23 23:35:42,445 - root - INFO - Executed command for reproduction:
   
   ci/build.py --platform ubuntu_gpu_cu101 /work/runtime_functions.sh integrationtest_ubuntu_gpu_python
   
   2020-04-23 23:35:42,446 - root - INFO - Stopping container: fe0a2fbcd8eb
   2020-04-23 23:35:42,447 - root - INFO - Removing container: fe0a2fbcd8eb
   2020-04-23 23:35:42,456 - root - CRITICAL - Execution of ['/work/runtime_functions.sh', 'integrationtest_ubuntu_gpu_python'] failed wit
   h status: 2
   ```


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org