You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@mxnet.apache.org by GitBox <gi...@apache.org> on 2021/04/26 14:51:38 UTC

[GitHub] [incubator-mxnet] leezu opened a new issue #20219: MXNet 1.8.0.post0 segmentation fault on DLAMI

leezu opened a new issue #20219:
URL: https://github.com/apache/incubator-mxnet/issues/20219


   Log on a new DL Base AMI 
   ```
   
   % pip3 install --user mxnet-cu110                                                                                                                                   ~ ip-172-31-14-207
   Collecting mxnet-cu110
     Downloading mxnet_cu110-1.8.0.post0-py2.py3-none-manylinux2014_x86_64.whl (323.5 MB)
        |████████████████████████████████| 323.5 MB 183 bytes/s
   Requirement already satisfied: numpy<2.0.0,>1.16.0 in /usr/local/lib/python3.6/dist-packages (from mxnet-cu110) (1.19.5)
   Requirement already satisfied: requests<3,>=2.20.0 in /usr/local/lib/python3.6/dist-packages (from mxnet-cu110) (2.25.1)
   Collecting graphviz<0.9.0,>=0.8.1
     Downloading graphviz-0.8.4-py2.py3-none-any.whl (16 kB)
   Requirement already satisfied: idna<3,>=2.5 in /usr/local/lib/python3.6/dist-packages (from requests<3,>=2.20.0->mxnet-cu110) (2.10)
   Requirement already satisfied: chardet<5,>=3.0.2 in /usr/local/lib/python3.6/dist-packages (from requests<3,>=2.20.0->mxnet-cu110) (4.0.0)
   Requirement already satisfied: urllib3<1.27,>=1.21.1 in /usr/local/lib/python3.6/dist-packages (from requests<3,>=2.20.0->mxnet-cu110) (1.26.4)
   Requirement already satisfied: certifi>=2017.4.17 in /usr/local/lib/python3.6/dist-packages (from requests<3,>=2.20.0->mxnet-cu110) (2020.12.5)
   Installing collected packages: graphviz, mxnet-cu110
   Successfully installed graphviz-0.8.4 mxnet-cu110-1.8.0.post0
   WARNING: You are using pip version 21.0.1; however, version 21.1 is available.
   You should consider upgrading via the '/usr/bin/python3 -m pip install --upgrade pip' command.
   % ls -l /usr/local                                                                                                                                               8s ~ ip-172-31-14-207
   total 64
   drwxr-xr-x  3 root root  4096 Apr 26 14:44 bin
   lrwxrwxrwx  1 root root    20 Apr 26 14:44 cuda -> /usr/local/cuda-11.1
   drwxr-xr-x 22 root root  4096 Apr 22 18:50 cuda-10.0
   drwxr-xr-x 19 root root  4096 Apr 22 18:45 cuda-10.1
   drwxr-xr-x 20 root root  4096 Apr 22 18:48 cuda-10.2
   drwxr-xr-x 25 root root  4096 Apr 22 18:43 cuda-11.0
   drwxr-xr-x 19 root root  4096 Apr 22 18:38 cuda-11.1
   drwxr-xr-x  3 root root  4096 Apr 22 18:51 dcgm
   drwxr-xr-x  3 root root  4096 Apr 22 17:39 etc
   drwxr-xr-x  2 root root  4096 Feb 24 17:30 games
   drwxr-xr-x  3 root root  4096 Apr 22 18:54 include
   drwxr-xr-x  4 root root  4096 Apr 22 17:39 init
   drwxr-xr-x  7 root root  4096 Apr 26 14:42 lib
   lrwxrwxrwx  1 root root     9 Feb 24 17:30 man -> share/man
   drwxr-xr-x  2 root root  4096 Feb 24 17:30 sbin
   drwxr-xr-x 15 root root  4096 Apr 26 14:42 share
   drwxr-xr-x  4 root root  4096 Apr 26 14:44 src
   drwxrwsr-x  2 root staff 4096 Apr 26 14:42 stow
   % sudo rm /usr/local/cuda                                                                                                                                           ~ ip-172-31-14-207
   % sudo ln -s /usr/local/cuda-11.0 /usr/local/cuda                                                                                                                   ~ ip-172-31-14-207
   % python3 -c 'import mxnet as mx; print(mx.nd.arange(3))'                                                                                                           ~ ip-172-31-14-207
   python3: Relink `/lib/x86_64-linux-gnu/libsystemd.so.0' with `/lib/x86_64-linux-gnu/librt.so.1' for IFUNC symbol `clock_gettime'
   python3: Relink `/lib/x86_64-linux-gnu/libudev.so.1' with `/lib/x86_64-linux-gnu/librt.so.1' for IFUNC symbol `clock_gettime'
   zsh: segmentation fault (core dumped)  python3 -c 'import mxnet as mx; print(mx.nd.arange(3))'
   
   
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org


[GitHub] [incubator-mxnet] leezu commented on issue #20219: MXNet 1.8.0.post0 segmentation fault on DLAMI

Posted by GitBox <gi...@apache.org>.
leezu commented on issue #20219:
URL: https://github.com/apache/incubator-mxnet/issues/20219#issuecomment-826923250


   Be sure to have both `/usr/local/cuda/lib64` and `/usr/local/cuda/lib` on the LD_LIBRARY_PATH. Something changed in DLAMI and `/usr/local/cuda/lib` got removed when deploying my setup to recent DLAMI.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org


[GitHub] [incubator-mxnet] leezu removed a comment on issue #20219: MXNet 1.8.0.post0 segmentation fault on DLAMI

Posted by GitBox <gi...@apache.org>.
leezu removed a comment on issue #20219:
URL: https://github.com/apache/incubator-mxnet/issues/20219#issuecomment-826934119


   The issue was present in DLAMI 36 and was fixed in DLAMI 37


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org


[GitHub] [incubator-mxnet] leezu closed issue #20219: MXNet 1.8.0.post0 segmentation fault on DLAMI

Posted by GitBox <gi...@apache.org>.
leezu closed issue #20219:
URL: https://github.com/apache/incubator-mxnet/issues/20219


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org


[GitHub] [incubator-mxnet] leezu commented on issue #20219: MXNet 1.8.0.post0 segmentation fault on DLAMI

Posted by GitBox <gi...@apache.org>.
leezu commented on issue #20219:
URL: https://github.com/apache/incubator-mxnet/issues/20219#issuecomment-826934119


   The issue was present in DLAMI 36 and was fixed in DLAMI 37


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org