You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@mxnet.apache.org by GitBox <gi...@apache.org> on 2021/04/26 14:30:24 UTC

[GitHub] [incubator-mxnet] leezu opened a new issue #20218: MXNet 1.8.0.post0 OSError: libnccl.so.2 while importing mxnet

leezu opened a new issue #20218:
URL: https://github.com/apache/incubator-mxnet/issues/20218


   https://stackoverflow.com/questions/66786887/getting-oserror-libnccl-so-2-while-importing-mxnet
   
   > My configurations are:
   > 
   > gpu: Nvidia GeForce MX450 with 2GB Graphics Card
   > Cuda version: 10.1
   > MxNet Version: mxnet-cu101: 1.8.0
   > 
   > While trying to mxnet using import mxnet I am getting the following error:
   > 
   > OSError: libnccl.so.2: cannot open shared object file: No such file or directory
   > 
   > I have been using mxnet for last 2years, but never faced the issue before. As far as I knew mxnet installed them on their own as dependencies. However it didn't happen this time.
   
   
   This also happens with cuda 11.0


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org


[GitHub] [incubator-mxnet] leezu closed issue #20218: MXNet 1.8.0.post0 OSError: libnccl.so.2 while importing mxnet

Posted by GitBox <gi...@apache.org>.
leezu closed issue #20218:
URL: https://github.com/apache/incubator-mxnet/issues/20218


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org


[GitHub] [incubator-mxnet] TristonC commented on issue #20218: MXNet 1.8.0.post0 OSError: libnccl.so.2 while importing mxnet

Posted by GitBox <gi...@apache.org>.
TristonC commented on issue #20218:
URL: https://github.com/apache/incubator-mxnet/issues/20218#issuecomment-828808192


   On the release [page](https://mxnet.apache.org/versions/1.8.0/get_started?platform=linux&language=python&processor=gpu&environ=pip&)  it says: 
   ```
   CUDA should be installed first. Starting from version 1.8.0, CUDNN and NCCL should be installed as well.
   
   Is this done on purpose?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org


[GitHub] [incubator-mxnet] leezu removed a comment on issue #20218: MXNet 1.8.0.post0 OSError: libnccl.so.2 while importing mxnet

Posted by GitBox <gi...@apache.org>.
leezu removed a comment on issue #20218:
URL: https://github.com/apache/incubator-mxnet/issues/20218#issuecomment-826897465


   Log on a new DL Base AMI 
   ```
   
   % pip3 install --user mxnet-cu110                                                                                                                                   ~ ip-172-31-14-207
   Collecting mxnet-cu110
     Downloading mxnet_cu110-1.8.0.post0-py2.py3-none-manylinux2014_x86_64.whl (323.5 MB)
        |████████████████████████████████| 323.5 MB 183 bytes/s
   Requirement already satisfied: numpy<2.0.0,>1.16.0 in /usr/local/lib/python3.6/dist-packages (from mxnet-cu110) (1.19.5)
   Requirement already satisfied: requests<3,>=2.20.0 in /usr/local/lib/python3.6/dist-packages (from mxnet-cu110) (2.25.1)
   Collecting graphviz<0.9.0,>=0.8.1
     Downloading graphviz-0.8.4-py2.py3-none-any.whl (16 kB)
   Requirement already satisfied: idna<3,>=2.5 in /usr/local/lib/python3.6/dist-packages (from requests<3,>=2.20.0->mxnet-cu110) (2.10)
   Requirement already satisfied: chardet<5,>=3.0.2 in /usr/local/lib/python3.6/dist-packages (from requests<3,>=2.20.0->mxnet-cu110) (4.0.0)
   Requirement already satisfied: urllib3<1.27,>=1.21.1 in /usr/local/lib/python3.6/dist-packages (from requests<3,>=2.20.0->mxnet-cu110) (1.26.4)
   Requirement already satisfied: certifi>=2017.4.17 in /usr/local/lib/python3.6/dist-packages (from requests<3,>=2.20.0->mxnet-cu110) (2020.12.5)
   Installing collected packages: graphviz, mxnet-cu110
   Successfully installed graphviz-0.8.4 mxnet-cu110-1.8.0.post0
   WARNING: You are using pip version 21.0.1; however, version 21.1 is available.
   You should consider upgrading via the '/usr/bin/python3 -m pip install --upgrade pip' command.
   % ls -l /usr/local                                                                                                                                               8s ~ ip-172-31-14-207
   total 64
   drwxr-xr-x  3 root root  4096 Apr 26 14:44 bin
   lrwxrwxrwx  1 root root    20 Apr 26 14:44 cuda -> /usr/local/cuda-11.1
   drwxr-xr-x 22 root root  4096 Apr 22 18:50 cuda-10.0
   drwxr-xr-x 19 root root  4096 Apr 22 18:45 cuda-10.1
   drwxr-xr-x 20 root root  4096 Apr 22 18:48 cuda-10.2
   drwxr-xr-x 25 root root  4096 Apr 22 18:43 cuda-11.0
   drwxr-xr-x 19 root root  4096 Apr 22 18:38 cuda-11.1
   drwxr-xr-x  3 root root  4096 Apr 22 18:51 dcgm
   drwxr-xr-x  3 root root  4096 Apr 22 17:39 etc
   drwxr-xr-x  2 root root  4096 Feb 24 17:30 games
   drwxr-xr-x  3 root root  4096 Apr 22 18:54 include
   drwxr-xr-x  4 root root  4096 Apr 22 17:39 init
   drwxr-xr-x  7 root root  4096 Apr 26 14:42 lib
   lrwxrwxrwx  1 root root     9 Feb 24 17:30 man -> share/man
   drwxr-xr-x  2 root root  4096 Feb 24 17:30 sbin
   drwxr-xr-x 15 root root  4096 Apr 26 14:42 share
   drwxr-xr-x  4 root root  4096 Apr 26 14:44 src
   drwxrwsr-x  2 root staff 4096 Apr 26 14:42 stow
   % sudo rm /usr/local/cuda                                                                                                                                           ~ ip-172-31-14-207
   % sudo ln -s /usr/local/cuda-11.0 /usr/local/cuda                                                                                                                   ~ ip-172-31-14-207
   % python3 -c 'import mxnet as mx; print(mx.nd.arange(3))'                                                                                                           ~ ip-172-31-14-207
   python3: Relink `/lib/x86_64-linux-gnu/libsystemd.so.0' with `/lib/x86_64-linux-gnu/librt.so.1' for IFUNC symbol `clock_gettime'
   python3: Relink `/lib/x86_64-linux-gnu/libudev.so.1' with `/lib/x86_64-linux-gnu/librt.so.1' for IFUNC symbol `clock_gettime'
   zsh: segmentation fault (core dumped)  python3 -c 'import mxnet as mx; print(mx.nd.arange(3))'
   
   
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org


[GitHub] [incubator-mxnet] leezu commented on issue #20218: MXNet 1.8.0.post0 OSError: libnccl.so.2 while importing mxnet

Posted by GitBox <gi...@apache.org>.
leezu commented on issue #20218:
URL: https://github.com/apache/incubator-mxnet/issues/20218#issuecomment-828829999


   Good catch. So it's done on purpose. I ran into the issue due to a problem with LD_LIBRARY_PATH config on a newer DLAMI version


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org


[GitHub] [incubator-mxnet] leezu commented on issue #20218: MXNet 1.8.0.post0 OSError: libnccl.so.2 while importing mxnet

Posted by GitBox <gi...@apache.org>.
leezu commented on issue #20218:
URL: https://github.com/apache/incubator-mxnet/issues/20218#issuecomment-826897465


   Log on a new DL Base AMI 
   ```
   
   % pip3 install --user mxnet-cu110                                                                                                                                   ~ ip-172-31-14-207
   Collecting mxnet-cu110
     Downloading mxnet_cu110-1.8.0.post0-py2.py3-none-manylinux2014_x86_64.whl (323.5 MB)
        |████████████████████████████████| 323.5 MB 183 bytes/s
   Requirement already satisfied: numpy<2.0.0,>1.16.0 in /usr/local/lib/python3.6/dist-packages (from mxnet-cu110) (1.19.5)
   Requirement already satisfied: requests<3,>=2.20.0 in /usr/local/lib/python3.6/dist-packages (from mxnet-cu110) (2.25.1)
   Collecting graphviz<0.9.0,>=0.8.1
     Downloading graphviz-0.8.4-py2.py3-none-any.whl (16 kB)
   Requirement already satisfied: idna<3,>=2.5 in /usr/local/lib/python3.6/dist-packages (from requests<3,>=2.20.0->mxnet-cu110) (2.10)
   Requirement already satisfied: chardet<5,>=3.0.2 in /usr/local/lib/python3.6/dist-packages (from requests<3,>=2.20.0->mxnet-cu110) (4.0.0)
   Requirement already satisfied: urllib3<1.27,>=1.21.1 in /usr/local/lib/python3.6/dist-packages (from requests<3,>=2.20.0->mxnet-cu110) (1.26.4)
   Requirement already satisfied: certifi>=2017.4.17 in /usr/local/lib/python3.6/dist-packages (from requests<3,>=2.20.0->mxnet-cu110) (2020.12.5)
   Installing collected packages: graphviz, mxnet-cu110
   Successfully installed graphviz-0.8.4 mxnet-cu110-1.8.0.post0
   WARNING: You are using pip version 21.0.1; however, version 21.1 is available.
   You should consider upgrading via the '/usr/bin/python3 -m pip install --upgrade pip' command.
   % ls -l /usr/local                                                                                                                                               8s ~ ip-172-31-14-207
   total 64
   drwxr-xr-x  3 root root  4096 Apr 26 14:44 bin
   lrwxrwxrwx  1 root root    20 Apr 26 14:44 cuda -> /usr/local/cuda-11.1
   drwxr-xr-x 22 root root  4096 Apr 22 18:50 cuda-10.0
   drwxr-xr-x 19 root root  4096 Apr 22 18:45 cuda-10.1
   drwxr-xr-x 20 root root  4096 Apr 22 18:48 cuda-10.2
   drwxr-xr-x 25 root root  4096 Apr 22 18:43 cuda-11.0
   drwxr-xr-x 19 root root  4096 Apr 22 18:38 cuda-11.1
   drwxr-xr-x  3 root root  4096 Apr 22 18:51 dcgm
   drwxr-xr-x  3 root root  4096 Apr 22 17:39 etc
   drwxr-xr-x  2 root root  4096 Feb 24 17:30 games
   drwxr-xr-x  3 root root  4096 Apr 22 18:54 include
   drwxr-xr-x  4 root root  4096 Apr 22 17:39 init
   drwxr-xr-x  7 root root  4096 Apr 26 14:42 lib
   lrwxrwxrwx  1 root root     9 Feb 24 17:30 man -> share/man
   drwxr-xr-x  2 root root  4096 Feb 24 17:30 sbin
   drwxr-xr-x 15 root root  4096 Apr 26 14:42 share
   drwxr-xr-x  4 root root  4096 Apr 26 14:44 src
   drwxrwsr-x  2 root staff 4096 Apr 26 14:42 stow
   % sudo rm /usr/local/cuda                                                                                                                                           ~ ip-172-31-14-207
   % sudo ln -s /usr/local/cuda-11.0 /usr/local/cuda                                                                                                                   ~ ip-172-31-14-207
   % python3 -c 'import mxnet as mx; print(mx.nd.arange(3))'                                                                                                           ~ ip-172-31-14-207
   python3: Relink `/lib/x86_64-linux-gnu/libsystemd.so.0' with `/lib/x86_64-linux-gnu/librt.so.1' for IFUNC symbol `clock_gettime'
   python3: Relink `/lib/x86_64-linux-gnu/libudev.so.1' with `/lib/x86_64-linux-gnu/librt.so.1' for IFUNC symbol `clock_gettime'
   zsh: segmentation fault (core dumped)  python3 -c 'import mxnet as mx; print(mx.nd.arange(3))'
   
   
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org


[GitHub] [incubator-mxnet] TristonC edited a comment on issue #20218: MXNet 1.8.0.post0 OSError: libnccl.so.2 while importing mxnet

Posted by GitBox <gi...@apache.org>.
TristonC edited a comment on issue #20218:
URL: https://github.com/apache/incubator-mxnet/issues/20218#issuecomment-828808192


   On the release [page](https://mxnet.apache.org/versions/1.8.0/get_started?platform=linux&language=python&processor=gpu&environ=pip&)  it says: 
   ```
   CUDA should be installed first. Starting from version 1.8.0, CUDNN and NCCL should be installed as well.
   ```
   Is this done on purpose?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org