You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@mxnet.apache.org by GitBox <gi...@apache.org> on 2021/04/26 14:30:24 UTC
[GitHub] [incubator-mxnet] leezu opened a new issue #20218: MXNet 1.8.0.post0 OSError: libnccl.so.2 while importing mxnet
leezu opened a new issue #20218:
URL: https://github.com/apache/incubator-mxnet/issues/20218
https://stackoverflow.com/questions/66786887/getting-oserror-libnccl-so-2-while-importing-mxnet
> My configurations are:
>
> gpu: Nvidia GeForce MX450 with 2GB Graphics Card
> Cuda version: 10.1
> MxNet Version: mxnet-cu101: 1.8.0
>
> While trying to mxnet using import mxnet I am getting the following error:
>
> OSError: libnccl.so.2: cannot open shared object file: No such file or directory
>
> I have been using mxnet for last 2years, but never faced the issue before. As far as I knew mxnet installed them on their own as dependencies. However it didn't happen this time.
This also happens with cuda 11.0
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org
[GitHub] [incubator-mxnet] leezu closed issue #20218: MXNet 1.8.0.post0 OSError: libnccl.so.2 while importing mxnet
Posted by GitBox <gi...@apache.org>.
leezu closed issue #20218:
URL: https://github.com/apache/incubator-mxnet/issues/20218
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org
[GitHub] [incubator-mxnet] TristonC commented on issue #20218: MXNet 1.8.0.post0 OSError: libnccl.so.2 while importing mxnet
Posted by GitBox <gi...@apache.org>.
TristonC commented on issue #20218:
URL: https://github.com/apache/incubator-mxnet/issues/20218#issuecomment-828808192
On the release [page](https://mxnet.apache.org/versions/1.8.0/get_started?platform=linux&language=python&processor=gpu&environ=pip&) it says:
```
CUDA should be installed first. Starting from version 1.8.0, CUDNN and NCCL should be installed as well.
Is this done on purpose?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org
[GitHub] [incubator-mxnet] leezu removed a comment on issue #20218: MXNet 1.8.0.post0 OSError: libnccl.so.2 while importing mxnet
Posted by GitBox <gi...@apache.org>.
leezu removed a comment on issue #20218:
URL: https://github.com/apache/incubator-mxnet/issues/20218#issuecomment-826897465
Log on a new DL Base AMI
```
% pip3 install --user mxnet-cu110 ~ ip-172-31-14-207
Collecting mxnet-cu110
Downloading mxnet_cu110-1.8.0.post0-py2.py3-none-manylinux2014_x86_64.whl (323.5 MB)
|████████████████████████████████| 323.5 MB 183 bytes/s
Requirement already satisfied: numpy<2.0.0,>1.16.0 in /usr/local/lib/python3.6/dist-packages (from mxnet-cu110) (1.19.5)
Requirement already satisfied: requests<3,>=2.20.0 in /usr/local/lib/python3.6/dist-packages (from mxnet-cu110) (2.25.1)
Collecting graphviz<0.9.0,>=0.8.1
Downloading graphviz-0.8.4-py2.py3-none-any.whl (16 kB)
Requirement already satisfied: idna<3,>=2.5 in /usr/local/lib/python3.6/dist-packages (from requests<3,>=2.20.0->mxnet-cu110) (2.10)
Requirement already satisfied: chardet<5,>=3.0.2 in /usr/local/lib/python3.6/dist-packages (from requests<3,>=2.20.0->mxnet-cu110) (4.0.0)
Requirement already satisfied: urllib3<1.27,>=1.21.1 in /usr/local/lib/python3.6/dist-packages (from requests<3,>=2.20.0->mxnet-cu110) (1.26.4)
Requirement already satisfied: certifi>=2017.4.17 in /usr/local/lib/python3.6/dist-packages (from requests<3,>=2.20.0->mxnet-cu110) (2020.12.5)
Installing collected packages: graphviz, mxnet-cu110
Successfully installed graphviz-0.8.4 mxnet-cu110-1.8.0.post0
WARNING: You are using pip version 21.0.1; however, version 21.1 is available.
You should consider upgrading via the '/usr/bin/python3 -m pip install --upgrade pip' command.
% ls -l /usr/local 8s ~ ip-172-31-14-207
total 64
drwxr-xr-x 3 root root 4096 Apr 26 14:44 bin
lrwxrwxrwx 1 root root 20 Apr 26 14:44 cuda -> /usr/local/cuda-11.1
drwxr-xr-x 22 root root 4096 Apr 22 18:50 cuda-10.0
drwxr-xr-x 19 root root 4096 Apr 22 18:45 cuda-10.1
drwxr-xr-x 20 root root 4096 Apr 22 18:48 cuda-10.2
drwxr-xr-x 25 root root 4096 Apr 22 18:43 cuda-11.0
drwxr-xr-x 19 root root 4096 Apr 22 18:38 cuda-11.1
drwxr-xr-x 3 root root 4096 Apr 22 18:51 dcgm
drwxr-xr-x 3 root root 4096 Apr 22 17:39 etc
drwxr-xr-x 2 root root 4096 Feb 24 17:30 games
drwxr-xr-x 3 root root 4096 Apr 22 18:54 include
drwxr-xr-x 4 root root 4096 Apr 22 17:39 init
drwxr-xr-x 7 root root 4096 Apr 26 14:42 lib
lrwxrwxrwx 1 root root 9 Feb 24 17:30 man -> share/man
drwxr-xr-x 2 root root 4096 Feb 24 17:30 sbin
drwxr-xr-x 15 root root 4096 Apr 26 14:42 share
drwxr-xr-x 4 root root 4096 Apr 26 14:44 src
drwxrwsr-x 2 root staff 4096 Apr 26 14:42 stow
% sudo rm /usr/local/cuda ~ ip-172-31-14-207
% sudo ln -s /usr/local/cuda-11.0 /usr/local/cuda ~ ip-172-31-14-207
% python3 -c 'import mxnet as mx; print(mx.nd.arange(3))' ~ ip-172-31-14-207
python3: Relink `/lib/x86_64-linux-gnu/libsystemd.so.0' with `/lib/x86_64-linux-gnu/librt.so.1' for IFUNC symbol `clock_gettime'
python3: Relink `/lib/x86_64-linux-gnu/libudev.so.1' with `/lib/x86_64-linux-gnu/librt.so.1' for IFUNC symbol `clock_gettime'
zsh: segmentation fault (core dumped) python3 -c 'import mxnet as mx; print(mx.nd.arange(3))'
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org
[GitHub] [incubator-mxnet] leezu commented on issue #20218: MXNet 1.8.0.post0 OSError: libnccl.so.2 while importing mxnet
Posted by GitBox <gi...@apache.org>.
leezu commented on issue #20218:
URL: https://github.com/apache/incubator-mxnet/issues/20218#issuecomment-828829999
Good catch. So it's done on purpose. I ran into the issue due to a problem with LD_LIBRARY_PATH config on a newer DLAMI version
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org
[GitHub] [incubator-mxnet] leezu commented on issue #20218: MXNet 1.8.0.post0 OSError: libnccl.so.2 while importing mxnet
Posted by GitBox <gi...@apache.org>.
leezu commented on issue #20218:
URL: https://github.com/apache/incubator-mxnet/issues/20218#issuecomment-826897465
Log on a new DL Base AMI
```
% pip3 install --user mxnet-cu110 ~ ip-172-31-14-207
Collecting mxnet-cu110
Downloading mxnet_cu110-1.8.0.post0-py2.py3-none-manylinux2014_x86_64.whl (323.5 MB)
|████████████████████████████████| 323.5 MB 183 bytes/s
Requirement already satisfied: numpy<2.0.0,>1.16.0 in /usr/local/lib/python3.6/dist-packages (from mxnet-cu110) (1.19.5)
Requirement already satisfied: requests<3,>=2.20.0 in /usr/local/lib/python3.6/dist-packages (from mxnet-cu110) (2.25.1)
Collecting graphviz<0.9.0,>=0.8.1
Downloading graphviz-0.8.4-py2.py3-none-any.whl (16 kB)
Requirement already satisfied: idna<3,>=2.5 in /usr/local/lib/python3.6/dist-packages (from requests<3,>=2.20.0->mxnet-cu110) (2.10)
Requirement already satisfied: chardet<5,>=3.0.2 in /usr/local/lib/python3.6/dist-packages (from requests<3,>=2.20.0->mxnet-cu110) (4.0.0)
Requirement already satisfied: urllib3<1.27,>=1.21.1 in /usr/local/lib/python3.6/dist-packages (from requests<3,>=2.20.0->mxnet-cu110) (1.26.4)
Requirement already satisfied: certifi>=2017.4.17 in /usr/local/lib/python3.6/dist-packages (from requests<3,>=2.20.0->mxnet-cu110) (2020.12.5)
Installing collected packages: graphviz, mxnet-cu110
Successfully installed graphviz-0.8.4 mxnet-cu110-1.8.0.post0
WARNING: You are using pip version 21.0.1; however, version 21.1 is available.
You should consider upgrading via the '/usr/bin/python3 -m pip install --upgrade pip' command.
% ls -l /usr/local 8s ~ ip-172-31-14-207
total 64
drwxr-xr-x 3 root root 4096 Apr 26 14:44 bin
lrwxrwxrwx 1 root root 20 Apr 26 14:44 cuda -> /usr/local/cuda-11.1
drwxr-xr-x 22 root root 4096 Apr 22 18:50 cuda-10.0
drwxr-xr-x 19 root root 4096 Apr 22 18:45 cuda-10.1
drwxr-xr-x 20 root root 4096 Apr 22 18:48 cuda-10.2
drwxr-xr-x 25 root root 4096 Apr 22 18:43 cuda-11.0
drwxr-xr-x 19 root root 4096 Apr 22 18:38 cuda-11.1
drwxr-xr-x 3 root root 4096 Apr 22 18:51 dcgm
drwxr-xr-x 3 root root 4096 Apr 22 17:39 etc
drwxr-xr-x 2 root root 4096 Feb 24 17:30 games
drwxr-xr-x 3 root root 4096 Apr 22 18:54 include
drwxr-xr-x 4 root root 4096 Apr 22 17:39 init
drwxr-xr-x 7 root root 4096 Apr 26 14:42 lib
lrwxrwxrwx 1 root root 9 Feb 24 17:30 man -> share/man
drwxr-xr-x 2 root root 4096 Feb 24 17:30 sbin
drwxr-xr-x 15 root root 4096 Apr 26 14:42 share
drwxr-xr-x 4 root root 4096 Apr 26 14:44 src
drwxrwsr-x 2 root staff 4096 Apr 26 14:42 stow
% sudo rm /usr/local/cuda ~ ip-172-31-14-207
% sudo ln -s /usr/local/cuda-11.0 /usr/local/cuda ~ ip-172-31-14-207
% python3 -c 'import mxnet as mx; print(mx.nd.arange(3))' ~ ip-172-31-14-207
python3: Relink `/lib/x86_64-linux-gnu/libsystemd.so.0' with `/lib/x86_64-linux-gnu/librt.so.1' for IFUNC symbol `clock_gettime'
python3: Relink `/lib/x86_64-linux-gnu/libudev.so.1' with `/lib/x86_64-linux-gnu/librt.so.1' for IFUNC symbol `clock_gettime'
zsh: segmentation fault (core dumped) python3 -c 'import mxnet as mx; print(mx.nd.arange(3))'
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org
[GitHub] [incubator-mxnet] TristonC edited a comment on issue #20218: MXNet 1.8.0.post0 OSError: libnccl.so.2 while importing mxnet
Posted by GitBox <gi...@apache.org>.
TristonC edited a comment on issue #20218:
URL: https://github.com/apache/incubator-mxnet/issues/20218#issuecomment-828808192
On the release [page](https://mxnet.apache.org/versions/1.8.0/get_started?platform=linux&language=python&processor=gpu&environ=pip&) it says:
```
CUDA should be installed first. Starting from version 1.8.0, CUDNN and NCCL should be installed as well.
```
Is this done on purpose?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org