You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@mxnet.apache.org by GitBox <gi...@apache.org> on 2021/02/19 19:37:54 UTC
[GitHub] [incubator-mxnet] larroy opened a new issue #11801: flaky kvstore test
larroy opened a new issue #11801:
URL: https://github.com/apache/incubator-mxnet/issues/11801
http://jenkins.mxnet-ci.amazon-ml.com/blue/organizations/jenkins/incubator-mxnet/detail/master/1221/pipeline/
```
+ ../../tools/launch.py -n 7 --launcher local python dist_sync_kvstore.py --type=gluon_step_cpu
terminate called after throwing an instance of 'dmlc::Error'
what(): [05:33:59] /work/mxnet/3rdparty/ps-lite/include/ps/kv_app.h:587: Check failed: lens->size() == keys.size() (2 vs. 1)
Stack trace returned 9 entries:
[bt] (0) /work/mxnet/python/mxnet/../../lib/libmxnet.so(dmlc::StackTrace[abi:cxx11]()+0x55) [0x7fccbfc095b5]
[bt] (1) /work/mxnet/python/mxnet/../../lib/libmxnet.so(dmlc::LogMessageFatal::~LogMessageFatal()+0x28) [0x7fccbfc0a0e8]
[bt] (2) /work/mxnet/python/mxnet/../../lib/libmxnet.so(int ps::KVWorker<char>::Pull_<ps::SArray<char>, ps::SArray<int> >(ps::SArray<unsigned long> const&, ps::SArray<char>*, ps::SArray<int>*, int, std::function<void ()> const&)::{lambda()#1}::operator()()+0x71d) [0x7fccc2a7a5ed]
[bt] (3) /work/mxnet/python/mxnet/../../lib/libmxnet.so(ps::KVWorker<char>::RunCallback(int)+0xf6) [0x7fccc2a22f16]
[bt] (4) /work/mxnet/python/mxnet/../../lib/libmxnet.so(ps::KVWorker<char>::Process(ps::Message const&)+0x2b4) [0x7fccc2a75d54]
[bt] (5) /work/mxnet/python/mxnet/../../lib/libmxnet.so(ps::Customer::Receiving()+0x57c) [0x7fccc2a82c5c]
[bt] (6) /usr/lib/x86_64-linux-gnu/libstdc++.so.6(+0xb8c80) [0x7fccbc442c80]
[bt] (7) /lib/x86_64-linux-gnu/libpthread.so.0(+0x76ba) [0x7fccc724d6ba]
[bt] (8) /lib/x86_64-linux-gnu/libc.so.6(clone+0x6d) [0x7fccc6f8341d]
Exception in thread Thread-4:
Traceback (most recent call last):
File "/usr/lib/python2.7/threading.py", line 801, in __bootstrap_inner
self.run()
File "/usr/lib/python2.7/threading.py", line 754, in run
self.__target(*self.__args, **self.__kwargs)
File "/work/mxnet/tools/../3rdparty/dmlc-core/tracker/dmlc_tracker/local.py", line 44, in exec_cmd
raise RuntimeError('Get nonzero return code=%d' % ret)
RuntimeError: Get nonzero return code=-6
Sending interrupt signal to process
Terminated
script returned exit code 143
```
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org