You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@mxnet.apache.org by GitBox <gi...@apache.org> on 2018/06/27 23:48:28 UTC

[GitHub] marcoabreu opened a new issue #11441: Failing KVStore test dist-kvstore tests GPU

marcoabreu opened a new issue #11441: Failing KVStore test dist-kvstore tests GPU
URL: https://github.com/apache/incubator-mxnet/issues/11441
 
 
   http://jenkins.mxnet-ci.amazon-ml.com/blue/organizations/jenkins/incubator-mxnet/detail/PR-11433/1/pipeline/1569 
   This test throws a lot of errors:
   
   ```
   + ../../tools/launch.py -n 7 --launcher local python dist_sync_kvstore.py
   
   worker 0 is initialized
   
   worker 3 is initialized
   
   worker 4 is initialized
   
   worker 2 is initialized
   
   worker 5 is initialized
   
   worker 1 is initialized
   
   worker 6 is initialized
   
   worker 6 is done with non compression tests
   
   worker 1 is done with non compression tests
   
   Traceback (most recent call last):
   
     File "dist_sync_kvstore.py", line 384, in <module>
   
       kv = init_kv()
   
     File "dist_sync_kvstore.py", line 72, in init_kv
   
       kv.init(keys_shape, [mx.nd.ones(shape)] * len(keys_shape))
   
     File "../../python/mxnet/kvstore.py", line 154, in init
   
       check_call(_LIB.MXKVStoreInitEx(self.handle, mx_uint(len(ckeys)), ckeys, cvals))
   
     File "../../python/mxnet/base.py", line 210, in check_call
   
       raise MXNetError(py_str(_LIB.MXGetLastError()))
   
   mxnet.base.MXNetError: [23:38:17] src/kvstore/./kvstore_local.h:84: Check failed: str_key_dict_.find(str_key) == str_key_dict_.end() duplicate init of key 3
   
   
   
   Stack trace returned 10 entries:
   
   [bt] (0) /work/mxnet/python/mxnet/../../lib/libmxnet.so(dmlc::StackTrace[abi:cxx11]()+0x5b) [0x7f2efc0db08b]
   
   [bt] (1) /work/mxnet/python/mxnet/../../lib/libmxnet.so(dmlc::LogMessageFatal::~LogMessageFatal()+0x28) [0x7f2efc0dbbf8]
   
   [bt] (2) /work/mxnet/python/mxnet/../../lib/libmxnet.so(mxnet::kvstore::KVStoreLocal::Init(std::vector<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >, std::allocator<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > > > const&, std::vector<mxnet::NDArray, std::allocator<mxnet::NDArray> > const&)+0x50d) [0x7f2efedb375d]
   
   [bt] (3) /work/mxnet/python/mxnet/../../lib/libmxnet.so(MXKVStoreInitEx+0x4be) [0x7f2efed2721e]
   
   [bt] (4) /usr/lib/x86_64-linux-gnu/libffi.so.6(ffi_call_unix64+0x4c) [0x7f2f5d8e7e40]
   
   [bt] (5) /usr/lib/x86_64-linux-gnu/libffi.so.6(ffi_call+0x2eb) [0x7f2f5d8e78ab]
   
   [bt] (6) /usr/lib/python2.7/lib-dynload/_ctypes.x86_64-linux-gnu.so(_ctypes_callproc+0x48f) [0x7f2f5daf73df]
   
   [bt] (7) /usr/lib/python2.7/lib-dynload/_ctypes.x86_64-linux-gnu.so(+0x11d82) [0x7f2f5dafbd82]
   
   [bt] (8) python(PyEval_EvalFrameEx+0x578f) [0x4c15bf]
   
   [bt] (9) python(PyEval_EvalCodeEx+0x306) [0x4b9ab6]
   
   ```

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services