Posted to commits@mxnet.apache.org by GitBox <gi...@apache.org> on 2019/09/12 08:46:56 UTC

[GitHub] [incubator-mxnet] leezu commented on issue #16114: improve dataloader signals and messages

URL: https://github.com/apache/incubator-mxnet/pull/16114#issuecomment-530729242
 
 
   Thanks @zhreshold! I wonder whether adding a default timeout is a backwards-incompatible change that breaks our semantic versioning guarantee. Some user code may currently take 130 seconds per batch, and it would stop working when those users upgrade to MXNet 1.6.
   
   If I understand correctly, the motivation for introducing the timeout is:
   > Sometimes full `shared_memory` will cause all workers to hang and causes timeout. In these
   > cases please reduce `num_workers` or increase system `shared_memory` size instead.
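
The compatibility concern can be illustrated with a minimal sketch that uses only the Python standard library, not MXNet itself. It assumes the loader waits on each batch with a finite timeout; `slow_batch` and `fetch` are hypothetical names, and the durations are scaled down from the 130-second example so the sketch runs quickly.

```python
import time
from multiprocessing import TimeoutError
from multiprocessing.pool import ThreadPool


def slow_batch(seconds):
    """Hypothetical stand-in for a worker that needs `seconds` to build a batch."""
    time.sleep(seconds)
    return "batch"


def fetch(pool, seconds, timeout=None):
    """Wait for one batch; timeout=None waits indefinitely."""
    result = pool.apply_async(slow_batch, (seconds,))
    try:
        return result.get(timeout)
    except TimeoutError:
        return "timed out"


with ThreadPool(1) as pool:
    # No timeout: the slow batch still arrives.
    no_limit = fetch(pool, 0.3)
    # Finite timeout shorter than the batch time: the same workload now fails.
    with_limit = fetch(pool, 0.3, timeout=0.05)
```

Under these assumptions, code that worked when no timeout applied fails once a finite default is shorter than its per-batch time, unless the caller raises the timeout explicitly.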
