You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@mxnet.apache.org by GitBox <gi...@apache.org> on 2020/10/10 18:42:07 UTC

[GitHub] [incubator-mxnet] leezu opened a new issue #19330: CI failure: extremely laggy filesystem

leezu opened a new issue #19330:
URL: https://github.com/apache/incubator-mxnet/issues/19330


   ## Description
   CI failure due to extremely laggy filesystem:
   
   ```
   [2020-10-10T00:53:29.773Z] [960/977] Building CXX object 3rdparty/mkldnn/src/cpu/CMakeFiles/dnnl_cpu.dir/cpu_reorder.cpp.o
   
   [2020-10-10T00:53:50.333Z] Cannot contact mxnetlinux-cpu_cqe3ns49vo: java.lang.InterruptedException
   
   [2020-10-10T01:10:43.825Z] wrapper script does not seem to be touching the log file in /home/jenkins_slave/workspace/build-cpu-clang10@tmp/durable-11bd7127
   
   [2020-10-10T01:10:43.825Z] (JENKINS-48300: if on an extremely laggy filesystem, consider -Dorg.jenkinsci.plugins.durabletask.BourneShellScript.HEARTBEAT_CHECK_INTERVAL=86400)
   
   script returned exit code -1
   ```
   
   https://jenkins.mxnet-ci.amazon-ml.com/blue/organizations/jenkins/mxnet-validation%2Fmiscellaneous/detail/PR-19254/6/pipeline/
   
   cc @josephevans @ChaiBapchya 


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org


[GitHub] [incubator-mxnet] szha edited a comment on issue #19330: CI failure: extremely laggy filesystem

Posted by GitBox <gi...@apache.org>.
szha edited a comment on issue #19330:
URL: https://github.com/apache/incubator-mxnet/issues/19330#issuecomment-706622385


   The mention in the log is "**if** on an extremely laggy filesystem" which suggests that it's just one of the possible causes for "Cannot contact mxnetlinux-cpu_cqe3ns49vo: java.lang.InterruptedException". Unfortunately the instance has already been released by the autoscaling function so I wasn't able to recover the instance metrics to look for the cause.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org


[GitHub] [incubator-mxnet] szha commented on issue #19330: CI failure: extremely laggy filesystem

Posted by GitBox <gi...@apache.org>.
szha commented on issue #19330:
URL: https://github.com/apache/incubator-mxnet/issues/19330#issuecomment-706622385


   The mention in the log is "**if** on an extremely laggy filesystem" which suggests that it's just one of the possible causes for "Cannot contact mxnetlinux-cpu_cqe3ns49vo: java.lang.InterruptedException"


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org