Posted to commits@mxnet.apache.org by GitBox <gi...@apache.org> on 2020/07/19 03:08:28 UTC

[GitHub] [incubator-mxnet] DickJC123 opened a new issue #18756: pytest worker crash seen on newly introduced unittest test_profiler_gpu.py::test_aggregate_duplication

DickJC123 opened a new issue #18756:
URL: https://github.com/apache/incubator-mxnet/issues/18756


   ## Description
   tests/python/gpu/test_profiler_gpu.py recently started importing unittest/test_profiler.py, so those tests now run for the first time with a GPU default context.  With that change, I have seen the following on centos-gpu:
   ```
   worker 'gw3' crashed while running 'tests/python/gpu/test_profiler_gpu.py::test_aggregate_duplication'
   ```
   ## Occurrences
   http://jenkins.mxnet-ci.amazon-ml.com/blue/organizations/jenkins/mxnet-validation%2Fcentos-gpu/detail/PR-18694/11/pipeline
   
   @leezu You might want to look into this or watch for more occurrences.
   ## What have you tried to solve it?
   
   1.  I've bypassed this test in my current PR with a `del test_aggregate_duplication` after the newly introduced import.
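
A minimal sketch of why the `del` bypass works, assuming the GPU test file pulls in the CPU tests with a star import and that pytest collects module-level callables whose names start with `test` (its default pattern). The module `cpu_tests` and the helpers `star_import` and `collectable_tests` are hypothetical stand-ins for illustration; they are not part of MXNet or pytest.

```python
import types

# Hypothetical stand-in for the CPU test module (not the real test_profiler.py).
cpu_tests = types.ModuleType("cpu_tests")
exec(
    "def test_profiler():\n    pass\n"
    "def test_aggregate_duplication():\n    pass\n",
    cpu_tests.__dict__,
)


def star_import(module, namespace):
    """Mimic `from module import *` for a module that defines no __all__."""
    for name, value in vars(module).items():
        if not name.startswith("_"):
            namespace[name] = value


def collectable_tests(namespace):
    """Names a collector matching the `test` prefix would pick up."""
    return sorted(
        name
        for name, value in namespace.items()
        if name.startswith("test") and callable(value)
    )


# Namespace of the (hypothetical) GPU test file after the import.
gpu_namespace = {}
star_import(cpu_tests, gpu_namespace)

# The bypass: remove the name so it is no longer visible for collection.
del gpu_namespace["test_aggregate_duplication"]

print(collectable_tests(gpu_namespace))  # ['test_profiler']
```

Deleting the name only hides the test from the GPU run; the original function in the CPU module is untouched and would still be collected there.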
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [incubator-mxnet] leezu commented on issue #18756: pytest worker crash seen on newly introduced unittest test_profiler_gpu.py::test_aggregate_duplication

Posted by GitBox <gi...@apache.org>.
leezu commented on issue #18756:
URL: https://github.com/apache/incubator-mxnet/issues/18756#issuecomment-661250450


   @DickJC123 I'm not familiar with the feature, but @Zha0q1 added it and may be able to help.





[GitHub] [incubator-mxnet] leezu edited a comment on issue #18756: pytest worker crash seen on newly introduced unittest test_profiler_gpu.py::test_aggregate_duplication

Posted by GitBox <gi...@apache.org>.
leezu edited a comment on issue #18756:
URL: https://github.com/apache/incubator-mxnet/issues/18756#issuecomment-661250450


   @DickJC123 I'm not familiar with the feature, but @Zha0q1 added it in https://github.com/apache/incubator-mxnet/pull/15240 and may be able to help. Thanks for disabling the flaky test.





[GitHub] [incubator-mxnet] leezu edited a comment on issue #18756: pytest worker crash seen on newly introduced unittest test_profiler_gpu.py::test_aggregate_duplication

Posted by GitBox <gi...@apache.org>.
leezu edited a comment on issue #18756:
URL: https://github.com/apache/incubator-mxnet/issues/18756#issuecomment-661250450


   @DickJC123 I'm not familiar with the feature, but @Zha0q1 added it in https://github.com/apache/incubator-mxnet/pull/15240 and may be able to help.

