You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@mxnet.apache.org by GitBox <gi...@apache.org> on 2018/05/28 08:09:49 UTC

[GitHub] nihui opened a new pull request #11076: Use depthwise convolution by cuDNNv7 if available, updated version

nihui opened a new pull request #11076: Use depthwise convolution by cuDNNv7 if available, updated version
URL: https://github.com/apache/incubator-mxnet/pull/11076
 
 
   this pull request is based on https://github.com/apache/incubator-mxnet/pull/10804
   with the following further changes:
   1. reduce ident changes
   2. prefer cudnn depthwise convolution over mxnet implementation
   
   still use the explicit #if #else #endif statement over
   the new variable effective_num_group solution for backward code path compability
   because the new variable effective_num_group may confuse readers with standard group convolution
   
   > some feedback about the speed
   >
   > hardware: tesla-m40 24G x 2
   > system: centos-7
   > nvidia-387.26
   > cuda-9.1
   > cudnn-v7.1
   >
   > model: mobilenet-v2
   > batchsize 256 (128 per gpu)
   >
   > mxnet implementation: 68s/10iter
   > cudnnv7 implementation: 9.5s/10iter
   
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services