You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@mxnet.apache.org by "Ni Hui (JIRA)" <ji...@apache.org> on 2018/05/28 08:25:00 UTC

[jira] [Created] (MXNET-491) Use depthwise convolution by cuDNNv7 if available, updated version

Ni Hui created MXNET-491:
----------------------------

             Summary: Use depthwise convolution by cuDNNv7 if available, updated version
                 Key: MXNET-491
                 URL: https://issues.apache.org/jira/browse/MXNET-491
             Project: Apache MXNet
          Issue Type: Improvement
            Reporter: Ni Hui


Use group convolution by cuDNNv7 to improve GPU memory usage.
this pull request is based on #10804
with the following further changes:

    reduce ident changes
    prefer cudnn depthwise convolution over mxnet implementation

still use the explicit #if #else #endif statement over
the new variable effective_num_group solution for backward code path compability
because the new variable effective_num_group may confuse readers with standard group convolution

    some feedback about the speed

    hardware: tesla-m40 24G x 2
    system: centos-7
    nvidia-387.26
    cuda-9.1
    cudnn-v7.1

    model: mobilenet-v2
    batchsize 256 (128 per gpu)

    mxnet implementation: 68s/10iter
    cudnnv7 implementation: 9.5s/10iter





--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@mxnet.apache.org
For additional commands, e-mail: issues-help@mxnet.apache.org