You are viewing a plain text version of this content. The canonical link for it is here.

Posted to commits@mxnet.apache.org by GitBox <gi...@apache.org> on 2021/04/22 21:44:08 UTC

[GitHub] [incubator-mxnet] ptrendx commented on pull request #19905: [PERF] Moving GPU softmax to RTC and optimizations

ptrendx commented on pull request #19905:
URL: https://github.com/apache/incubator-mxnet/pull/19905#issuecomment-825204848


   About the vectorization being independent from RTC - generally I agree with you and the first approach to vectorization was actually before RTC was introduced. There was a problem, however, in that using it produced quite a lot of kernels bloating the library size and increasing the GPU memory usage (see PR #17767 and then issue https://github.com/apache/incubator-mxnet/issues/18280). That is why I reintroduced it as part of the RTC effort to make sure that only the needed kernels get compiled.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org