You are viewing a plain text version of this content. The canonical link for it is here.

Posted to commits@tvm.apache.org by GitBox <gi...@apache.org> on 2021/04/28 21:03:03 UTC

[GitHub] [tvm] t-vi commented on pull request #7935: [SPARSE] Improve sparse performance on ROCM

t-vi commented on pull request #7935:
URL: https://github.com/apache/tvm/pull/7935#issuecomment-828778315


   I don't think the descriptions are entirely accurate, but the Vega ISA manual says
   
   > This does not access LDS memory and may be called even if no LDS memory is allocated to the wave. It uses LDS hardware to implement an arbitrary swizzle across threads in a wavefront. 
   
   so I would expect that the performance lies somewhere between using LDS and registers. I can imagine that doing a lot less writing might save time in this specific case, but it probably is best to check with AMD before drawing global conclusions.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org