You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@tvm.apache.org by wu...@apache.org on 2022/01/24 16:14:05 UTC
[tvm] branch main updated (74a2fa8 -> d066441)
This is an automated email from the ASF dual-hosted git repository.
wuwei pushed a change to branch main
in repository https://gitbox.apache.org/repos/asf/tvm.git.
from 74a2fa8 [microNPU] Add support for requantize (#9910)
add d066441 [PTX-MMA] Add full PTX MMA code generation support (#9909)
No new revisions were added by this update.
Summary of changes:
include/tvm/tir/builtin.h | 11 +
src/target/source/codegen_cuda.cc | 33 +
src/target/source/ptx_mma.cc | 1374 ++++++++++++++++++++
.../{llvm/codegen_blob.h => source/ptx_mma.h} | 39 +-
src/tir/op/builtin.cc | 3 +
tests/python/unittest/test_tir_ptx_mma.py | 1356 +++++++++++++++++++
6 files changed, 2794 insertions(+), 22 deletions(-)
create mode 100644 src/target/source/ptx_mma.cc
copy src/target/{llvm/codegen_blob.h => source/ptx_mma.h} (54%)
create mode 100644 tests/python/unittest/test_tir_ptx_mma.py