You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@tvm.apache.org by GitBox <gi...@apache.org> on 2022/11/29 17:26:54 UTC

[GitHub] [tvm] vvchernov commented on pull request #13472: [QNN][ONNX][Relay] support mix-precision input types for QLinearMatMulfix

vvchernov commented on PR #13472:
URL: https://github.com/apache/tvm/pull/13472#issuecomment-1331017399

   Hello @AndrewZhaoLuo! I've done this fix to support quantized distilbert-like model with mixed-precision in QLinearMatMut. The model is supported by ONNX RT. After this fix and fix from #13469 the model is supported by TVM with VirtualMachine. Nevertheless you are right the primitive op with mix-precision does not work on ORT and more over for TVM too. I'm working on it.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@tvm.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org