You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@tvm.apache.org by ar...@apache.org on 2022/01/19 22:02:17 UTC

[tvm] branch cc_reviewers-testing updated (5893d80 -> 3de33ba)

This is an automated email from the ASF dual-hosted git repository.

areusch pushed a change to branch cc_reviewers-testing
in repository https://gitbox.apache.org/repos/asf/tvm.git.


    omit 5893d80  Fix target triggers
    omit 8b4e74c  remove merge bot stuff
    omit 96dcee9  Add Action to add cc'ed people as reviewers
    omit f87c1a0  Fix lint and add tests
    omit be555cd  Add action to label mergeable PRs
     add 46da676  fix icelake target for avx512 and vnni (#9928)
     add 220b122  Fix HexagonSDK.cmake (#9914)
     add 0a159c4  Restore the use of ONNX_DEFAULT_CONFIGS["use_nt_batch_matmul"] (#9925)
     add 670de9b  [CI] Fix pip cache config bug (#9933)
     add 79c59fe  dynamic to static use infer_type_local (#9869)
     add 89ae603  [USMP] Hill Climb allocator (#9704)
     add 4419241  [TEST] Remove `llvm -device=arm_cpu` and `cuda -libs=cudnn` from (#9905)
     add e7c8141  [Relay/Frontend][TFLite] Change the output shape calculation based on keep_dim option in fully connected (#9840)
     add 6f6fc68  [TIR] Encode conditional accesses info into block read/write regions (#9880)
     add b3c6625  [Int8] Support cublas on e2e int8 models (also tried cudnn but doesn't work) (#9898)
     add 1b1cfb3  remove clang compile warnings (#9942)
     add 84ee90c  [ONNX] Fix onnx convtranspose error (#9938)
     add 6eb4ed8  [Fix] relay onnx frontend bug when [A, B, M, N] * [1, B, N, K] (#9911)
     add be0677d  [Caffe Frontend] supporting group > 1 cases for Deconv op (#8260)
     add 3c8de42  [Caffe Frontend] extending Eltwise to handle multiple inputs (#8136)
     add 596333b  [MetaSchedule] Schedule Rule: Auto Inline (#9943)
     add 24bccd2  [microNPU] Remove remaining UnsupportedLayout checks (#9791)
     add 133bb9c  [microNPU][2c] Add performance modelling to cascader (#9778)
     add 77c66f0  [MetaSchedule] random compute location (#9940)
     add 1e5373f  [MetaSchedule] PostProcessor: Verify GPU Code (#9945)
     add 211291f  [CUDNN] Refactor descriptor initialization, remove `cudnn.conv.output_shape_from_cudnn` (#9948)
     add 364e2db  [microNPU] Add support for scalar values (#9794)
     add 31de5bc  [HotFix] Skip the flaky MetaSchedule Auto-Unroll test (#9956)
     add 4f29562  Enable NPU and CMSIS in ci_qemu (#9957)
     add b1bd18e  [Runtime][Pipeline executor] Global parameters group name and runtime modules parameters map. (#9846)
     add 904b4ae  [CI] Upgrade ONNX (#9965)
     add 19717aa  [TOPI,x86] Improve performance on int8 conv2d on x86 (#9966)
     add da3b63e  [Hexagon] Return pathlib.Path from get_hexagon_rpc_path() (#9969)
     add 928be03  [Hexagon] Add missing #include <iterator> (#9968)
     add 6d68184  [Doc][Fix] Fix qnn op parameters hint order (#9622)
     add ac1a43b  Propagate ssh-agent authentication socket (#9926)
     add c3ace20  [TIR][USMP] Integrating USMP to AoT Executor (#9565)
     add f1501d0  Disallow copy to/from external HexagonBuffer (#9930)
     add 611b430  [Fix] Fix flaky test of #9952 (#9958)
     add 1351ede  Add contribute page about CI (#9906)
     add 6286ac1  [bugfix] Fix the behavior of TVMScript printer (#9974)
     add a9b1d5b  Add sccache to docker images (#9844)
     add fd5915a  [Relay] Add `conv2d_backward_weight` op (without topi) (#9954)
     add f9171f1  [MetaSchedule] Schedule Rule: Add RFactor (#9975)
     add bae144c  Add Action to add cc'ed people as reviewers (#9934)
     add cc5382e  Add runtime.ModuleGetFormat method enabling export of BYOC generated sources which require a .cpp/.cc file extension (#9243)
     add 3de33ba  Submit cc'ed reviewers one at a time

This update added new revisions after undoing existing revisions.
That is to say, some revisions that were in the old version of the
branch are not in the new version.  This situation occurs
when a user --force pushes a change and generates a repository
containing something like this:

 * -- * -- B -- O -- O -- O   (5893d80)
            \
             N -- N -- N   refs/heads/cc_reviewers-testing (3de33ba)

You should already have received notification emails for all of the O
revisions, and so the following emails describe only the N revisions
from the common base, B.

Any revisions marked "omit" are not gone; other references still
refer to them.  Any revisions marked "discard" are gone forever.

No new revisions were added by this update.

Summary of changes:
 .github/workflows/cc_bot.yml                       |   2 +-
 Jenkinsfile                                        |   4 +-
 cmake/modules/HexagonSDK.cmake                     |   2 +-
 docker/Dockerfile.ci_arm                           |  13 +-
 docker/Dockerfile.ci_cpu                           |  10 +-
 docker/Dockerfile.ci_gpu                           |  10 +-
 docker/Dockerfile.ci_i386                          |   2 +-
 docker/Dockerfile.ci_lint                          |   6 +-
 docker/Dockerfile.ci_qemu                          |  10 +-
 docker/Dockerfile.ci_wasm                          |   2 +-
 docker/build.sh                                    |   2 +
 docker/install/ubuntu_install_rust.sh              |  12 +-
 ...tall_universal.sh => ubuntu_install_sccache.sh} |   8 +-
 .../install/ubuntu_install_wasmtime.sh             |  12 +-
 docs/contribute/ci.rst                             | 176 +++++
 docs/contribute/committer_guide.rst                |  36 +-
 docs/contribute/community.rst                      |   6 +-
 docs/contribute/document.rst                       |  53 +-
 docs/contribute/git_howto.rst                      |  45 +-
 docs/contribute/index.rst                          |   3 +-
 docs/contribute/pull_request.rst                   |   2 +
 include/tvm/meta_schedule/schedule_rule.h          |  12 +-
 include/tvm/runtime/module.h                       |   5 +
 include/tvm/tir/schedule/schedule.h                |   8 +
 include/tvm/tir/stmt.h                             |   7 +
 include/tvm/tir/transform.h                        |   8 +
 include/tvm/tir/usmp/algo/greedy.h                 |  85 ++
 include/tvm/tir/usmp/algorithms.h                  |  62 ++
 .../runtime.cc => include/tvm/tir/usmp/analysis.h  |  34 +-
 include/tvm/tir/usmp/transform.h                   |  64 ++
 include/tvm/tir/usmp/utils.h                       |  47 +-
 python/tvm/contrib/cudnn.py                        |  62 --
 python/tvm/contrib/ethosu/cascader/__init__.py     |   3 +
 .../ethosu/cascader/block_config.py}               |  32 +-
 .../tvm/contrib/ethosu/cascader/device_config.py   | 661 ++++++++++++++++
 python/tvm/contrib/ethosu/cascader/graph.py        |  19 +-
 python/tvm/contrib/ethosu/cascader/parts.py        |  13 +-
 python/tvm/contrib/hexagon/build.py                |   4 +-
 python/tvm/contrib/pipeline_executor.py            | 111 ++-
 python/tvm/meta_schedule/postproc/__init__.py      |   1 +
 .../verify_gpu_code.py}                            |  15 +-
 python/tvm/meta_schedule/schedule_rule/__init__.py |   3 +
 .../tvm/meta_schedule/schedule_rule/add_rfactor.py |  49 ++
 .../tvm/meta_schedule/schedule_rule/auto_inline.py |  67 ++
 .../random_compute_location.py}                    |  13 +-
 python/tvm/meta_schedule/testing/schedule_rule.py  |  55 ++
 .../tvm/meta_schedule/testing/space_generation.py  |  28 +-
 python/tvm/meta_schedule/testing/te_workload.py    | 877 +++++++++++++++++++++
 python/tvm/micro/model_library_format.py           |   4 +-
 .../tvm/relay/backend/contrib/ethosu/legalize.py   |  20 +-
 .../relay/backend/contrib/ethosu/te/convolution.py |  55 +-
 .../tvm/relay/backend/contrib/ethosu/te/inline.py  |   5 +-
 .../relay/backend/contrib/ethosu/tir/compiler.py   |  11 +-
 .../relay/backend/contrib/ethosu/tir/scheduler.py  |   5 +-
 python/tvm/relay/frontend/caffe.py                 |  88 ++-
 python/tvm/relay/frontend/onnx.py                  |  66 +-
 python/tvm/relay/frontend/tflite.py                |   8 +
 python/tvm/relay/op/_tensor_grad.py                |  53 +-
 python/tvm/relay/op/nn/_nn.py                      |  78 ++
 python/tvm/relay/op/nn/nn.py                       |  51 ++
 python/tvm/relay/qnn/op/qnn.py                     |  40 +-
 python/tvm/relay/testing/__init__.py               |   1 +
 python/tvm/runtime/module.py                       |  22 +-
 python/tvm/testing/utils.py                        |   2 -
 python/tvm/tir/schedule/schedule.py                |  26 +
 python/tvm/topi/cuda/dense.py                      |   5 +-
 python/tvm/topi/generic/conv2d.py                  |  10 +-
 python/tvm/topi/testing/__init__.py                |   1 +
 .../topi/testing/conv2d_backcward_weight_python.py |  76 ++
 python/tvm/topi/x86/utils.py                       |   4 +-
 src/contrib/ethosu/cascader/block_config.cc        |  60 ++
 src/contrib/ethosu/cascader/block_config.h         |  93 +++
 src/contrib/ethosu/cascader/common.h               |  26 +
 src/contrib/ethosu/cascader/graph.cc               |  13 +-
 src/contrib/ethosu/cascader/graph.h                |  18 +-
 src/contrib/ethosu/cascader/parts/ethosu.cc        | 142 +++-
 src/contrib/ethosu/cascader/parts/ethosu.h         |  34 +-
 src/contrib/ethosu/cascader/parts/inline.cc        |   4 +-
 src/contrib/ethosu/cascader/parts/inline.h         |   2 +-
 src/driver/driver_api.cc                           |   7 +-
 src/meta_schedule/postproc/verify_gpu_code.cc      | 130 +++
 src/meta_schedule/schedule_rule/add_rfactor.cc     | 122 +++
 src/meta_schedule/schedule_rule/auto_inline.cc     | 174 ++++
 .../schedule_rule/random_compute_location.cc       | 123 +++
 src/meta_schedule/utils.h                          |  20 +
 src/printer/tvmscript_printer.cc                   |  18 +-
 src/relay/backend/aot_executor_codegen.cc          | 235 ++++--
 src/relay/backend/build_module.cc                  |   4 +-
 src/relay/backend/contrib/ethosu/source_module.cc  |   2 +
 src/relay/backend/utils.h                          |  60 +-
 src/relay/backend/vm/compiler.cc                   |   3 +-
 src/relay/ir/indexed_graph.cc                      |   2 +
 src/relay/op/nn/convolution.cc                     |  89 +++
 src/relay/qnn/op/convolution.cc                    |  54 +-
 src/relay/transforms/dynamic_to_static.cc          |  14 +-
 src/runtime/contrib/cudnn/conv_forward.cc          | 228 +-----
 src/runtime/contrib/cudnn/cudnn_utils.cc           |  95 +++
 src/runtime/contrib/cudnn/cudnn_utils.h            |   9 +-
 .../hexagon/android/sim/hexagon_device_sim.cc      |   1 +
 src/runtime/hexagon/hexagon/hexagon_buffer.cc      |  23 +-
 src/runtime/meta_data.h                            |  47 --
 src/runtime/module.cc                              |   9 +
 src/runtime/pipeline/pipeline_executor.cc          |  42 +-
 src/runtime/pipeline/pipeline_executor.h           |  21 +-
 src/runtime/pipeline/pipeline_struct.h             |  42 +
 src/target/metadata_module.cc                      |   2 +-
 src/target/metadata_module.h                       |   4 +-
 src/target/source/codegen_c.cc                     |  55 --
 src/target/source/codegen_c.h                      |  13 +-
 src/target/source/codegen_source_base.cc           |  55 ++
 src/target/source/codegen_source_base.h            |  12 +
 src/target/source/source_module.cc                 | 265 +++++--
 src/target/source/source_module.h                  |   4 +-
 src/target/target_kind.cc                          |   1 +
 src/tir/analysis/block_access_region_detector.cc   |  29 +-
 src/tir/analysis/device_constraint_utils.cc        |  16 +-
 src/tir/schedule/analysis.h                        | 116 +++
 src/tir/schedule/analysis/analysis.cc              | 471 +++++++++++
 src/tir/schedule/concrete_schedule.cc              |   9 +
 src/tir/schedule/concrete_schedule.h               |   2 +
 src/tir/schedule/primitive.h                       |  11 +
 src/tir/schedule/primitive/block_annotate.cc       |   6 +-
 src/tir/schedule/primitive/sampling.cc             |  64 ++
 src/tir/schedule/schedule.cc                       |   2 +
 src/tir/schedule/traced_schedule.cc                |  14 +
 src/tir/schedule/traced_schedule.h                 |   1 +
 src/tir/schedule/utils.h                           |  94 +++
 src/tir/transforms/compact_buffer_region.cc        |  10 +-
 src/tir/transforms/ir_utils.cc                     |  76 +-
 src/tir/transforms/ir_utils.h                      |  18 +-
 src/tir/transforms/make_unpacked_api.cc            |  23 +-
 src/tir/transforms/unify_thread_binding.cc         |   2 +-
 src/tir/usmp/algo/greedy.cc                        | 160 ++--
 src/tir/usmp/algo/hill_climb.cc                    | 339 ++++++++
 src/tir/usmp/analysis/extract_buffer_info.cc       |  19 +-
 src/tir/usmp/transform/assign_pool_info.cc         | 120 +++
 .../convert_pool_allocations_to_offsets.cc         |  47 +-
 src/tir/usmp/unified_static_memory_planner.cc      |  98 +++
 src/tir/usmp/utils.cc                              |  68 +-
 tests/cpp/build_module_test.cc                     |  16 +-
 tests/cpp/runtime/hexagon_buffer.cc                |  14 +
 tests/python/contrib/test_cudnn.py                 |  10 +-
 tests/python/contrib/test_ethosu/cascader/infra.py |  66 ++
 .../cascader/test_ethosu_block_config.py           | 262 ++++++
 .../cascader/test_ethosu_conv2d_matcher.py         |  71 +-
 .../cascader/test_ethosu_inline_matcher.py         |   3 +-
 .../test_ethosu/cascader/test_ethosu_part.py       |  28 +-
 .../cascader/test_ethosu_part_performance.py       | 200 +++++
 .../contrib/test_ethosu/cascader/test_graph.py     |   5 +-
 tests/python/contrib/test_ethosu/infra.py          |   2 +-
 tests/python/contrib/test_ethosu/test_codegen.py   |   5 +-
 tests/python/contrib/test_ethosu/test_compiler.py  |   3 +-
 tests/python/frontend/caffe/test_forward.py        |  69 ++
 tests/python/frontend/onnx/test_forward.py         |  45 ++
 tests/python/frontend/pytorch/qnn_test.py          |  15 +-
 tests/python/frontend/tflite/test_forward.py       |   1 +
 tests/python/relay/aot/aot_test_utils.py           |   6 +-
 tests/python/relay/aot/corstone300.ld              |   2 +
 tests/python/relay/aot/corstone300.mk              |  12 +-
 tests/python/relay/aot/default.mk                  |  13 +-
 tests/python/relay/aot/test_c_device_api.py        |  61 +-
 tests/python/relay/aot/test_crt_aot.py             |  40 +-
 tests/python/relay/aot/test_crt_aot_usmp.py        | 266 +++++++
 tests/python/relay/test_any.py                     |   2 +-
 tests/python/relay/test_op_grad_level2.py          |  44 +-
 tests/python/relay/test_pipeline_executor.py       |  94 ++-
 .../test_meta_schedule_postproc_verify_gpu_code.py | 231 ++++++
 ...test_meta_schedule_schedule_rule_add_rfactor.py |  80 ++
 ...test_meta_schedule_schedule_rule_auto_inline.py | 300 +++++++
 ...hedule_schedule_rule_random_compute_location.py |  93 +++
 .../test_tir_analysis_get_block_access_region.py   |  66 ++
 .../python/unittest/test_tir_schedule_sampling.py  |  77 ++
 .../test_tir_transform_compact_buffer_region.py    |   1 +
 tests/python/unittest/test_tir_usmp_algo.py        |  21 +-
 .../unittest/test_tir_usmp_algo_hill_climb.py      | 397 ++++++++++
 ...ransform_convert_pool_allocations_to_offsets.py |   4 +-
 .../python/unittest/test_tvmscript_syntax_sugar.py |  23 +
 tests/scripts/ci.py                                |   9 +
 tests/scripts/github_cc_reviewers.py               |  10 +-
 tests/scripts/task_config_build_qemu.sh            |   2 +
 180 files changed, 8911 insertions(+), 1342 deletions(-)
 copy docker/install/{ubuntu_install_universal.sh => ubuntu_install_sccache.sh} (79%)
 copy tests/lint/pylint.sh => docker/install/ubuntu_install_wasmtime.sh (72%)
 mode change 100755 => 100644
 create mode 100644 docs/contribute/ci.rst
 create mode 100644 include/tvm/tir/usmp/algo/greedy.h
 create mode 100644 include/tvm/tir/usmp/algorithms.h
 copy src/tir/op/runtime.cc => include/tvm/tir/usmp/analysis.h (57%)
 create mode 100644 include/tvm/tir/usmp/transform.h
 copy python/tvm/{target/virtual_device.py => contrib/ethosu/cascader/block_config.py} (53%)
 create mode 100644 python/tvm/contrib/ethosu/cascader/device_config.py
 copy python/tvm/meta_schedule/{measure_callback/update_cost_model.py => postproc/verify_gpu_code.py} (70%)
 create mode 100644 python/tvm/meta_schedule/schedule_rule/add_rfactor.py
 create mode 100644 python/tvm/meta_schedule/schedule_rule/auto_inline.py
 copy python/tvm/meta_schedule/{measure_callback/add_to_database.py => schedule_rule/random_compute_location.py} (70%)
 create mode 100644 python/tvm/meta_schedule/testing/schedule_rule.py
 copy apps/android_deploy/app/src/main/jni/Application.mk => python/tvm/meta_schedule/testing/space_generation.py (53%)
 create mode 100644 python/tvm/meta_schedule/testing/te_workload.py
 create mode 100644 python/tvm/topi/testing/conv2d_backcward_weight_python.py
 create mode 100644 src/contrib/ethosu/cascader/block_config.cc
 create mode 100644 src/contrib/ethosu/cascader/block_config.h
 create mode 100644 src/meta_schedule/postproc/verify_gpu_code.cc
 create mode 100644 src/meta_schedule/schedule_rule/add_rfactor.cc
 create mode 100644 src/meta_schedule/schedule_rule/auto_inline.cc
 create mode 100644 src/meta_schedule/schedule_rule/random_compute_location.cc
 create mode 100644 src/tir/usmp/algo/hill_climb.cc
 create mode 100644 src/tir/usmp/transform/assign_pool_info.cc
 create mode 100644 src/tir/usmp/unified_static_memory_planner.cc
 create mode 100644 tests/python/contrib/test_ethosu/cascader/test_ethosu_block_config.py
 create mode 100644 tests/python/contrib/test_ethosu/cascader/test_ethosu_part_performance.py
 create mode 100644 tests/python/relay/aot/test_crt_aot_usmp.py
 create mode 100644 tests/python/unittest/test_meta_schedule_postproc_verify_gpu_code.py
 create mode 100644 tests/python/unittest/test_meta_schedule_schedule_rule_add_rfactor.py
 create mode 100644 tests/python/unittest/test_meta_schedule_schedule_rule_auto_inline.py
 create mode 100644 tests/python/unittest/test_meta_schedule_schedule_rule_random_compute_location.py
 create mode 100644 tests/python/unittest/test_tir_usmp_algo_hill_climb.py