You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@tvm.apache.org by gi...@apache.org on 2022/11/04 18:39:52 UTC

[tvm] branch last-successful updated (6cd1bb5e89 -> ccb7d07159)

This is an automated email from the ASF dual-hosted git repository.

github-bot pushed a change to branch last-successful
in repository https://gitbox.apache.org/repos/asf/tvm.git


    from 6cd1bb5e89 [Hexagon] Update search pattern to find .so address for on device runs (#13230)
     add b9e5c02d0b [TOPI][Hexagon] Implement quantized depthwise conv2d (#12499)
     add 20082614be [skip ci] Revert "[skip ci] Remove naut-thomas from .asf.yaml (#13231)" (#13232)
     add da76587505 [runtime] Fix typo in runtime/registry.h (#13224)
     add 9dd0225a70 [TIR] Use Optional<Stmt> for IfThenElseNode::else_case (#13218)
     add f07f22fac9 [MetaSchedule][Minor] Fix Memory Database Module Equality (#13198)
     add e971956d76 [Relay][transform][SimplifyExpr] simplify adjacent muls and adds with constants (#13213)
     add 25a0d47d2b [Arith][TIR] Check for constant offsets of known literal constraints (#13023)
     add 76cd298bd9 [Hexagon] Add E2E test demonstrating how to apply blocked layout schedule to conv2d via metaschedule (#13180)
     add c9b10a80fd [ci][java] Use mavenCentral for dependencies (#13239)
     add e48dded3b4 [BugFix][LLVM] Add UseInitArray flag in target_options_ (#13115)
     add c3bb62e5c4 [skip ci][docs] Disable scipy intersphinx linking (#13245)
     add 67fa959f18 [CI] Make additional_flags parameter optional in tests/scripts/ci.py (#13238)
     add 0683ece044 [MetaSchedule] Fix thread bindings of MultiLevelTilingTensorCore (#13243)
     add c69f8ce9c9 [Relay] Add ClipAndConsecutiveCast and CastClip to SimplifyExpr (#13236)
     add 3259580bd5 [Hexagon] Make pytest use a random port if not running in CI (#13244)
     add 9cdc97fe23 [DLPack][runtime] Update DLPack to v0.7 (#13177)
     add 2c1fecd097 [TIR][Primitive] Support rolling_buffer schedule primitive in TensorIR (#13033)
     add e9e8c4b263 fix GPU other build (#13235)
     add 5d15428994 [Relay] Add set_attrs_type registry to broadcast_to op (#13096)
     add 6551b71588 [COMMUNITY] Jyotsna Verma -> Reviewer (#13251)
     add 87f52af48a [skip ci] Revert "fix GPU other build (#13235)" (#13261)
     add 7536068e80 [MetaSchedule] Swap the order of RewriteTensorize and VerifyGPUCode to reduce tuning time (#13259)
     add 84fadc45d0 [CI] Skip failing Caffe tests due to broken URL (#13228)
     add 4ecf303695 [TVMC] Apply constant folding when converting layout (#13216)
     add c3c1454233 Apply group write permissions to Python virtual environment (#13252)
     add da4bb4a65b Fix a typo in rpc/client.py (#12842)
     add d261fa8838 [Hexagon] Add pylint on tests (#13233)
     add 404d95f054 [build][relay][te][tir] remove unused vars / args (#13266)
     add ff6aaeb12a [Frontend][Tensorflow2] Import graph_def to default graph before calling function_def_to_graph_def (#13260)
     add d998187e16 [Frontend][PaddlePaddle] Fix UnboundLocalError: local variable 'shape… (#13247)
     add e9ba9865a7 [skip ci] Revert "[ci] Protect release branches (#13208)" (#13274)
     add f15afd2251 [Docs] Minimal dependencies for Fedora/CentOS (#13248)
     add 9df3a33ff4 [build][doc] Fix clang doxygen warnings (#13270)
     add 0d553127e5 [build][tir] fix clang redundant-move warning (#13268)
     add 75921fb559 [ETHOSN] Inline non-compute-intensive partitions (#13092)
     add 47da418fbf [ETHOSN] Throw error message when inference fails (#13022)
     add 1d1db35236 [MetaSchedule] Fix Task Hanging in EvolutionarySearch (#13246)
     add 215f0e2fc2 [Bugfix][TIR] Fix version conflict with `typing` for Python 3.9  (#13269)
     add b98b9f92da [MetaSchedule] Improve the script for TorchBench model tuning & benchmarking (#13255)
     add 90ed632280 [Relay] Add tensor rank check for `nn.instance_norm` (#13280)
     add b1a099b213 [Relay] Enhancement for fold_scale_axis and simplify_expr (#13275)
     add de8a79d9ba [skip-ci][COMMUNITY] New committer Ashutosh Parkhi (#13286)
     add ccb7d07159 [TIR][Arith] Use TryCompare to narrow inequalities if possible (#13024)

No new revisions were added by this update.

Summary of changes:
 .asf.yaml                                          |   8 +-
 3rdparty/dlpack                                    |   2 +-
 CMakeLists.txt                                     |   3 +
 CONTRIBUTORS.md                                    |   4 +-
 apps/android_deploy/app/download-models.gradle     |   2 +-
 docker/install/ubuntu_install_python.sh            |   1 +
 docs/conf.py                                       |   2 +-
 docs/install/from_source.rst                       |   9 +
 include/tvm/arith/analyzer.h                       |  11 +-
 include/tvm/meta_schedule/search_strategy.h        |   2 +
 include/tvm/runtime/c_runtime_api.h                |  73 ++-
 include/tvm/runtime/device_api.h                   |  11 +
 include/tvm/runtime/registry.h                     |   4 +-
 include/tvm/tir/schedule/schedule.h                |  17 +
 include/tvm/tir/stmt.h                             |   4 +-
 jvm/core/src/main/java/org/apache/tvm/Device.java  |  79 +--
 jvm/core/src/main/java/org/apache/tvm/NDArray.java |  31 +-
 .../main/java/org/apache/tvm/rpc/RPCSession.java   |  25 +-
 python/tvm/_ffi/runtime_ctypes.py                  |  93 ++--
 python/tvm/contrib/hexagon/pytest_plugin.py        |   2 +-
 python/tvm/driver/tvmc/transform.py                |   1 +
 .../search_strategy/evolutionary_search.py         |   4 +
 python/tvm/meta_schedule/testing/torchbench/run.py | 272 +++++++---
 .../tvm/meta_schedule/testing/torchbench/utils.py  |   8 +-
 python/tvm/relay/frontend/paddlepaddle.py          |   2 +-
 python/tvm/relay/frontend/tensorflow2.py           |  30 +-
 python/tvm/relay/op/contrib/ethosn.py              |  55 +-
 python/tvm/rpc/client.py                           |  21 +-
 python/tvm/runtime/ndarray.py                      |  26 +-
 python/tvm/tir/schedule/_type_checker.py           |  49 +-
 python/tvm/tir/schedule/schedule.py                | 108 ++++
 python/tvm/topi/hexagon/qnn/__init__.py            |   1 +
 .../topi/hexagon/qnn/qdepthwise_conv2d_slice.py    | 217 ++++++++
 python/tvm/topi/hexagon/slice_ops/dwconv2d.py      |   5 +-
 python/tvm/topi/hexagon/utils.py                   |  19 +
 src/arith/ir_mutator_with_analyzer.cc              |  12 +-
 src/arith/ir_visitor_with_analyzer.cc              |   4 +-
 src/arith/rewrite_simplify.cc                      | 151 ++++--
 src/arith/rewrite_simplify.h                       |  21 +
 src/arith/transitive_comparison_analyzer.cc        | 168 ++++--
 src/contrib/hybrid/codegen_hybrid.cc               |   4 +-
 src/meta_schedule/database/json_database.cc        |   3 +-
 src/meta_schedule/database/memory_database.cc      |   3 +-
 src/meta_schedule/postproc/postproc.cc             |   4 +-
 .../multi_level_tiling_tensor_core.cc              |   5 +
 src/meta_schedule/schedule_rule/schedule_rule.cc   |   2 +-
 .../search_strategy/evolutionary_search.cc         |  18 +-
 src/printer/tir_text_printer.cc                    |   4 +-
 src/printer/tvmscript_printer.cc                   |   4 +-
 src/relay/analysis/util.cc                         |   3 +-
 src/relay/backend/annotate_used_memory.cc          |   4 +
 src/relay/backend/contrib/ethosn/codegen_ethosn.h  |  17 +
 .../backend/contrib/ethosn/inline_partitions.cc    | 126 +++++
 src/relay/backend/graph_plan_memory.cc             |   2 +-
 src/relay/backend/te_compiler_cache.cc             |   4 +-
 src/relay/collage/mock_cost_estimator.h            |  10 +
 src/relay/ir/dataflow_matcher.cc                   |  10 -
 src/relay/op/nn/nn.cc                              |   1 +
 src/relay/op/tensor/transform.cc                   |   1 +
 src/relay/transforms/pattern_utils.h               |  29 ++
 src/relay/transforms/remove_standalone_reshapes.cc |   4 +-
 src/relay/transforms/simplify_expr.cc              | 217 +++++++-
 src/runtime/aot_executor/aot_executor.cc           |   4 +-
 src/runtime/contrib/ethosn/ethosn_device.cc        |  70 ++-
 src/runtime/contrib/ethosn/ethosn_runtime.h        |  33 ++
 src/runtime/hexagon/hexagon_common.h               |   4 +-
 src/runtime/hexagon/hexagon_device_api.cc          |   6 +-
 src/runtime/hexagon/hexagon_device_api.h           |   3 +-
 src/target/llvm/codegen_llvm.cc                    |   4 +-
 src/target/llvm/llvm_instance.cc                   |   2 +
 src/target/source/codegen_c.cc                     |   4 +-
 src/target/spirv/codegen_spirv.cc                  |   4 +-
 src/target/stackvm/codegen_stackvm.cc              |   4 +-
 src/te/operation/compute_op.cc                     |   4 +-
 src/tir/analysis/block_access_region_detector.cc   |   4 +-
 src/tir/analysis/estimate_flops.cc                 |   4 +-
 src/tir/ir/data_type_rewriter.cc                   |   2 +-
 src/tir/ir/stmt.cc                                 |   4 +-
 src/tir/ir/stmt_functor.cc                         |  10 +-
 src/tir/schedule/concrete_schedule.cc              |  12 +
 src/tir/schedule/concrete_schedule.h               |   2 +
 src/tir/schedule/primitive.h                       |  16 +
 src/tir/schedule/primitive/cache_index.cc          |   2 +-
 src/tir/schedule/primitive/rolling_buffer.cc       | 474 +++++++++++++++++
 src/tir/schedule/schedule.cc                       |   3 +
 src/tir/schedule/traced_schedule.cc                |  12 +
 src/tir/schedule/traced_schedule.h                 |   2 +
 src/tir/transforms/common_subexpr_elim_tools.cc    |   6 +-
 src/tir/transforms/compact_buffer_region.cc        |   4 +-
 src/tir/transforms/coproc_sync.cc                  |   4 +-
 src/tir/transforms/inject_virtual_thread.cc        |   6 +-
 src/tir/transforms/ir_utils.cc                     |   2 +-
 src/tir/transforms/lift_attr_scope.cc              |   4 +-
 src/tir/transforms/profile_instrumentation.cc      |   6 +-
 src/tir/transforms/remove_no_op.cc                 |   4 +-
 src/tir/transforms/simplify.cc                     |   4 +-
 src/tir/transforms/storage_access.cc               |   4 +-
 src/tir/transforms/vectorize_loop.cc               |   6 +-
 tests/cpp/runtime/contrib/ethosn/inference_test.cc |  74 +++
 tests/lint/pylint.sh                               |  17 +-
 tests/python/contrib/test_ethosn/infrastructure.py |  28 +-
 tests/python/contrib/test_ethosn/test_addition.py  |   4 +-
 .../python/contrib/test_ethosn/test_concatenate.py |   2 +-
 .../contrib/test_ethosn/test_depth_to_space.py     |   2 +-
 .../contrib/test_ethosn/test_inline_partitions.py  | 167 ++++++
 .../python/contrib/test_ethosn/test_leaky_relu.py  |   2 +-
 tests/python/contrib/test_ethosn/test_multiply.py  |   4 +-
 tests/python/contrib/test_ethosn/test_networks.py  |   5 +-
 tests/python/contrib/test_ethosn/test_relu.py      |   2 +-
 .../python/contrib/test_ethosn/test_requantize.py  |   4 +-
 tests/python/contrib/test_ethosn/test_reshape.py   |   6 +-
 tests/python/contrib/test_ethosn/test_split.py     |   4 +-
 tests/python/contrib/test_ethosn/test_tanh.py      |   2 +-
 .../python/contrib/test_ethosn/test_topologies.py  |  61 ++-
 .../metaschedule_e2e/test_resnet50_int8.py         | 174 +++++++
 tests/python/contrib/test_hexagon/pytest_util.py   |  36 +-
 .../test_hexagon/test_async_dma_pipeline.py        | 514 ++++++++++--------
 .../test_hexagon/test_benchmark_maxpool2d.py       | 190 +++----
 .../contrib/test_hexagon/test_cache_read_write.py  |   4 +
 .../test_hexagon/test_fixed_point_conversion.py    |  13 +-
 .../test_hexagon/test_fixed_point_multiply.py      | 167 +++---
 .../contrib/test_hexagon/test_memory_alloc.py      |  22 +-
 .../contrib/test_hexagon/test_meta_schedule.py     | 147 +++---
 .../contrib/test_hexagon/test_parallel_hvx.py      |  81 +--
 .../test_hexagon/test_parallel_hvx_load_vtcm.py    | 252 +++++----
 .../contrib/test_hexagon/test_parallel_scalar.py   |  63 ++-
 .../contrib/test_hexagon/test_run_unit_tests.py    |   4 +
 tests/python/contrib/test_hexagon/test_sigmoid.py  |  40 +-
 .../test_hexagon/test_software_pipeline_async.py   | 269 +++++-----
 .../contrib/test_hexagon/test_vtcm_bandwidth.py    |  53 +-
 .../test_hexagon/test_wo_qnn_canonicalization.py   |   5 +-
 .../topi/test_depthwise_conv2d_slice.py            | 337 ++++++++++++
 .../test_hexagon/topi/test_dwconv2d_slice.py       | 314 -----------
 tests/python/driver/tvmc/test_transform.py         |  57 ++
 tests/python/frontend/caffe/test_forward.py        |   3 +
 tests/python/relay/test_pass_fold_scale_axis.py    |  51 ++
 tests/python/relay/test_pass_simplify_expr.py      | 155 ++++--
 .../python/unittest/test_arith_rewrite_simplify.py |   2 +-
 tests/python/unittest/test_index_map.py            |   8 +-
 .../unittest/test_meta_schedule_search_strategy.py |  56 ++
 .../unittest/test_tir_schedule_rolling_buffer.py   | 573 +++++++++++++++++++++
 .../test_tir_transform_inject_software_pipeline.py |   4 +-
 .../python/unittest/test_tir_transform_simplify.py |  60 +++
 tests/scripts/ci.py                                |  11 +-
 144 files changed, 5114 insertions(+), 1706 deletions(-)
 create mode 100644 python/tvm/topi/hexagon/qnn/qdepthwise_conv2d_slice.py
 create mode 100644 src/relay/backend/contrib/ethosn/inline_partitions.cc
 create mode 100644 src/tir/schedule/primitive/rolling_buffer.cc
 create mode 100644 tests/cpp/runtime/contrib/ethosn/inference_test.cc
 create mode 100644 tests/python/contrib/test_ethosn/test_inline_partitions.py
 create mode 100644 tests/python/contrib/test_hexagon/topi/test_depthwise_conv2d_slice.py
 delete mode 100644 tests/python/contrib/test_hexagon/topi/test_dwconv2d_slice.py
 create mode 100644 tests/python/driver/tvmc/test_transform.py
 create mode 100644 tests/python/unittest/test_tir_schedule_rolling_buffer.py