review.tizen.org Git - platform/upstream/tvm.git/log

projects / platform / upstream / tvm.git / log

windclarion [Wed, 2 Sep 2020 17:19:57 +0000 (01:19 +0800)]

[RUNTIME][CRT] use macro to replace hardcode number (#6365)

Signed-off-by: windclarion <windclarion@gmail.com>

masahi [Wed, 2 Sep 2020 15:55:38 +0000 (00:55 +0900)]

[Torch] Support logsumexp, clean up unnecessary infer_shape usage (#6374)

* clean up infer_shape usage, add logsumexp op

* add more tests for logsumexp

* remove commented code

commit | commitdiff | tree

Andrew Reusch [Wed, 2 Sep 2020 14:35:01 +0000 (07:35 -0700)]

acquire gil while finalizing PackedFunc (#6378)

commit | commitdiff | tree

lixiaoquan [Wed, 2 Sep 2020 06:04:37 +0000 (14:04 +0800)]

[Relay] Enhance relay.split(), allow splitted dim to be dynamic (#6289)

* [Relay] Enhance relay.split(), allow splitted dim to be dynamic

* Add assert in shape function

* Fix CI

commit | commitdiff | tree

Haichen Shen [Wed, 2 Sep 2020 04:46:19 +0000 (21:46 -0700)]

add missing dependency (#6375)

commit | commitdiff | tree

Haozheng Fan [Tue, 1 Sep 2020 23:40:00 +0000 (07:40 +0800)]

[Relay] support i64 indices (#6143)

commit | commitdiff | tree

Trevor Morris [Tue, 1 Sep 2020 18:06:16 +0000 (11:06 -0700)]

[BYOC][JSON] Support input nodes with multiple entries (#6368)

* Support input nodes with multiple data entries

* Rename input_var_idx_ to input_var_eid_

commit | commitdiff | tree

masahi [Tue, 1 Sep 2020 13:40:42 +0000 (22:40 +0900)]

[Torch] Fix aten::max and aten::min conversion (#6372)

* fix aten::max and aten::min conversion

* remove print

commit | commitdiff | tree

ANSHUMAN TRIPATHY [Mon, 31 Aug 2020 21:19:33 +0000 (02:49 +0530)]

[TIR][Transform]Block scope hoisting added (#6238)

* Block scope hoisting added

* lowering flow added with 2 variants

* Fake commit to trigger ci with pass default enabled

* CI Failure resolved

* Optimize for if var list iteration

* More test case added

* Fake commit to disable failed test cases

* Pass default value restored

* [1] Review comment handled

* [2] Review comments handled

commit | commitdiff | tree

tkat0 [Mon, 31 Aug 2020 15:10:14 +0000 (00:10 +0900)]

[DOCKER] Fix Dockerfile.demo_android (#6361)

* [DOCKER] Fix Dockerfile.demo_android

* fix

commit | commitdiff | tree

Chenfan [Mon, 31 Aug 2020 06:07:24 +0000 (14:07 +0800)]

[Ansor][AutoTVM v2.0] Phase 2: Update heavy operations with parallel_for (#6348)

* Update auto_scheduler with parallel_for

* Update

* Update

* Update

* Update inferbound

commit | commitdiff | tree

gigo [Mon, 31 Aug 2020 06:07:05 +0000 (14:07 +0800)]

[DOC] Fix mistyped word (#6362)

* Fix the doc mistyped word in `tvm.te.hybrid.build` function

Co-authored-by: gigo <gigo_liao@qbitsemi.com>

commit | commitdiff | tree

Tianqi Chen [Sat, 29 Aug 2020 21:34:50 +0000 (14:34 -0700)]

[DOCKER] Use clear name that is separate from ASF brand for cache (#6360)

commit | commitdiff | tree

masahi [Sat, 29 Aug 2020 06:14:02 +0000 (15:14 +0900)]

[Torch] Add cast to double, fix flatten conversion (#6357)

* support cast to double and fix flatten conversion

* also support batch flatten, add test

* add flatten test

* clean up

commit | commitdiff | tree

Junru Shao [Sat, 29 Aug 2020 05:59:35 +0000 (22:59 -0700)]

[Target][Codegen] Use target class in all codegens (#6347)

* [Target][Codegen] Make all code generator use Target class instead of target string

* Remove dep to TargetNode::str() in LLVM module

* Allow for llvm nvptx codegen

* ...

* Address comments from Cody

* Rename UpdateTargetConfig => UpdateTargetConfigKeyValueEntry

commit | commitdiff | tree

Tianqi Chen [Fri, 28 Aug 2020 21:42:15 +0000 (14:42 -0700)]

[CMAKE] Compatible for ROCm before 3.7 (#6359)

commit | commitdiff | tree

Max Willsey [Fri, 28 Aug 2020 19:13:47 +0000 (12:13 -0700)]

Improve Rust bindings: Map, Array, String, various IR nodes (#6339)

* Fix datatype

* Add initialize macro

* Add some TIR nodes

* Better downcasting

* Improve Array and add Map

* Convert to new string API

* Clean up some warnings

* Add ConstIntBound type

* Run cargo fmt

* Remove debug prints

* Add some more ops

* Fix some string code

Co-authored-by: Jared Roesch <jroesch@octoml.ai>

commit | commitdiff | tree

Dmitriy Smirnov [Fri, 28 Aug 2020 18:46:13 +0000 (19:46 +0100)]

quanitze operation expanded to take const argument (#6127)

* quanitze operation expanded to take const argument

* amendments

used get_tensor_expr, added _test_forward_quantize_dequantize_const test

commit | commitdiff | tree

Andrew Reusch [Fri, 28 Aug 2020 16:27:09 +0000 (09:27 -0700)]

Add docker/lint.sh, for running dockerized lint scripts locally (#6333)

* Add -i option to docker/bash.sh

* Allows scripts to invoke dockerized commands interactively, for
   better Ctrl+C.

* Add docker/lint.sh to run lint step locally in the docker VM.

* This allows developers to run lint using the official versions of
   the lint tools without needing to lookup the docker image name.
* Move all lint scripts to tests/lint/
* Point Makefile to those new scripts.
* Update apache rat script to filter untracked/gitignore'd files when
   run with `docker/lint.sh`.

* fix bash_source[0]

* explicitly set the author for CI

* try environment variable override

* try config option

* remove =traditional from ignored option to increase git compat

* address comments, fix behavior under git worktrees

* address cppdocs comments

* address lint.sh comments

* address zhi comments, update pull_request rst

commit | commitdiff | tree

Andrew Liu [Fri, 28 Aug 2020 04:27:38 +0000 (21:27 -0700)]

typo (#6352)

commit | commitdiff | tree

Cody Yu [Thu, 27 Aug 2020 22:11:42 +0000 (15:11 -0700)]

[Ansor][AutoTVM v2.0] Phase 2: Evolutionary Search (#6310)

* init commit

* Add rest rules

* refactor

* address comments

* improve test

* address comments

commit | commitdiff | tree

mbaret [Thu, 27 Aug 2020 21:03:45 +0000 (22:03 +0100)]

[BYOC][ETHOSN] Add support for quantized convolution (#6335)

* [BYOC][ETHOSN] Add support for quantized convolution

This PR adds support for quantized convolution. This
includes mapping it via a composite function and all
the necessary methods to convert from Relay to the
APIs in Support Library.

Co-authored-by: Leo Blonk <Leo.Blonk@arm.com>
Co-authored-by: Tristan O'Connor <tristan.oconnor@arm.com>
* Fix padding change

Change-Id: I0794b0ac6190478e2d1b858ad0dd90f37fc0207b

* Add docs to Tvm2Npu methods

Change-Id: Iab865619b449a3d0dd6bb0dbdcb198acd529fc4e

* Remove generate tests

Change-Id: I51f90499f7ce82a1ce49f0731d3d50627e1d0225

Co-authored-by: Leo Blonk <Leo.Blonk@arm.com>
Co-authored-by: Tristan O'Connor <tristan.oconnor@arm.com>

commit | commitdiff | tree

雾雨魔理沙 [Thu, 27 Aug 2020 18:32:40 +0000 (11:32 -0700)]

[Relay][Training] Make AutoDiff thread through global function. (#6336)

* save

* lint

* lint

* fix warning

* fix test

* save

commit | commitdiff | tree

lhutton1 [Thu, 27 Aug 2020 03:50:38 +0000 (04:50 +0100)]

[BYOC][ACL] Improved pooling support (#6248)

* [BYOC][ACL] Improved pooling support

Adds support in ACL for the following relay pooling operators and composite functions:
  * nn.avg_pool2d (fp32), cast + nn.avg_pool2d(uint8) + cast => AVG pool
  * nn.global_max_pool2d => Global MAX pool
  * nn.global_avg_pool2d, cast + nn.global_avg_pool2d(uint8) + cast => Global AVG pool
  * power(2) + nn.avg_pool2d + sqrt => L2 pooling (for fp32 only)

Tests updated to reflect these changes.

Change-Id: I1644b67b60ebb252344eb9695a521d2d958c724e

* Address comments

Change-Id: Ibe8a61b4c42da246ce54701c89ea985b423c8f83

* Fix not checking output saturation

Change-Id: Ia6f3d9db31cfb8c417d8556d29961210fea418b2

* Use defined set of trials

Change-Id: Ib180e3a0cbb84d6fa00c7e1994f58cb62662db15

* Rebase master

Change-Id: I5c932751cd38da06d6f2b397be5d8ab7fdeb169f

commit | commitdiff | tree

FernChen [Thu, 27 Aug 2020 03:26:58 +0000 (11:26 +0800)]

[Caffe Frontend] introduce caffe frontend for tvm (#6206)

* [Caffe Frontend] introduce caffe frontend for tvm.

* [Caffe Frontend] fix bugs for generating caption in tutorial.

* [Caffe Frontend] delete statement for python2 and modify the function name.

* [Caffe Frontend] change the directory which will hold the tmp files
when testing the caffe frondend.

* [Caffe Frontend] delete tutorial about caffe frontend.

* [Caffe Frontend] delete some print statements

Co-authored-by: fernchen <zifeng.cf@alibaba-inc.com>

commit | commitdiff | tree

Rishabh Jain [Thu, 27 Aug 2020 03:21:45 +0000 (08:51 +0530)]

[Relay/TOPI][TFLite] Implemented MATRIX_SET_DIAG Operator for Relay/TOPI and TFLite Frontend. (#6303)

* Corrected docstring error.

* Minor changes.

* Changed MATRIX_SET_DIAG registration from broadcast to injective.

commit | commitdiff | tree

Tristan Konolige [Thu, 27 Aug 2020 02:29:24 +0000 (19:29 -0700)]

[TESTS] add gpuonly tests for python unittests and integration (#6346)

commit | commitdiff | tree

quic-sanirudh [Thu, 27 Aug 2020 02:11:24 +0000 (07:41 +0530)]

Add `init` member to ReduceNode (#6138)

- This patch adds a new member to ReduceNode called init which allows
  initialization with a custom ProducerLoad or a Float/Int immediate.
- This allows initialization of the output Tensor of a reduction with
  another Tensor instead of the `identity_element` defined in the
  CommReducer
- One example use case for this node is to initialize the Output of a
  convolution reduction with the Bias values thereby saving the
  Bias-add computation.

commit | commitdiff | tree

mvermeulen [Thu, 27 Aug 2020 01:55:31 +0000 (20:55 -0500)]

ROCm changed name of library and removed old one in ROCm 3.7 release. (#6345)

commit | commitdiff | tree

yukatayu [Wed, 26 Aug 2020 16:52:23 +0000 (01:52 +0900)]

[MSVC] Make able to compile with MSVC (#6341)

* fix: make suitable for msvc, clang

* clang-format

* refactor: use DMLC_ATTRIBUTE

commit | commitdiff | tree

Lily Orth-Smith [Wed, 26 Aug 2020 08:21:05 +0000 (01:21 -0700)]

[DYN][RELAY] Resize support for NCHW-convertible layouts (#6293)

* fix lint

* fix typo

* remove channel_axis from resize shape func

* fix lint

commit | commitdiff | tree

Giuseppe Rossini [Wed, 26 Aug 2020 06:35:22 +0000 (07:35 +0100)]

Use auto-tuner to improve conv2d_gemm performance (#6117)

* Use auto-tuner to improve conv2d_gemm performance

The following tuning entities have been introduced:
- Unrolling and vectorizing input matrix transform
- Reordering gemm to exploit parallel threads
- Unrolling `gemm_quantized` intrinsic
- Interleaving `gemm_quantized` intrinsic

Change-Id: Icd3ab005663f78a80672e71ef368f6d0efa4a401

* Rebasing

Change-Id: Id27b6de705b16b93df8e885868961fa0321497be

* Fix python linting

Change-Id: I77d880424c3e7ce9de67c970ddb2cf2a92b52f79

* Fusing batch into inner dimensions before parallelizing

Change-Id: Ic58d1138ab96d58d12f5855f0e1044f10d9e6e9b

commit | commitdiff | tree

雾雨魔理沙 [Wed, 26 Aug 2020 03:37:22 +0000 (20:37 -0700)]

save (#6338)

commit | commitdiff | tree

雾雨魔理沙 [Tue, 25 Aug 2020 18:40:34 +0000 (11:40 -0700)]

[Relay] Make check stricter by using Feature. Fixed multiple bugs. (#6326)

* save

lint

lint

lint

fix lint

lint

update

lint

save

save

save

lint

format

format

save

save

fix

use a form more suitable for numeric check

save

* save

* save

* lint

* save

* lint

* fix

* fix

commit | commitdiff | tree

Zhi [Tue, 25 Aug 2020 16:07:02 +0000 (09:07 -0700)]

[FIX][VM] Fix relay vm optimize (#6322)

* [FIX][VM] Fix relay vm optimize

* retrigger ci

commit | commitdiff | tree

lhutton1 [Tue, 25 Aug 2020 16:04:20 +0000 (17:04 +0100)]

[BYOC][ACL] Enable remote device via environment variables (#6279)

* [BYOC][ACL] Enable remote device via environment variables

Improves the ACL remote testing infrastructure by allowing a remote device to be specified via environment variables. This means external scripts can be used to enable the runtime tests. By default an RPC server will not be used and the runtime tests will be skipped.

Change-Id: I8fc0b88106683ac6f1cbff44c8954726325cda21

* Use json file as configuration for tests

Change-Id: Iadce931d91056ed3a2d57a49f14af1ce771ae14b

* Do not load the test config during class creation

Change-Id: If718b5d163e399711111830f878db325db9c5f84

* Add check for existence of file

Change-Id: I2568bca7f4c3ad22ee8f9d065a9486ee3114f35c

commit | commitdiff | tree

Dongming Yang [Tue, 25 Aug 2020 00:08:41 +0000 (08:08 +0800)]

[Frontend][Relay] Keras softmax and prelu fix (#6278) (#6278)

* prelu and softmax with NHWC layout consideration

* fix lint

* fix lint

Co-authored-by: Dongming Yang <dongming.yang@streamcomputing.com>

commit | commitdiff | tree

masahi [Mon, 24 Aug 2020 20:15:23 +0000 (05:15 +0900)]

[OpFusion] Make the max number of fused ops configurable (#6327)

commit | commitdiff | tree

Yong Wu [Mon, 24 Aug 2020 13:19:57 +0000 (21:19 +0800)]

[Relay] Support for PyTorch Non-Maximum Suppression (#6314)

* [Relay] Support for PyTorch Non-Maximum Suppression

* fix comment

* add verify_model_vm

commit | commitdiff | tree

Chenfan [Mon, 24 Aug 2020 10:46:31 +0000 (18:46 +0800)]

[Ansor][AutoTVM v2.0] Phase 2: Basic GPU Sketch Search Policy (#6269)

* Add PreloadMeasuredStates & Split search_policy.py

* Add GPU sketch rule

* Update

* Bug fix for log record

* Lint fix

* Update tutorial

* Update

* UT fix

* Remove tutorial

* Update

* Update

* Update UT

* Lint fix

* Update

* Update

commit | commitdiff | tree

Tianqi Chen [Sat, 22 Aug 2020 15:30:48 +0000 (08:30 -0700)]

[RUNTIME][FFI] Fix cython FFI compact with np.int64 (#6321)

commit | commitdiff | tree

雾雨魔理沙 [Sat, 22 Aug 2020 05:11:50 +0000 (22:11 -0700)]

[Relay] Make check stricter: disallow inserting function with free vars into module. (#6313)

* save

lint

lint

fix test

fix test

* fix

commit | commitdiff | tree

Yao Wang [Sat, 22 Aug 2020 01:54:26 +0000 (18:54 -0700)]

[Frontend][Pytorch]Add Pytorch advanced indexing (#6318)

* Add Pytorch advanced indexing

* Minor fix for test

* Fix for cuda

commit | commitdiff | tree

Lily Orth-Smith [Sat, 22 Aug 2020 01:03:43 +0000 (18:03 -0700)]

[BUG_FIX] Fix resize test (#6298)

* fix resize tests

* add different scale to resize tests

* fix dynamic to static resize test

* fix error throwing in topi resize

* fix topi and importer tests

* fix lint

* flakey test failed

* make resize test less sensitive; had floating point rounding err on gpu

* remove nearest_neighbor + half_pixel option from pytorch importer

* remove nearest_neighbor + half_pixel in upsample3d

commit | commitdiff | tree

sandyhu533 [Fri, 21 Aug 2020 20:47:47 +0000 (04:47 +0800)]

[RELAY][MXNET][FRONTEND] add support for MXNET numpy operators (#6054)

* [RELAY][MXNET][FRONTEND] add supports for OPs in numpy from mxnet

* Update test_forward.py

* Update mxnet.py

* Update mxnet.py

* Update test_forward.py

* update and bugfix

* test for multiple dtypes

* Update test_forward.py

* add data type and optimize coding style

* replace pytest.skip with @pytest.mark.skipif

* Update test_forward.py

* update pytest style

* Update test_forward.py

* Update test_forward.py

* Update test_forward.py

* Update test_forward.py

Co-authored-by: Ubuntu <ubuntu@ip-172-31-39-169.ap-northeast-1.compute.internal>

commit | commitdiff | tree

Tianqi Chen [Fri, 21 Aug 2020 17:53:39 +0000 (10:53 -0700)]

[TIR] Enforce buffer pointer var type to be consistent with dtype. (#6317)

Now that we have type_annotation in tir::Var.
We should make sure that the type annotation to be consistent with the dtype
in Buffer declaration and Allocation.

This change allows future passes to directly use the content type information via type_annotation.

This PR turns on the enforcement on Buffer and also fixed a few cases for Allocate.
A follow up PR need to fix a few more cases in the hybrid script parsing
before everything can be made consistent.

commit | commitdiff | tree

Rishabh Jain [Fri, 21 Aug 2020 17:32:42 +0000 (23:02 +0530)]

Retrigger build. (#6304)

commit | commitdiff | tree

Lily Orth-Smith [Fri, 21 Aug 2020 16:06:53 +0000 (09:06 -0700)]

[RELAY][DYN] Dynamic upsampling relay op (#6273)

* implementing upsampling op

* fix lint

* fix lint again

* add doc to upsampling shape func

* fix set attrs build problem

* fixing imports

* reverting data layout transform changes

* moved layout template to header file

* changing python module from nn.dyn to dyn.nn

* adding support for more layouts to upsampling

* fix lint

* fix upsampling doc

* change _nn.py doc

* failed flakey test

* fix build after merge

commit | commitdiff | tree

Zhao Wu [Fri, 21 Aug 2020 14:20:19 +0000 (22:20 +0800)]

[TEST][FLAKY] fix random fail (#6312)

* [TEST][FLAKY] fix random fail

* increase size and error check range

commit | commitdiff | tree

Josh Fromm [Fri, 21 Aug 2020 14:04:56 +0000 (07:04 -0700)]

[Target] Add python binding to new JSON target construction. (#6315)

* Add python binding to new JSON target construction.

* Added json string parsing and new test.

* Add error type.

* Add error type in json decoding check.

* Fix sphinx formatting.

commit | commitdiff | tree

Siju Samuel [Fri, 21 Aug 2020 04:07:04 +0000 (09:37 +0530)]

[COREML]multiple output support, reshape, split ops added (#6296)

* [COREML]multiple output support, reshape, split ops added

* Review comments addressed

commit | commitdiff | tree

masahi [Fri, 21 Aug 2020 02:29:45 +0000 (11:29 +0900)]

[Torch] Fix dtype handling for modules with integer parameters (#6311)

* return the correct type for GetAttr node

* keep _get_pytorch_value_type intact

* add test and handle quantized param

commit | commitdiff | tree

Matthew LeMay [Fri, 21 Aug 2020 00:45:17 +0000 (20:45 -0400)]

Changed TVMCTVMContext to TVMContext (#6306)

commit | commitdiff | tree

Dmitriy Smirnov [Thu, 20 Aug 2020 18:44:04 +0000 (19:44 +0100)]

Constant input attr added to fully connected operation in TFLite frontend (#6228)

* Constant input attr added to fully connected operation

An ability to handle constant input attr added to fully connected operation
Unit tests amended.

* renamed wrap_input to const_input

* removed extra spaces

commit | commitdiff | tree

Lily Orth-Smith [Thu, 20 Aug 2020 01:57:05 +0000 (18:57 -0700)]

[RELAY][DYN] Implementation of the dynamic pad operator (#6284)

commit | commitdiff | tree

Lianmin Zheng [Thu, 20 Aug 2020 01:38:10 +0000 (18:38 -0700)]

[AutoScheduler] Fix flaky test (#6307)

commit | commitdiff | tree

ZHANG Hao [Wed, 19 Aug 2020 16:49:00 +0000 (00:49 +0800)]

[Relay] change device annotation from post DFS to recursive (#6124)

* change device annotation from post DFS to recursive

* add testcast for recursive device propogation

commit | commitdiff | tree

mbaret [Wed, 19 Aug 2020 16:39:33 +0000 (17:39 +0100)]

[BYOC][ETHOSN] Introduce the Ethos-N BYOC integration (#6222)

* [BYOC][ETHOSN] Introduce the Ethos-N BYOC integration

This is the first of 3 PRs to introduce the Ethos-N
integration into TVM via the BYOC framework. It adds
support for partitioning and compiling for the
Ethos-N77 target with CPU fallback for unsupported
operators. Additionally, runtime support is added in
the form of an Ethos-N runtime module. In this initial
PR, only quantized concatenate and split are supported
with follow-up PRs adding support for many further operators.

Co-authored-by: Leo Blonk <Leo.Blonk@arm.com>
Co-authored-by: Tristan O'Connor <tristan.oconnor@arm.com>
Co-authored-by: Leandro Nunes <leandro.nunes@arm.com>
Co-authored-by: Ramana Radhakrishnan <ramana.radhakrishnan@arm.com>
Co-authored-by: Luke Hutton <luke.hutton@arm.com>
* Turn off USE_ETHOSN_HW by default

Change-Id: Ie2ce4528e16e93aa83df46f8a229c0ce89b45252

* Update capabilities file

Change-Id: Iebd0c62d6bc7e446662abdee4882ac874ad98aa3

* Fix missing header

Change-Id: I0c89e380dd1d795755a1884c06a7b317a99fe297

* Update cmake comments on ETHOSN_HW

Change-Id: I2e96a1c818a82e5174fd94e483b0bdb3e4375a7d

* Add checker for case when USE_ETHOSN=OFF and USE_ETHOSN_HW=ON

Change-Id: Id5c9cfb866914a0298b44ead40fcbe3764ce443c

* Fix 'available' boolean

Change-Id: I78e54fb9f472d2815886bea4d94b7247e0d129de

* Check availability in op registration

Change-Id: Iecfea7dca7301dd684199c9b32f99f2113fdfd56

* Remove unnecessary line

Change-Id: Idf5cab853027adb0b0292de877e6dc02683821d7

* Simplify getting output_size

Change-Id: If4643924768c2d7ea98525e9f792b7223cc2bcdf

* Remove unnecessary new line

Change-Id: Ia689c59cac28bd91e237ceecd829d8cf56d0d9c1

* Remove NOLINTS

Change-Id: I149b97b28b516c7d9288a0858b2fbf1497e70250

* Remove unused parts of PR

Change-Id: I2db5b89d8fe2c114ab92305cdcf06d0fc45f4d2a

* Fix CI Ethos-N settings

Change-Id: Idd955755d6f6d1cd3843462f627d0d952729e467

* Removed unnecessary line in infra

Change-Id: I0ea866adf5d9166db85dd82d013a631d991ae633

* Remove unnecessary len in infra

Change-Id: I869e8233d41c6ab7c2dc80f47d976c974043b80c

* Rename 'cpu_ops' to 'host_ops'

Change-Id: I79a6ffcfd48cd055d279f493c672ec82f0c68e5c

* Added explanation on mocking

Change-Id: I1e88c07a47464e44cb45c6a327ec9c7e2d70cc94

* IsEthosOp -> IsEthosnOp

Change-Id: I4fc1b462a74f8fae231ebafac614dd8d45be0feb

* Improve documentation in ethosn_api.h

Change-Id: I5586a7ba7ce71da667a6a9c6dd2e591028eb43b2

* No longer iterate over module when compiling

Change-Id: I80e1d494c6d574be06a2375e831343485712914d

* Move EthosnCompiler implementations into codegen.cc

Change-Id: I5bb6e9f62722d930d9dc040ac62bf87f29dd74c5

* Fix linting

Change-Id: Ia44ec741a5330ad289cc6b5cd2bb1ed784fe6afc

* Refactor EthosnAPI compilation functions into EthosnCompiler

Change-Id: Iee0aecbe43a84fefb437ab9ff064e3f8b42c80a4

* Improve docs for Tvm2Npu

Change-Id: Ia39e9e1508513ca39c1d585fbccc3ae38fcbb9fb

* Move more implementation out of headers

Change-Id: I1e33084ceb520b75f06b4d7a4acff5b9b2225bd5

* Move implementation in ethosn_api.h

Change-Id: I51ab386892a2aa84aa47d03641aac8468f5737ae

* Improve docs for capabilities.h

Change-Id: Iaaee508aafa1cbb7650a04ed87bd6c1b91823a58

* Use else() in cmake

Change-Id: I4b64a87f32b3616ec87c9937d9fc998b8dc5d7b4

* Use GetDataSize

Change-Id: I16988f3adbe6e03fc47fa0a77cb5febb7a02eaab

* Use const&

Change-Id: I664982d219f9a7d1f961dbfe84d12f66e2e5f5cb

* Fix python linting

Change-Id: Id965ccc037fd40cbdfcb58d922cc8d5fb8c87dfe

* Remove load/save to file

Change-Id: I7f8c3f5c8948c3f15551d28e3fee6e00120663ef

* data->data

Change-Id: Ifb861ebbfeaaf4b154f4b1515f83a46aecf86e50

* Remove specific cpu target

Change-Id: I920568cc7a81cd77d44f8604f571340a330f3e62

* Test export/load module

Change-Id: Ib605458127485e2015ac012ec515ced5900705f3

* Fix cmake garbage

Change-Id: I32f3c967192c7c278ef33c52cac5fb5da682cd1b

Co-authored-by: Leo Blonk <Leo.Blonk@arm.com>
Co-authored-by: Tristan O'Connor <tristan.oconnor@arm.com>
Co-authored-by: Leandro Nunes <leandro.nunes@arm.com>
Co-authored-by: Ramana Radhakrishnan <ramana.radhakrishnan@arm.com>
Co-authored-by: Luke Hutton <luke.hutton@arm.com>

commit | commitdiff | tree

masahi [Wed, 19 Aug 2020 11:50:36 +0000 (20:50 +0900)]

[Torch] Fix cast to long (#6301)

* [Torch] fix cast to long

* retrigger

commit | commitdiff | tree

Krzysztof Parzyszek [Wed, 19 Aug 2020 09:17:07 +0000 (04:17 -0500)]

[Hexagon] Initial support for Hexagon codegen (#6261)

* [Hexagon] Initial support for Hexagon codegen

This commit does not support parallel execution or prefetch.
LLVM 7 or later is required.

* Set native_vector_bits_ based on target features

* Initialize hvx_bytes

* Remove commented out line

commit | commitdiff | tree

Lianmin Zheng [Wed, 19 Aug 2020 00:51:10 +0000 (17:51 -0700)]

[Ansor][AutoTVM v2.0] Phase 1: XGBoost Cost Model (#6270)

* port xgb cost model

* add xgboost cost model

* fix lint

* address comments

* address comments

* Fix

commit | commitdiff | tree

Chenfan [Tue, 18 Aug 2020 23:21:09 +0000 (07:21 +0800)]

[Support] Add parallel_for support to run a loop in parallel (#6275)

commit | commitdiff | tree

Andrew Reusch [Tue, 18 Aug 2020 23:20:38 +0000 (16:20 -0700)]

Add tvm::support::hexdump() debug utility (#6154)

commit | commitdiff | tree

Yizhi Liu [Tue, 18 Aug 2020 23:19:39 +0000 (16:19 -0700)]

[Autodiff] Optimize and eliminate the Jacobian tensor for te.autodiff (#6078)

* [Autodiff] Optimize and eliminate the Jacobian tensor for te.autodiff

Co-authored-by: Sergei Grechanik <sergei.grechanik+h@gmail.com>
* fix lint

* fix clang-format

* add comments and magic number

* clang-lint

* address some comments

* remove FreeVarsVisitor

* fix constexpr lint

* fix lint

* fix lint

* add Map.Merge

* lint

* change Array::Concat & Map::Merge to global functions

* fix lint

* move functions to global

* static -> inline

Co-authored-by: Sergei Grechanik <sergei.grechanik+h@gmail.com>

commit | commitdiff | tree

Tianqi Chen [Tue, 18 Aug 2020 23:08:03 +0000 (16:08 -0700)]

[RUNTIME][REFACTOR] Use new to avoid exit-time de-allocation order problem in DeviceAPI (#6292)

commit | commitdiff | tree

Dmitriy Smirnov [Tue, 18 Aug 2020 16:43:29 +0000 (17:43 +0100)]

Gather operation with indices as tensor expr in TFLite frontend (#6168)

* gather with indices as tensor expr

Added handling of indices as tensor expr
to gather operation, unit tests amended
Code cheking out of boundary error refactored
in more "pythonic" way. Fixed bug in negative
axis value normalisation

* replaced with get_tensor_expr

commit | commitdiff | tree

masahi [Tue, 18 Aug 2020 14:06:33 +0000 (23:06 +0900)]

[Torch] Support index_select (#6295)

* support index select

* minor fix

Co-authored-by: masa <masa@pop-os.localdomain>

commit | commitdiff | tree

Tianqi Chen [Tue, 18 Aug 2020 05:50:57 +0000 (22:50 -0700)]

[TEST] use rpc.LocalSession for simple tests (#6294)

To avoid flaky due to networking.

commit | commitdiff | tree

GaryYuyjl [Tue, 18 Aug 2020 05:32:30 +0000 (13:32 +0800)]

[TOPI] Support int4/int8 conv2d tensor core with HWNC layout (#6121)

* int4 tensorcore

* a draft for new int4 schedule

* update layout

* add inline option

* clean code

* increase search space

* fix kernel shape

* update intrinsic

* update intrinsic

* support int4/int8 hwnc layout

* remove useless code

* remove useless code

* remove useless code

* remove useless code

* fix int8 transpose

* fix assert

* add asf header

* CI

* CI

* CI

* fix bug

fix bug

Co-authored-by: Leyuan Wang <laurawly@gmail.com>

commit | commitdiff | tree

Tianqi Chen [Mon, 17 Aug 2020 22:45:56 +0000 (15:45 -0700)]

[COMMUNITY] @kparzysz-quic -> committer (#6290)

commit | commitdiff | tree

Chris Sullivan [Mon, 17 Aug 2020 22:22:51 +0000 (15:22 -0700)]

[ONNX] Update slice to infer attributes when not graph inputs (#6276)

* Update ONNX Slice converter to infer slice attributes when necessary.

* Linting

commit | commitdiff | tree

Zhao Wu [Mon, 17 Aug 2020 16:38:31 +0000 (00:38 +0800)]

[random] support random fill (#5913)

commit | commitdiff | tree

Junru Shao [Mon, 17 Aug 2020 14:18:24 +0000 (07:18 -0700)]

[Build] Add cmake options into libinfo (#6286)

* [Build] Add cmake options into libinfo

* Address comments from @tqchen

* Add LLVM version to libinfo

commit | commitdiff | tree

Iswariya Manivannan [Mon, 17 Aug 2020 03:53:38 +0000 (05:53 +0200)]

[Frontend][Relay] Fix node indices attribute error for tensorflow 2.3 (#6288)

* Fix errors caused due to node attributes

* Add node_indices attr for old keras pkg support

commit | commitdiff | tree

Leandro Nunes [Sun, 16 Aug 2020 17:45:44 +0000 (18:45 +0100)]

TVMC - a command line driver for TVM (#6112)

* Introduce a command line driver to compile, run and tune models, using TVM graph runtime
* Include tvmc tests and integrate tvmc with linting, testing and CI
* RFC: https://discuss.tvm.ai/t/rfc-a-tvm-command-line-interface/5165

Co-authored-by: Marcus Shawcroft <marcus.shawcroft@arm.com>
Co-authored-by: Matthew Barrett <Matthew.Barrett@arm.com>
Co-authored-by: Dmitriy Smirnov <dmitriy.smirnov@arm.com>
Co-authored-by: Luke Hutton <luke.hutton@arm.com>
Co-authored-by: Giuseppe Rossini <giuseppe.rossini@arm.com>
Co-authored-by: Matthew Barrett <matthew.barrett@arm.com>
Co-authored-by: Elen Kalda <elen.kalda@arm.com>
Co-authored-by: Ramana Radhakrishnan <ramana.radhakrishnan@arm.com>
Co-authored-by: Jeremy Johnson <jeremy.johnson@arm.com>
Co-authored-by: Ina Dobreva <Ina.Dobreva@arm.com>
Co-authored-by: Marcus Shawcroft <marcus.shawcroft@arm.com>
Co-authored-by: Matthew Barrett <Matthew.Barrett@arm.com>
Co-authored-by: Dmitriy Smirnov <dmitriy.smirnov@arm.com>
Co-authored-by: Luke Hutton <luke.hutton@arm.com>
Co-authored-by: Giuseppe Rossini <giuseppe.rossini@arm.com>
Co-authored-by: Elen Kalda <elen.kalda@arm.com>
Co-authored-by: Ramana Radhakrishnan <ramana.radhakrishnan@arm.com>
Co-authored-by: Jeremy Johnson <jeremy.johnson@arm.com>
Co-authored-by: Ina Dobreva <Ina.Dobreva@arm.com>

commit | commitdiff | tree

Logan Weber [Sat, 15 Aug 2020 21:48:24 +0000 (14:48 -0700)]

Add Quantize/Dequantize Partitioning (#5940)

* Implement quant/dequant partitioning

on our way

get clooooooser

clean up (part 1)

clean up (part 2)

clean up (part 3)

clean up (part 4)

clean clean

cleaanaannanaaananaananaananaan

clkjsdflkjlfsjdflkj

revert parser changes

add docs

roll lint

roll lint

* add option to toggle fully integral check

* convert dtype collector to C++

* remove need for `with_dtype`

* remove unused imports

* roll lint

* partially address feedback

* roll lint

* upgrade to new parser

* retrigger CI

* roll the dice again

commit | commitdiff | tree

Junru Shao [Sat, 15 Aug 2020 14:05:40 +0000 (07:05 -0700)]

[Build] Reflect Compile-Time CMake Options into libtvm.so (#6280)

* Initial comit

* Address comments from @tqchen

commit | commitdiff | tree

Tianqi Chen [Sat, 15 Aug 2020 03:41:30 +0000 (20:41 -0700)]

[CI] Update ci-cpu to the latest (#6283)

commit | commitdiff | tree

Tristan Konolige [Sat, 15 Aug 2020 03:06:24 +0000 (20:06 -0700)]

Improve error messages for memory verifier and gpu memory verifier (#6281)

* [FIX] Print exactly what issues the GPU memory verifier encountered.

* [FIX] Print exactly why memory verifier failed.

commit | commitdiff | tree

Junru Shao [Sat, 15 Aug 2020 01:33:37 +0000 (18:33 -0700)]

[Target] Creating Target from JSON-like Configuration (#6218)

* [Target] Creating Target from JSON-like Configuration

* Address comments from Cody

* fix unittest

* More testcases as suggested by @comaniac

commit | commitdiff | tree

Dmitriy Smirnov [Fri, 14 Aug 2020 18:54:29 +0000 (19:54 +0100)]

Added support for tflite quantized maximum and minimum (#6018)

* Added support for tflite quantized maximum and minimum

* Unit test simplified

Bugfix in unit test. Unit test slightly simplified

* re-trigger CI

* renamed use_real_qnn to ignore_qnn_params

commit | commitdiff | tree

Matthew Brookhart [Fri, 14 Aug 2020 17:27:20 +0000 (10:27 -0700)]

Update precision in the ONNX strided_slice, update precision of ToScalar (#6272)

* Update precision in the ONNX strided_slice, update precision of ToScalar

* fix tests

commit | commitdiff | tree

Tristan Konolige [Fri, 14 Aug 2020 16:15:00 +0000 (09:15 -0700)]

[TESTS] Decrease test times by introducing testing model (#6235)

Adds a new testing model `tvm.relay.testing.synthetic` which is a small,
but representative model. Replaces resnet with this model in many tests.

commit | commitdiff | tree

Jared Roesch [Fri, 14 Aug 2020 16:13:42 +0000 (09:13 -0700)]

[Parser] Add support for parsing the any dimension. (#6277)

* Add case for any dimensions

* Fix second test case

commit | commitdiff | tree

wjliu [Fri, 14 Aug 2020 05:17:38 +0000 (13:17 +0800)]

add dilation in x86 NCHWc depthwise conv support (#4962) (#6267)

commit | commitdiff | tree

lhutton1 [Thu, 13 Aug 2020 16:21:35 +0000 (17:21 +0100)]

[BYOC][ACL] Add support for dense (fully connected) layer (#6254)

* [BYOC][ACL] Add support for dense (fully connected) layer

This patch adds the ability to offload dense (or fully connected) operators to ACL.

For fp32 a single dense layer can be offloaded, or the composite variant: nn.dense, nn.bias_add? (ACL does not currently offer fused activation).
For uint8: qnn.dense, nn.bias_add?, qnn.requantize

Change-Id: I83ea00b2aa6bdc5d9ef5cd6d54bbf981e523bd14

* Don't offload dense layer with unsupported datatype

Change-Id: I856eb2298499fdf22c172ba7f85d21033d3cc920

commit | commitdiff | tree

Lily Orth-Smith [Thu, 13 Aug 2020 16:08:34 +0000 (09:08 -0700)]

[Relay][Dyn] Dynamic full operator (#6260)

* moved full from other branch

* fixed some typos

* fix lint

* add final newline

* fix int64 test

commit | commitdiff | tree

cloud-mxd [Thu, 13 Aug 2020 15:36:59 +0000 (23:36 +0800)]

fix cuda half math function is undefined: hpow, htanh (#6253)

commit | commitdiff | tree

Tom Gall [Thu, 13 Aug 2020 15:34:34 +0000 (10:34 -0500)]

Trivial fix, up the rodata section for the discovery board to 512 bytes. (#6259)

This is more reasonable as the trivial tflite example module needs 208 bytes.

Signed-off-by: Tom Gall <tom.gall@linaro.org>

commit | commitdiff | tree

Tianqi Chen [Thu, 13 Aug 2020 15:34:04 +0000 (08:34 -0700)]

[LINT] Fix clang-format (#6264)

commit | commitdiff | tree

Giuseppe Rossini [Thu, 13 Aug 2020 14:41:45 +0000 (15:41 +0100)]

Improve NHWC depthwise convolution for AArch64 (#6095)

* Improve NHWC depthwise convolution for aarch64

We created a default schedule (no auto-tuning or tensorization) named
depthwise_conv2d_nhwc which does a decent job at optimizing depthwise
for NHWC layouts (on aarch64).

Change-Id: I01e32903f6c1950623f33eae18484e70244fe0af

* Add tuning knobs in depthwise schedule

Change-Id: I15080e7f12b16e6c6aba99a04e42023845eeabf1

* Introduce padding policy

Change-Id: If12a6d05dce9153861550ddef1ee5216809dd1e1

* Vectorize padding

Change-Id: I7e2062a40358bf111c0366a449945eb077fb2e30

* Legalize depthwise convolution (2x improvement) and fix tuning issue

Change-Id: I4b82c58b167e40b0b7747d28293bbb488c505dd9

* Adding assert on padding

Change-Id: Idf8eeaaface5eb7799109cd00f437e404778b9cd

* Fix python linting

Change-Id: Iac16a8daea1268f0eb331fe4ec18a62408106cf9

* Removing commented code

Change-Id: I1412f22ad9864273d77a7bf38a6768694339b7f0

* Revert test file to make CI pass

Change-Id: Ica3eff8f9f0fd4c6f32f7ae80adc922f8b16cec9

* Enabling only arm_cpu tests

Change-Id: Icbaafcb39e892a5d1a4685133c1699e4d1a8e07e

* Rebasing

Change-Id: Ibb23f1d4e0d0107e4e3b3571437161cdc2ee2909

commit | commitdiff | tree

Tom Gall [Thu, 13 Aug 2020 03:18:37 +0000 (22:18 -0500)]

update tutorial to new TARGET as micro_dev is no more (#6262)

Signed-off-by: Tom Gall <tom.gall@linaro.org>

commit | commitdiff | tree

Leandro Nunes [Wed, 12 Aug 2020 21:18:36 +0000 (22:18 +0100)]

[CI] Add apt repository for clang-11 and llvm-11 (#6256)

- Add specific apt repositories to install clang-11 and llvm-11
- Fix #6255

commit | commitdiff | tree

Lianmin Zheng [Wed, 12 Aug 2020 21:00:49 +0000 (14:00 -0700)]

[Ansor][AutoTVM v2.0] Phase 1: feature extraction for cost models (#6190)

* [AutoScheduler] add feature extraction

* fix lint

* fix gpu test

* address comments

* improve flop estimation

* rebase

* refactor with group

* fix

* Apply suggestions from code review

commit | commitdiff | tree

Trevor Morris [Wed, 12 Aug 2020 16:00:55 +0000 (09:00 -0700)]

Support mxnet _contrib_SyncBatchNorm (#6245)

commit | commitdiff | tree

Siju Samuel [Wed, 12 Aug 2020 15:50:42 +0000 (21:20 +0530)]

[COREML]Reduceops support added to frontend (#6252)

commit | commitdiff | tree

Balint Cristian [Wed, 12 Aug 2020 15:50:16 +0000 (18:50 +0300)]

Fix newer GCC compiler warnings. (#6257)

commit | commitdiff | tree

Zhi [Wed, 12 Aug 2020 08:12:56 +0000 (01:12 -0700)]

[TOPI] Fix reduction (#6250)

commit | commitdiff | tree

Tianqi Chen [Wed, 12 Aug 2020 00:32:57 +0000 (17:32 -0700)]

Revert "fix cuda half math function is undefined: hpow, htanh (#6225)" (#6249)

This reverts commit ed04cdd35f1990959ec788be0131b1388fd11d31.

commit | commitdiff | tree

Jared Roesch [Tue, 11 Aug 2020 23:31:03 +0000 (16:31 -0700)]

[Parser] Parser 2.0 part 2 (#6162)

* Add code from livestream with JK

* Fix errors parsing ResNet

* Parse metadata section efficiently and do most of plumbing to resolve metadata section references.

* WIP

* Change meta reference to an operator

* Meta references now work

* MetaReference expansion now works

* Start working on source map and move diagnostic context

* Convert tokenizer and parser to use new machinery

* Kill to_json

* Fix comment in type_infer.cc

* Remove old parser

* Rename parser tests and remove old ones

* Record span end information

* Convert to using spans everywhere

* Add span fields back to all Relay constructors

* Start passing spans

* Pass spans around visitors

* Format

* Fix

* Fix

* disable reference lint from string helpers

* Fix tokenizer

* Fix issue with empty metadata section

* Document new span fields and small tweaks

* Formatting

* Add span doc fields

* Add format tweak

* Improve errors and fix the semantic version tags in Prelude

* Update gradient.rly

* Clean up broken spans

* Clean up parser tests and turn on previously skipped tests

* Update errors to handle skipped cases

* Tweak

* Tweak

* Format

* Fix some minor issues with ADT tests

* Format

* Fix path

* WIP

* WIP

* Fix ir_text_printer

* format

* Formatted

* More formatting

* Repair test cases

* Fix CI

* Retrigger CI

Domain: Machine Learning / ML Framework;

RSS Atom