review.tizen.org Git - platform/upstream/tvm.git/log

projects / platform / upstream / tvm.git / log

summary | shortlog | log | commit | commitdiff | tree
first ⋅ prev ⋅ next

commit | commitdiff | tree

Tianqi Chen [Sat, 11 Apr 2020 00:07:20 +0000 (17:07 -0700)]

[RUNTIME] Introduce RValue reference(move) support to TypedPackedFunc (#5271)

* [RUNTIME] Introduce RValue reference(move) support to TypedPackedFunc

This PR introduces RValue reference support the PackedFunc calling convention to address the above issue.
Specifically, when an argument is a r-value reference, we will use a assign a different type code(`kObjectRValueRefArg`),
and pass `Object**` (the address to the Object pointer) instead through the values array.
The callee can choose to move out this Object pointer and set the original Object pointer from the caller side to be nullptr.

We also add an experimental move support to the python side(marked as _move so to indicate the dev nature).
This enhancement will enable copy on write optimizations through out the TVM stack.

* Address review comments

* fix compilation

commit | commitdiff | tree

Huacong Yang [Fri, 10 Apr 2020 21:46:03 +0000 (05:46 +0800)]

[RELAY][FRONTEND][CAFFE2] add Mul and ConvTranspose operator (#5302)

commit | commitdiff | tree

Cody Yu [Fri, 10 Apr 2020 21:32:56 +0000 (14:32 -0700)]

[BYOC] Refine AnnotateTarget and MergeCompilerRegion Passes (#5277)

* add target to region

* refactor annotate_target

* Make all unit test working

* quick fix

* enable BN, unit test failed

* Fix vm test, unit test. Refactor annotate_target a bit.

* quick fix fusion

* revert fusion change

* style fix

* Refactor merge region pass

* format

* minor fix

* Skip e2e test

* lint

* support AnnotateTarget multiple runs

* Add HasAttr and revert DNNL codegen

* address comment

Co-authored-by: Zhi Chen <chzhi@amazon.com>

commit | commitdiff | tree

Tianqi Chen [Fri, 10 Apr 2020 17:58:59 +0000 (10:58 -0700)]

[CI] Fix the hexagon string (#5304)

commit | commitdiff | tree

Yizhi Liu [Fri, 10 Apr 2020 15:11:21 +0000 (08:11 -0700)]

[Arith] linear system and equation solver (#5171)

* [arith] linear system and equation solver

Co-authored-by: Sergei Grechanik <sergei.grechanik+h@gmail.com>
* avoid constructing analyzer every time

* generate random test cases and address comments

Co-authored-by: Sergei Grechanik <sergei.grechanik@gmail.com>
* rename linear_system to int_constraints

* add comments and use random seed

* message for reporting failure with seed

* add SEqualReduce to IntConstraints; allow variables & ranges to be None

Co-authored-by: Sergei Grechanik <sergei.grechanik+h@gmail.com>
Co-authored-by: Sergei Grechanik <sergei.grechanik@gmail.com>

commit | commitdiff | tree

Samuel [Fri, 10 Apr 2020 15:08:56 +0000 (20:38 +0530)]

[PYTORCH]Repeat, Reciprocal & Reshape Op support (#5280)

commit | commitdiff | tree

MORITA Kazutaka [Fri, 10 Apr 2020 14:47:53 +0000 (23:47 +0900)]

[FRONTEND][TENSORFLOW] Fix gather_nd indices (#5279)

* [FRONTEND][TENSORFLOW] Fix gather_nd indices

* retrigger CI

commit | commitdiff | tree

weiliangweiliang [Fri, 10 Apr 2020 14:47:19 +0000 (22:47 +0800)]

Update device_annotation.cc (#5291)

commit | commitdiff | tree

Zhi [Fri, 10 Apr 2020 14:46:23 +0000 (07:46 -0700)]

[REFACTOR][IR] Move to runtime::String (#5276)

* Use runtime::String

* move string to tvm namespace

* add const char* constructor

* implicit cast from std::string

commit | commitdiff | tree

hlu1 [Fri, 10 Apr 2020 14:42:54 +0000 (07:42 -0700)]

[NDArray] Set shape_ in NDArray::FromDLPack (#5301)

commit | commitdiff | tree

Krzysztof Parzyszek [Fri, 10 Apr 2020 13:47:59 +0000 (08:47 -0500)]

[RUNTIME] Initial implementation of Hexagon runtime support (#5252)

* [RUNTIME] Initial implementation of Hexagon runtime support

This is only the TVM runtime. The FastRPC libraries, simulator driver,
etc. will be provided in subsequent commits.

* Fix pylint complaints

* Fix some more pylint complaints

* Add link to the Hexagon SDK website

* Extract VTCM marker into a common variable

* Implement device->device memory copy

* Disable unsigned PDs by default

* Ensure that --hvx_length is present in sim_args if HVX is enabled

* Remove the line about clang from README.md

Apparently things work with libstdc++.

* Mention to set USE_RPC=OFF when building libtvm_runtime.so for Hexagon

* Remember to use codegen_hvx in validate_hvx_length

* Add a line about minimum version of LLVM

commit | commitdiff | tree

Cody Yu [Fri, 10 Apr 2020 07:29:39 +0000 (00:29 -0700)]

[BYOC] Refine DNNL Codegen (#5288)

* Improve DNNL

* Add bind params

* trigger ci

commit | commitdiff | tree

shoubhik [Fri, 10 Apr 2020 05:32:28 +0000 (22:32 -0700)]

Adding support for TFLite QnnSub operator. (#5230)

commit | commitdiff | tree

Tianqi Chen [Fri, 10 Apr 2020 05:04:08 +0000 (22:04 -0700)]

[NODE] General serialzation of leaf objects into bytes. (#5299)

This PR refactors the serialization mechanism to support general
serialization of leaf objects into bytes.

The new feature superceded the original GetGlobalKey feature for singletons.
Added serialization support for runtime::String.

commit | commitdiff | tree

Animesh Jain [Fri, 10 Apr 2020 04:01:35 +0000 (21:01 -0700)]

Legalize - Use Non-recursive Rewriter. (#5296)

* Legalize - Use Non-recursive Rewriter.

* Cleanup.

commit | commitdiff | tree

Yizhi Liu [Fri, 10 Apr 2020 03:58:43 +0000 (20:58 -0700)]

[Node] Provide guide to user who has difficulty register SEqualReduce (#5300)

commit | commitdiff | tree

Samuel [Fri, 10 Apr 2020 01:56:47 +0000 (07:26 +0530)]

[TENSORFLOW]reduce ops updated (#5180)

commit | commitdiff | tree

yongfeng-nv [Fri, 10 Apr 2020 01:49:37 +0000 (21:49 -0400)]

Create loops according to storage scope and thread hierarchies (#5190)

* Set IterVar index to 0 for local thread bound IterVars.

* Lint fix

* Use rank instead of scope name to predicate. Add tests.

* Handle cases other than local/threadIdx.

* Turn warp to the old behavior.

* Modify test to cover global/blockIdx.

* Fix a typo.

* Update test_te_schedule_ops.py with more testing coverage in test_local_stage_predicate; remove test_schedule_schedule_ops.py which was added by mistake.

commit | commitdiff | tree

Tianqi Chen [Fri, 10 Apr 2020 00:55:20 +0000 (17:55 -0700)]

[CI] Temporary disable CRT test (#5297)

commit | commitdiff | tree

Tianqi Chen [Thu, 9 Apr 2020 19:48:51 +0000 (12:48 -0700)]

[BUGFIX] Fix CRT static test bug (#5293)

* [CI][DOCS] Make sure to refresh the cython part

* [BUGFIX] Fix CRT static test bug

* Fix demo_static

* resolve review comment

commit | commitdiff | tree

Zhi [Wed, 8 Apr 2020 21:20:16 +0000 (14:20 -0700)]

[BUGFIX][IR] Fix String SEqual (#5275)

* fix String SEqual

* retrigger ci

commit | commitdiff | tree

Luis Vega [Wed, 8 Apr 2020 20:46:52 +0000 (13:46 -0700)]

update compiler version in docs (#5281)

commit | commitdiff | tree

Haichen Shen [Wed, 8 Apr 2020 03:55:48 +0000 (20:55 -0700)]

[LINT] Remove scalalint from lint deps (#5269)

commit | commitdiff | tree

Krzysztof Parzyszek [Wed, 8 Apr 2020 03:53:25 +0000 (22:53 -0500)]

[LLVM] Include Support/Host.h for declaration of getDefaultTargetTriple (#5268)

In newer versions of LLVM, this header is no longer included by one of
the already included headers in llvm_common.h, so include it explicitly.

commit | commitdiff | tree

Samuel [Wed, 8 Apr 2020 03:45:41 +0000 (09:15 +0530)]

[PYTORCH]celu, gelu, selu activations (#5263)

commit | commitdiff | tree

mbaret [Wed, 8 Apr 2020 03:12:15 +0000 (04:12 +0100)]

[RELAY][BYOC] Add support for composite functions in BYOC (#5261)

* [RELAY] Add 'check' functions to MergeComposite

Currently, MergeComposite can only perform structural
matches. This patch introduces the ability to specify
a 'check' function alongside the pattern which can include
custom logic to determine whether an extracted pattern
should be merged.

For example, if you only want to merge 'NHWC' convolutions,
you can specify a 'check' function which queries the
data_layout value of the extracted pattern (see the test).

Change-Id: I9337ce39f10997051a286d888be38ed0d410d340

* [RELAY] Reformat merge_composite.cc

Run clang-format on merge_composite.cc

Change-Id: I1736bff798cc6d93e57519b08ab3362869098779

* [RELAY][BYOC] Support composite functions in AnnotateTarget

This patch introduces support to annotate composite functions
in the AnnotateTarget pass. In order for a composite function
to be annotated, you should name it according to the style:

{codegen}.{name}
eg. dnnl.add_relu

Change-Id: I74d6c0b506153d866f6d1feb203b32dad59f2871

commit | commitdiff | tree

tobe [Tue, 7 Apr 2020 23:59:32 +0000 (07:59 +0800)]

[RUNTIME] Implement TVMDSOOp(TensorFlow custom op) for TVM runtime (#4459)

* Add implementation of TVMDSOOp

* feat: Update cmake script to work with c++11 and in-repo build

* feat: Use libtvm as oplib dependency

* fix: Add missing link dependency to libtvm

* feat: Update tf tvmdso op by review comments

* fix: Update with pr comments

* fix: Fix lint

* feat: Add test script and fix gpu shape

* feat: Add test script and fix gpu shape

* fix: Conditional build tftvm op for gpu

* fix: Conditional build tftvm op for gpu

* fix: Fix pylint of tf_op module.py

* fix: Fix pylint of tf_op module.py

* feat: Conditional enable gpu test for tftvm op

* feat: Conditional enable gpu test for tftvm op

* feat: Add tf_tvmdsoop test script as an app test

* fix: Fix gpu/cpu enabled check on tvm in test script

* fix: Make tf tvmdso op test script runnable with pytest

* remove unused test script test_tfop_module.py

* fix: Remove pushd & popd in tfdsoop test script

* fix: Upgrade tftvmop use python3 to find TensorFlow

* fix: Upgrade tftvmop use python3 to find TensorFlow

* fix: Change target_link_options to target_link_libraries

* fix: Add tftvmop build script's c++ option

* fix: Add tvm library path to tf op test library path

* fix: Debug ci build for tftvm dso op

* fix: Fix cmake error and skip tfop test

* fix: Fix typo and indentation issues

* feat: Use TF list input op def

* fix: Fix style and unexpected changes

Co-authored-by: baoxinqi <baoxinqi@4paradigm.com>
Co-authored-by: Chen Dihao <chendihao@4paradigm.com>
Co-authored-by: wrongtest <wrongtest@4paradigm.com>

commit | commitdiff | tree

Krzysztof Parzyszek [Tue, 7 Apr 2020 23:58:33 +0000 (18:58 -0500)]

[LLVM] Do not use x86_vcvtph2ps_256 intrinsic with LLVM 11+ (#5267)

This intrinsic was removed in LLVM 11.

commit | commitdiff | tree

Tianqi Chen [Tue, 7 Apr 2020 23:33:12 +0000 (16:33 -0700)]

[RUNTIME] Quick fix PackedFunc String passing (#5266)

commit | commitdiff | tree

Krzysztof Parzyszek [Tue, 7 Apr 2020 22:49:07 +0000 (17:49 -0500)]

[LLVM] Use llvm::ElementCount with LLVM 11+ when creating vectors (#5265)

LLVM 11 added support for scalable vectors, and now the number of
elements in a vector is represented by a llvm::ElementCount class,
not just a number.

commit | commitdiff | tree

Krzysztof Parzyszek [Tue, 7 Apr 2020 22:48:59 +0000 (17:48 -0500)]

[LLVM] Use llvm::Align with LLVM 11+ to avoid warnings (#5264)

LLVM 11 is introducing a separate class to represent alignment.
The functions in IRBuilder that create aligned loads and stores,
and which accept the alignment as an unsigned value have been
deprecated (and now cause warnings to be emitted).

commit | commitdiff | tree

Liangfu Chen [Tue, 7 Apr 2020 21:33:05 +0000 (05:33 +0800)]

[uTVM][Runtime] Introduce Virtual Memory Allocator to CRT (#5124)

* initial crt_memory and memory leak fix in graph_runtime

Change-Id: I0f79f909a04d1c677aabb80f202f0612c5ce7f2a

* fix memory leak

Change-Id: I37104c09e28112b1974fa2b064c809d0a8d686c3

* clean up

Change-Id: I039b12015a1d56c8f4120867cd5a5292da34f3e3

* implement vrealloc

Change-Id: I35800470bcbfcf96652494f359711cb4c2d34398

* allocate from stack memory for most of the variables

Change-Id: I72071289843fff4031c0df8796868a0b9fbc57ee

* allocate from stack memory for all of the variables

Change-Id: I32dba85ac1660c77f51c2d0d8ab6436ed0c01c74

* lint

Change-Id: If12cd240685d7791fc60bc0cfb66389cdc186b73

* lint

Change-Id: I7c9d90c11b60b8edda2427ebd189ebe535af2100

* facilitate the growth of TVM_CRT_MAX_NDIM

Change-Id: I939fa43027a5c7529c5c7c6bd8d6e6beb91b7581

* extend test coverage of vmalloc

Change-Id: Ie4ff6b64fdfe6810836cf8fd44dace82a20c4581

* lint

Change-Id: Ibf3c06619ef296df5c49f3945cb6428777781d69

* move logging.h to src

* fix an error in macOS

* remove logging.h

* use cflags for gcc

* fix compilation error

commit | commitdiff | tree

Haichen Shen [Tue, 7 Apr 2020 19:05:33 +0000 (12:05 -0700)]

[Relay][OP] Add fast_erf implementation (#5241)

* add fast erf

* doc

* lint

* fix

* fix indent

commit | commitdiff | tree

Tianqi Chen [Tue, 7 Apr 2020 17:24:55 +0000 (10:24 -0700)]

[TIR] Fix perf regression of tir refactor (#5258)

commit | commitdiff | tree

Adrian Muresan [Tue, 7 Apr 2020 15:21:23 +0000 (17:21 +0200)]

Fixed typo and type mismatch (#5259)

Co-authored-by: Adrian Muresan <muresan.adrian.bn@gmail.com>

commit | commitdiff | tree

Samuel [Tue, 7 Apr 2020 10:13:45 +0000 (15:43 +0530)]

[Pytorch]layernorm bug fix and testcase updated (#5257)

commit | commitdiff | tree

Samuel [Tue, 7 Apr 2020 08:14:02 +0000 (13:44 +0530)]

[TFLITE]Hard Swish & MobilnetV3 model testing (#5239)

* [TFLITE]Hard Swish & MobilnetV3 model testing

* CI Failure addressed

commit | commitdiff | tree

Pratik Fegade [Tue, 7 Apr 2020 02:04:20 +0000 (22:04 -0400)]

[TE] Minor bugfix in message_passing.cc (#5254)

commit | commitdiff | tree

Haichen Shen [Mon, 6 Apr 2020 21:30:53 +0000 (14:30 -0700)]

[Topi] Breakdown topi.cc into smaller files (#5253)

* [Topi] Breakdown topi.cc into smaller files

* add missing file

commit | commitdiff | tree

Samuel [Mon, 6 Apr 2020 20:31:19 +0000 (02:01 +0530)]

[PYTORCH]LayerNorm support added (#5249)

commit | commitdiff | tree

Tianqi Chen [Mon, 6 Apr 2020 20:25:00 +0000 (13:25 -0700)]

[RUNTIME] Enable auto conversion from str to runtime::String in PackedFunc, move dtype related handling to data_type.h (#5251)

commit | commitdiff | tree

Tang, Shizhi [Mon, 6 Apr 2020 15:43:38 +0000 (23:43 +0800)]

fix lower_warp_memory (#5247)

commit | commitdiff | tree

chinakook [Mon, 6 Apr 2020 15:42:01 +0000 (23:42 +0800)]

fix to skip node not in graph. (#5238)

fix to skip node not in graph because some network cannot be hybridized with some var unused.

commit | commitdiff | tree

Haichen Shen [Mon, 6 Apr 2020 06:38:24 +0000 (23:38 -0700)]

[CI] Update MxNet to 1.6.0 with MKL (#5240)

commit | commitdiff | tree

Haichen Shen [Mon, 6 Apr 2020 03:53:59 +0000 (20:53 -0700)]

[Runtime][Contrib] Support cudnn softmax (#5214)

commit | commitdiff | tree

Josh Fromm [Sun, 5 Apr 2020 21:59:38 +0000 (14:59 -0700)]

[Relay][Topi][AutoTVM] Winograd support for Conv3D (#5186)

* Functional conv3d winograd working.

* Formatted python code.

* registered conv3d winograd compute and started adding relay without_weight_transform operator.

* Add topi testing for conv3d winograd.

* Format file.

* small tweak to unrolling to prevent build sticking.

* Refactoring convolution ops in relay.

* Refactored relay convolutions.

* Bug fixes.

* Fixed static bug in convolution.

* Added conv3d alter op layout and related support.

* Bug fixes and testing done.

* Fix a few autotvm bugs.

* Drop silly debug print.

* Removed debug_skip_region.

* Add variant of conv3d_winograd that doesn't transform depth.

* initial infrastructure done for depthless conv.

* Fix no_depth schedule bugs.

* automatic topi switching between depth and depthless winograd.

* Fixed bug in schedule.

* lint fixes.

* Removed indents in convolution.cc

* missed a few indents oops.

* fixed flop count.

* One more small tweak.

* Change kernel pack inner axes order.

* Style changes.

* Comment fixes.

commit | commitdiff | tree

ga [Sun, 5 Apr 2020 20:45:43 +0000 (16:45 -0400)]

[Fix][VM] Fix copy constructor (#5237)

commit | commitdiff | tree

Yao Wang [Sun, 5 Apr 2020 20:42:28 +0000 (13:42 -0700)]

[Relay][ADT]Static Tensor Array (#5103)

* Add other static tensor array ops

* Add tensor array get data

* Minor refactor

* Fix pylint

* Update docstring

* Make get data more generic

* Improve test

* Improve split test

* Improve get data

* Minor fix

* Further improvement for static shape

* Improve shape parsing

* Unify get_static_name

commit | commitdiff | tree

Tianqi Chen [Sun, 5 Apr 2020 00:36:49 +0000 (17:36 -0700)]

[REFACTOR][TIR] Migrate all low-level passes to the Pass Manager. (#5233)

* [REFACTOR][TIR] Migrate all low-level passes to the Pass Manager.

This PR migrates the tvm.lower to return IRModule of PrimFuncs
instead of the LoweredFuncs.

* Remove LoweredFunc.

commit | commitdiff | tree

Samuel [Sat, 4 Apr 2020 09:33:43 +0000 (15:03 +0530)]

[ONNX]Pool3d & upsample3d op support (#5135)

* [ONNX]Pool3d and Upsample3d op updated

* Pool3d and Upsample3d testcase

* Review comments fixed

* Review comments

commit | commitdiff | tree

Yao Wang [Sat, 4 Apr 2020 03:46:32 +0000 (20:46 -0700)]

Fix intel conv2d auto tune (#5200)

* Fix x86 conv2d and depthwise conv2d auto tuning

* Fix depthwise conv2d infer layout

* Use random data instead of empty data for autotvm

* Fix pylint

* Keep empty array for now for autotvm

commit | commitdiff | tree

Tang, Shizhi [Sat, 4 Apr 2020 01:49:56 +0000 (09:49 +0800)]

[TE] Support mixing normal and cross-thread reduction (#5193)

* Support mixing normal and cross-thread reduction

* minor improvements

commit | commitdiff | tree

Tianqi Chen [Fri, 3 Apr 2020 22:50:11 +0000 (15:50 -0700)]

[REFACTOR][TIR] Migrate most of low-level build to use the Pass Manager. (#5225)

* [REFACTOR][TIR] Migrate most of low-level build to use the Pass Manager.

- SplitHostDevice
- ThreadSync
- BindDevice
- LowerThreadAllreduce
- Provide a temp fix for printing IRModule with PrimFunc before the formal text printer.

* Address comments, fix tests.

* Fix relay tests

* Explicit move

commit | commitdiff | tree

Tianqi Chen [Fri, 3 Apr 2020 22:24:19 +0000 (15:24 -0700)]

[PYTHON] Make IntImm more like an integer (#5232)

commit | commitdiff | tree

Matthew Brookhart [Fri, 3 Apr 2020 21:35:55 +0000 (14:35 -0700)]

[RELAY] Non-recursive Graph Vistor and Rewriter (#4886)

* First pass a defining a non-recursive Graph Vistor and Rewriter

autoformat

remove a currently empty test until testing is solidfied

* Make CalcDep from Dead Code Elimination non-recursive

* Partially working, not passing all tests yet

passes tests when disabling GetExprRefCount, I think I have a bug in visit counting

fix GetExprRefCount

Fix a subtle bug with nested recursive/non-recursive scopes

* Refactor

* improve comments

* respond to review comments on comments

* Fix a problem with default recursion for dataflow nodes

mark DataflowVisitor methods as override

* implement ScopeMutator

* convert forward_rewrite to ScopeMutator, remove DataflowMutator

* rewrite ExprRewriter and convert fast_math to use it

* switch BiasAddSimplifier to ExprRewriter

fix a clang warning

fix cpp lint

fix doc param error

* respond to review comments

* fix a typo in the iterative looping

* add a regression test for GetExprRefCount issue

* Normalize naming

* fix lint

* First pass a defining a non-recursive Graph Vistor and Rewriter

autoformat

remove a currently empty test until testing is solidfied

* Make CalcDep from Dead Code Elimination non-recursive

* Partially working, not passing all tests yet

passes tests when disabling GetExprRefCount, I think I have a bug in visit counting

fix GetExprRefCount

Fix a subtle bug with nested recursive/non-recursive scopes

* Refactor

* improve comments

* respond to review comments on comments

* Fix a problem with default recursion for dataflow nodes

mark DataflowVisitor methods as override

* implement ScopeMutator

* convert forward_rewrite to ScopeMutator, remove DataflowMutator

* rewrite ExprRewriter and convert fast_math to use it

* switch BiasAddSimplifier to ExprRewriter

fix a clang warning

fix cpp lint

fix doc param error

* respond to review comments

* fix a typo in the iterative looping

* add a regression test for GetExprRefCount issue

* Normalize naming

* fix lint

* respond to review comments

commit | commitdiff | tree

Animesh Jain [Fri, 3 Apr 2020 19:11:32 +0000 (12:11 -0700)]

[TOPI x86] Adding unroll_kw config option for depthwise conv2d. (#5197)

commit | commitdiff | tree

mbaret [Fri, 3 Apr 2020 16:33:15 +0000 (17:33 +0100)]

[RELAY][FIX] Fix hang in MergeCompilerRegions (#5227)

For certain network topologies, MCR could hang.
This patch fixes that case.

Change-Id: I3edd8a8a6b452b2b838b777720adea22a3b995b4

commit | commitdiff | tree

Samuel [Fri, 3 Apr 2020 16:04:41 +0000 (21:34 +0530)]

[KERAS]Upsample3d & ZeroPadding3d op (#5125)

* [KERAS]upsampling3d and zeropadding3d op

* [KERAS]upsampling3d and zeropadding3d test case

* Review comments updated

commit | commitdiff | tree

Samuel [Fri, 3 Apr 2020 13:41:28 +0000 (19:11 +0530)]

[DOCSTRING]missing function parameters updated (#5228)

commit | commitdiff | tree

Wei Pan [Fri, 3 Apr 2020 06:57:40 +0000 (23:57 -0700)]

[CodeGen][CUDA] Fix bugs (#5209)

- Support vectorized casts

- It is incorrect to extract elements from int8x4 with

   0x000000ff & (x >> i * 8)

  as this value is of type int in C/C++. If this expression
  is used for sign extensions, the sign bit will be wrong.
  Simply use C style casts instead and sign bits will just work.

Signed-off-by: Wei Pan <weip@nvidia.com>

commit | commitdiff | tree

Tianqi Chen [Thu, 2 Apr 2020 23:56:24 +0000 (16:56 -0700)]

[REFACTOR] tvm.hybrid -> te.hybrid (#5223)

Rationale: The current hybrid module is more aligned with the te part.
We might consider add a new varient of hybrid script that support the unified IR later.
This refactor paves for the potential later changes.

commit | commitdiff | tree

Tianqi Chen [Thu, 2 Apr 2020 23:56:11 +0000 (16:56 -0700)]

[DOCS] Misc docs improvements (#5222)

- Reduce CI docs task log size.
- Update the relation to halide to the latest state.

commit | commitdiff | tree

Samuel [Thu, 2 Apr 2020 22:20:41 +0000 (03:50 +0530)]

[PYTORCH]AvgPool3d, MaxPool3d and Squeeze op support (#5220)

* [PYTORCH]AvgPool3d, MaxPool3d and Squeeze op support

* Testcases added

* review comments

commit | commitdiff | tree

Tianqi Chen [Thu, 2 Apr 2020 20:22:23 +0000 (13:22 -0700)]

[REFACTOR][TIR] Migrate low-level pass functions to Pass Manager, (#5213)

- Migrate LowerTVMBultin
- Migrate inferFragment, LowerThreadAllreduce
- Migrate ThreadSync
- Refactor target::Build to directly take IRModule.
- Remove un-used legacy functions.

commit | commitdiff | tree

Tianqi Chen [Thu, 2 Apr 2020 19:36:55 +0000 (12:36 -0700)]

[TIR] Introduce BufferLoad/Store (#5205)

Co-authored-by: Siyuan Feng <hzfengsy@sjtu.edu.cn>
This PR introduces BufferLoad/Store to TIR. The new nodes will replace
Provide and Call with Tensor arguments in the subsequent refactors.

commit | commitdiff | tree

Haozheng Fan [Thu, 2 Apr 2020 16:04:33 +0000 (00:04 +0800)]

[TIR][PASS] dtype rewrite for indexing variables (#5092)

commit | commitdiff | tree

Zhi [Thu, 2 Apr 2020 15:56:37 +0000 (08:56 -0700)]

[Runtime][Object] expose runtime::String to Python (#5212)

* expose runtime::String to Python

* retrigger ci

commit | commitdiff | tree

Zhi [Thu, 2 Apr 2020 15:30:46 +0000 (08:30 -0700)]

[REFACTOR][IR] kExternalSymbol -> kGlobalSymbol (#5211)

* expose runtime::String to Python

* kExternalSymbol -> kGlobalSymbol

commit | commitdiff | tree

Jeremy Johnson [Thu, 2 Apr 2020 15:29:14 +0000 (16:29 +0100)]

[Frontend][Torch] Fix up graph input handling (#5204)

* [Frontend][Torch] Simplify operator input handling

* [Frontend][Torch] Allow user supplied input names to override graph inputs

* Fix pylint issues

* Updates from code review feedback

* Fix tutorial to use shape list input

* Disable intermittent test failure in topi vision test

commit | commitdiff | tree

Wei Pan [Thu, 2 Apr 2020 02:44:52 +0000 (19:44 -0700)]

[Debug] Add Dump function for Object type (NFC) (#5207)

Signed-off-by: Wei Pan <weip@nvidia.com>

commit | commitdiff | tree

Tianqi Chen [Thu, 2 Apr 2020 02:44:42 +0000 (19:44 -0700)]

[DOCS] Reduce artifcats generated by sphinx gallery (#5208)

commit | commitdiff | tree

Tianqi Chen [Thu, 2 Apr 2020 00:14:49 +0000 (17:14 -0700)]

[REFACTOR][TIR] Introduce ExprDeepEqual, Remove IRDeepCompare (#5206)

* [REFACTOR][TIR] Introduce ExprDeepEqual, Remove IRDeepCompare

This PR introduces ExprDeepEqual which reuses the StructuralEqual infra.
We migrated the usecases of ir_pass::Equal to ExprDeepEqual and StructuralEqual.

* Address comments

commit | commitdiff | tree

mbaret [Wed, 1 Apr 2020 23:16:31 +0000 (00:16 +0100)]

[RELAY] Fixes to MergeCompilerRegions (#5195)

* [RELAY] Fixed issues with MergeCompilerRegions

This PR addresses a few outstanding issues with
the implementation of MergeCompilerRegions. In
particular, it now handles TupleGetItem nodes properly
and other minor bugs related to region merging have
been fixed.

Change-Id: I07783afc56183a6f798a510209f23b0a5f252255

* Fixed issue using pre-merged regions

Change-Id: I0a844ac59bda1089ae0c67cef52f0b0c7ab2cbd7

* Removed some debugging logic

Change-Id: Ib6f2eede6f38bbb270073eb8d4c4dc19f60832c6

* Remove default annotations

Change-Id: I9b7696a51c95871491cbea33c40f92ec327e417f

* Annotate default 'if's

Change-Id: I0098bd1bf6788dd6366810dcefa84f1ebbffaab0

* Clang format

Change-Id: I944365cd3080a97a9261f643a8f1efa5a63cf82b

* Use src/dest in merge

Change-Id: Ie43113492bda8f1ce63eaf9615cb645bb9e2ee86

* Fixed partition test

Change-Id: I46f9e349b1a813a9140f7e4f8a2241687e2df73b

* Removed comments

Change-Id: I309afdd1951d7e796e41d13788aa487707e0ac4c

commit | commitdiff | tree

MORITA Kazutaka [Wed, 1 Apr 2020 22:49:37 +0000 (07:49 +0900)]

[FRONTEND][MXNET] Use leaky by default for LeakyReLU (#5192)

commit | commitdiff | tree

Samuel [Wed, 1 Apr 2020 21:21:50 +0000 (02:51 +0530)]

[PYTORCH]Dropouts And InstanceNorm support added (#5203)

* [PYTORCH]Dropouts And InstanceNorm support added

* Review comments fixed

commit | commitdiff | tree

Samuel [Wed, 1 Apr 2020 19:02:17 +0000 (00:32 +0530)]

[BUGFIX]bugfix in tensorflow space_to_batch_nd (#5175)

* [BUGFIX]bugfix in tensorflow space_to_batch_nd

* Test case added

commit | commitdiff | tree

Tianqi Chen [Wed, 1 Apr 2020 17:55:59 +0000 (10:55 -0700)]

[DOCS] Use https link (#5183)

* [DOCS] Use https link

* use http for sphinx

commit | commitdiff | tree

manupa-arm [Wed, 1 Apr 2020 17:43:44 +0000 (18:43 +0100)]

[RELAY] Partition graph codestyle fixes (#5202)

* [RELAY] Codestyle fixes for Graph Partitioner
*ran through clang-format

* *formatting comments

* *further codestyle changes (after clang-format)

commit | commitdiff | tree

Samuel [Wed, 1 Apr 2020 15:49:03 +0000 (21:19 +0530)]

[PYTORCH]Activations for pytorch (#5194)

* [PYTORCH]Activations for pytorch

* Review comments updated

commit | commitdiff | tree

Tianqi Chen [Wed, 1 Apr 2020 02:44:41 +0000 (19:44 -0700)]

[REFACTOR][TIR] Migrate Low-level Passes to Pass Manager (#5198)

* [TIR][TRANSFORM] Migrate LowerIntrin

* LowerDeviceStorageAccessInfo

* Migrate LowerWarpMemory

commit | commitdiff | tree

Animesh Jain [Tue, 31 Mar 2020 23:57:39 +0000 (16:57 -0700)]

[Topi x86] Missing vectorize for depthwise conv2d. (#5196)

commit | commitdiff | tree

manupa-arm [Tue, 31 Mar 2020 18:27:05 +0000 (19:27 +0100)]

[RELAY] Re-wrote the Graph Partitioner to support multiple outputs (#5143)

* [RELAY] Re-wrote the Graph Partitioner to support multiple outputs

Input : A Relay module that have functions with disjoint annotated regions
        using compiler_begin and compiler_end. There could be multiple outputs.

Output : A Relay module with global functions for such disjoint annotated regions
         with calls inserted at the respective location

Dependencies : AnnotatedRegionSet Utility class.

Methodology :
      1) The AnnotatedRegionSet utility class is able to construct a collection of
         nodes that are bound by a give annotation -- here we use compiler_begin
         and compiler_end
      2) Initially, for each function in the module AnnotatedRegionSets are populated.
      3) Then, Vistor pass is traversed until a compiler_end node is encountered
         that belongs to a "region".
      4) When the first compiler_end of a given annotated region is found, a function is
         formed and inserted.
         a) if the region has multiple outputs, a Tuple node (capturing all outputs)
            is returned.
      5) Thereafter, if we encounter an another output of the same annotated region,
         it is important to note that the function is already formed. Therefore, it will
         lookup the function and add a TupleGetItemNode is inserted.
          a) We will use the location index of "rets" of each "Region" of AnnotatedRegionSet
             as TupleGetItemNode index.
      6) Therefore, functions will be created for all annotated regions. The name for each
         global function is created using "Region" id and the compiler name.

Change-Id: I1372f02a845b6d3da03b561763e03a378dca263c

* [RELAY] Re-wrote the Graph Partitioner to support multiple outputs

    *removed the expected use-case as we are taking broken-down PR approach
    *code style fixes
    *some trivial one liners

* [RELAY] Re-wrote the Graph Partitioner to support multiple outputs

    *fixed an implicit copy to a move

* [RELAY] Re-wrote the Graph Partitioner to support multiple outputs

    *code style changes for comments
    *renamed test case multiple outputs --> mixed single multiple outputs
        Since the existing test case checks for both single and multiple
        output scenarios
    *added a new test case with conv2d + batch_norm
    *some var name changes in the test

* [RELAY] Re-wrote the Graph Partitioner to support multiple outputs

*rebased

commit | commitdiff | tree

Thomas Viehmann [Tue, 31 Mar 2020 16:37:51 +0000 (18:37 +0200)]

rocm: fix dense_rocblas in strategy, topi (#5191)

commit | commitdiff | tree

Wang Yucheng [Tue, 31 Mar 2020 11:01:10 +0000 (19:01 +0800)]

[Torch] Add support for split (#5174)

* [Torch] Add support for split

* fix

* fix test class

commit | commitdiff | tree

Samuel [Tue, 31 Mar 2020 08:41:41 +0000 (14:11 +0530)]

[FRONTEND][KERAS]Max_pool3d and Averagepool3d operator support (#5085)

* [KERAS]Pool3d support added

* Keras pool3d testcase added

commit | commitdiff | tree

Thierry Moreau [Tue, 31 Mar 2020 05:17:36 +0000 (22:17 -0700)]

[VTA] HW sources refactor (#5188)

* refactor

* path udpate

commit | commitdiff | tree

Andrew Reusch [Tue, 31 Mar 2020 04:14:18 +0000 (21:14 -0700)]

Add warning about nnpack installing googletest (#5185)

commit | commitdiff | tree

hlu1 [Tue, 31 Mar 2020 03:27:20 +0000 (20:27 -0700)]

[TVM] ref_counter -> ref_counter_ (#5184)

commit | commitdiff | tree

Yizhi Liu [Tue, 31 Mar 2020 02:04:43 +0000 (19:04 -0700)]

[TE] reverse-mode autodiff without any optimization (#5121)

* [TE] reverse-mode autodiff without any optimization

Co-authored-by: Sergei Grechanik <sergei.grechanik+h@gmail.com>
* address review comments

* add comments and retrigger CI

* move unittest to debug ci

* move test back and add seed

Co-authored-by: Sergei Grechanik <sergei.grechanik+h@gmail.com>

commit | commitdiff | tree

Animesh Jain [Tue, 31 Mar 2020 02:02:07 +0000 (19:02 -0700)]

[TOPI] Setting workload correctly for Depthwise conv ARM. (#5182)

commit | commitdiff | tree

Tianqi Chen [Mon, 30 Mar 2020 23:41:27 +0000 (16:41 -0700)]

[TEST] Various CI fixes for the VTA and Relay (#5181)

* [VTA] Set the correct type for synchronize

* Fix the legacy API

* Temporary remove the structural equal

commit | commitdiff | tree

Thomas Viehmann [Mon, 30 Mar 2020 22:48:17 +0000 (00:48 +0200)]

rocm: fix miopen convolutions (#5179)

* fix miopen convolutions

* fix overly long lines

commit | commitdiff | tree

Tianqi Chen [Mon, 30 Mar 2020 21:16:18 +0000 (14:16 -0700)]

[DOCS] Point docs to the ASF site. (#5178)

* [DOCS] Point docs to the ASF site.

We have migrated the main docs to the ASF site,
which will be periodically updated using the docs generated by the CI.
Points the docs to the ASF version.

* [CI] Improve the docs generation script

commit | commitdiff | tree

mbaret [Mon, 30 Mar 2020 20:59:10 +0000 (21:59 +0100)]

[RELAY] Add MergeCompilerRegions pass (#5134)

* [RELAY] Add MergeCompilerRegions pass

This pass is part of the flow to support creating compiler
regions with multiple outputs. It should be called after
AnnotateTarget and will merge together regions that share
the same target to create larger compiler regions that can
be off-loaded to external codegens.

This pass implements an algorithm to ensure that during the
merging, no data dependency issues are created. See the tests
for an example of this case.

Co-authored-by: Ramana Radhakrishnan <ramana.radhakrishnan@arm.com>
Co-authored-by: Manupa Karunaratne <manupa.karunaratne@arm.com>
Change-Id: Ibd99083564608d888482f57c5080109f3eefec88

* [RELAY] Annotate compiler_ends on each edge

This alters the behaviour of the AnnotateTarget
pass to enforce the property that all compiler
annotations exist along a single data flow edge.
Specifically, this means they should have exactly
one parent and one child.

Change-Id: I0e74803a77767f4f377d17755a13a74a30909797

* Fix comment

* Rebase *Node::make

* Moved block outside for loop

* Code style

* Update make API

* Remove comment

* Remove redundant 'else's

* Make one line

* Fix comment

* RefWrite

* Fix merge ordering

* Add the RFC example as a test

* [FIX] Fixed merging behaviour in AnnotateRegionSet

Deleting items from a list while iterating it seems to
result in undefined behaviour which sometimes segfaults.
This makes sure all the item deletion happens separately.

* Added checks

* Move comment

* Update comments

commit | commitdiff | tree

Samuel [Mon, 30 Mar 2020 20:51:01 +0000 (02:21 +0530)]

[TFLITE]TOP_K op parser support (#5051)

* [TFLITE]TOP_K op parser support

* Testcase updated

commit | commitdiff | tree

ga [Mon, 30 Mar 2020 20:45:11 +0000 (16:45 -0400)]

[Dataflow]: nullptr check (#5176)

commit | commitdiff | tree

Mehrdad Hessar [Mon, 30 Mar 2020 15:36:08 +0000 (08:36 -0700)]

[Runtime][MISRA-C][Bundle] Bundle deployment with static linking (#5158)

* test file for static link added

* rename files

* Fixed static linking issue

* cleanup

* changed to dynamic and static demo

* MISRA-C static and dynamic test

* cleanup

* cleanup

* Update README.md

* cleanup headers

* update readme

commit | commitdiff | tree

Leandro Nunes [Mon, 30 Mar 2020 15:28:41 +0000 (16:28 +0100)]

Create a new parameter --cache-from in tvm/docker/build.sh, so that we can point to an image to be used as cache, from an external (#5173)

script.

* Adjusts documentation to provide information about new optional
parameter "--cache-from"
* Includes --cache-from in the underlying "docker build" command
triggered by build.sh, when required

commit | commitdiff | tree

Tianqi Chen [Mon, 30 Mar 2020 15:28:16 +0000 (08:28 -0700)]

[CI] Improve VTA build message and scripts. (#5170)

* [CI] Improve VTA build message and scripts.

* Use absolute path to set the env var

commit | commitdiff | tree

lfengad [Mon, 30 Mar 2020 07:23:26 +0000 (15:23 +0800)]

Add support for sharing params of operators in tensorflow frontend (#5042)

Domain: Machine Learning / ML Framework;