Leyuan Wang [Sun, 29 Dec 2019 22:35:38 +0000 (14:35 -0800)]
[Perf] Add CublasLt extern support for better Igemm performance (#4550)
* cublaslt added
* fix lint
* address comments
* address more comments
* Trigger CI
* Trigger CI
Neo Chien [Sun, 29 Dec 2019 21:21:04 +0000 (05:21 +0800)]
[GraphRuntime] Support parameter out in the graph runtime debug (#4598)
* [GraphRuntime] Support parameter out in the graph runtime debug
* Dummy commit to trigger build
optima2005 [Sat, 28 Dec 2019 20:05:14 +0000 (04:05 +0800)]
[FRONTEND][TF] conv2d_transpose 'SAME' support kernel more than 1x1 (#4484)
* [FRONTEND][TF] conv3d_transpose 'SAME' support kernel more than 1x1
* revised per as review comments
* add more fallback wolkaround to make all tests pass
zhuochen [Sat, 28 Dec 2019 20:04:41 +0000 (04:04 +0800)]
fix tf.compat.v1 issue for tf verison <=1.12 (#4593)
Wang Yucheng [Fri, 27 Dec 2019 19:22:42 +0000 (03:22 +0800)]
[autotvm] fix typos in comment (#4591)
Zhao Wu (Chinese Name: 吴钊) [Fri, 27 Dec 2019 15:49:57 +0000 (23:49 +0800)]
[Runtime] add necessary const qualifier for NDArray container of parameters (#4590)
optima2005 [Fri, 27 Dec 2019 14:25:25 +0000 (22:25 +0800)]
[TOPI] add 3D upsampling Op. (#4584)
* [TOPI] add 3D upsampling Op.
* fix lint issues
* change align_corners to coordinate_transformation_mode
* fix resize3d half_pixel
* make a simple function and clean up trilinear_resize3d_python
* fix doc
Animesh Jain [Fri, 27 Dec 2019 02:42:21 +0000 (18:42 -0800)]
[Relay][AlterLayout] Broadcast with scalar shape (#4577)
Animesh Jain [Thu, 26 Dec 2019 19:15:46 +0000 (11:15 -0800)]
[Relay] Convert Layout Pass. (#4335)
deepIgnorance [Thu, 26 Dec 2019 18:10:44 +0000 (02:10 +0800)]
[FIX][TOPI][X86] schedule dense pack (#4539)
黎明灰烬 [Thu, 26 Dec 2019 17:36:31 +0000 (01:36 +0800)]
[TOPI][AutoTVM] NHWC conv2d templates for ARM (#3859)
* [AutoTVM][TOPI] NHWC conv2d templates (spatial pack) for ARM
As some frontends (tflite for example) are using NHWC as the default
layout, we are enabling NHWC schedule templates in TOPI and AutoTVM.
* some comments fix
Zhao Wu [Thu, 26 Dec 2019 17:33:03 +0000 (01:33 +0800)]
[Container] Fix NDArray SaveDLTensor declaration and implementation signature different (#4586)
masahi [Thu, 26 Dec 2019 14:13:38 +0000 (23:13 +0900)]
[Quantization, Calibrate] Fix context creation when current_target is explicity set (#4582)
Wang Yucheng [Thu, 26 Dec 2019 09:45:59 +0000 (17:45 +0800)]
[DOCS]fix typos in autotvm tutorial (#4585)
Yizhi Liu [Thu, 26 Dec 2019 04:31:22 +0000 (20:31 -0800)]
[NEWS] add v0.6 release (#4558)
* [NEWS] add v0.6 release
* remove link prefix
* fix issue number
kice [Wed, 25 Dec 2019 21:42:03 +0000 (16:42 -0500)]
Some Windows and MSVC fixes (#4569)
* fix python exception creation in Windows
* better string conversion for msvc
* fix cpp style issue
Tianqi Chen [Wed, 25 Dec 2019 17:21:01 +0000 (09:21 -0800)]
[RUNTIME] Remove Extension VTable in favor of Unified Object system. (#4578)
Before the unified object protocol, we support pass
additional extension objects around by declaring a type as an extension type.
The old extension mechanism requires the types to register their
constructor and deleter to a VTable and does not enjoy the benefit of the
self-contained deletion property of the new Object system.
This PR upgrades the extension example to make use of the new object system
and removed the old Extension VTable.
Note that the register_extension funtion in the python side continues to work
when the passed argument does not require explicit container copy/deletion,
which covers the current usecases of the extension mechanism.
Tianqi Chen [Tue, 24 Dec 2019 23:14:03 +0000 (15:14 -0800)]
[DEPRECATION] Cleanup legacy verilog support (#4576)
This PR cleans up the left over code for legacy verilog support which was experimental.
The new hardware backend path is now support by VTA via TSIM.
Bohan Hou [Tue, 24 Dec 2019 16:51:05 +0000 (00:51 +0800)]
[DOC] fix doc in api.py (#4580)
Josh Fromm [Tue, 24 Dec 2019 05:04:34 +0000 (00:04 -0500)]
[Relay/Topi][Op] Added native DepthToSpace and SpaceToDepth Operators (#4566)
* Added tvm function stencil for subpixel operations to topi.
* Topi subpixel operators added and tested.
* Added subpixel attrs.
* Added depth_to_space relay attributes.
* depth_to_space fully working.
* Fixed NHWC shape bug.
* SpaceToDepth in and all tests passing.
* lint fixes.
* Added string include
* Fixed topi formatting.
* Added DCR/CDR mode to depthtospace operator.
Tianqi Chen [Mon, 23 Dec 2019 19:51:26 +0000 (11:51 -0800)]
[DEPRECATION] Remove NNVM compiler (#4571)
* Remove NNVM compiler
Dmitri Makarov [Mon, 23 Dec 2019 16:50:48 +0000 (17:50 +0100)]
Fix llvm-enabled build by adding missing intrinsics headers (#4575)
masahi [Mon, 23 Dec 2019 16:48:37 +0000 (01:48 +0900)]
remove unnecessary cast to int32 (#4573)
Liangfu Chen [Mon, 23 Dec 2019 16:43:52 +0000 (00:43 +0800)]
[VTA][Chisel] End-to-end Inference with Chisel VTA (#4574)
* [VTA][Chisel] End-to-end Inference with Chisel VTA
* Update TensorAlu.scala
Tianqi Chen [Mon, 23 Dec 2019 04:52:33 +0000 (20:52 -0800)]
Remove nnvm (#4565)
Yong Wu [Mon, 23 Dec 2019 01:43:33 +0000 (17:43 -0800)]
[Relay] add max_pool3d in relay and TF converter (#4551)
* [Relay] add max_pool3d in relay and TF converter
* fix comments
Tianqi Chen [Sun, 22 Dec 2019 17:47:34 +0000 (09:47 -0800)]
[TEST] Remove nnvm related code in topi and test script (#4562)
* [TEST] Remove nnvm related code in topi and test script
* Remove docs dep
Neo Chien [Sun, 22 Dec 2019 17:03:39 +0000 (01:03 +0800)]
[Relay][Frontend][ONNX] Support auto_pad in Conv and ConvTranspose (#4563)
Zhao Wu [Sun, 22 Dec 2019 04:14:40 +0000 (12:14 +0800)]
Support standardize runtime module (#4532)
Tianqi Chen [Sun, 22 Dec 2019 02:26:21 +0000 (18:26 -0800)]
[REFACTOR][DTYPE] Isolate dtype to runtime (#4560)
dtype.h -> runtime/data_type.h
Changes:
- Rename all old reference of tvm::Type to DataType
- ExprNode.type -> ExprNode.dtype
- Expr.type() -> Expr.dtype()
- Change Expr related functions to expr_operator.
- DataType::min() -> min_value(DataType)
- DataType::max() -> max_value(DataType)
- Move type constructor Int, UInt, Float, Handle, Bool into DataType.
- Int(bits) -> DataType::Int(bits)
- UInt(bits) -> DataType::UInt(bits)
Tianqi Chen [Sun, 22 Dec 2019 02:26:11 +0000 (18:26 -0800)]
[RUNTIME][VULKAN] Fix compiler warning (#4559)
Siyuan Feng [Sun, 22 Dec 2019 01:56:18 +0000 (17:56 -0800)]
[IR] fix style in ir_mutator and ir_visitor (#4561)
Liangfu Chen [Sat, 21 Dec 2019 22:19:56 +0000 (06:19 +0800)]
[VTA] improved virtual memory mapping (#4545)
* [VTA] improved virtual memory mapping
* Update virtual_memory.cc
Tianqi Chen [Fri, 20 Dec 2019 22:42:57 +0000 (14:42 -0800)]
[COMMUNITY] @cchung100m -> reviewer (#4557)
Zhi [Fri, 20 Dec 2019 22:36:14 +0000 (14:36 -0800)]
vm external codegen (#4544)
Tianqi Chen [Fri, 20 Dec 2019 22:21:09 +0000 (14:21 -0800)]
[PYTHON][FFI] Cythonize NDArray.copyto (#4549)
* [PYTHON][FFI] Cythonize NDArray.copyto
* Cythonize the shape property
Hideto Ueno [Fri, 20 Dec 2019 09:25:18 +0000 (18:25 +0900)]
[DOCS] Mention Ninja build system in install/from_source.rst (#4554)
* [DOCS] Mention Ninja build system in install/from_source.rst
* Address comments
mbarrett97 [Wed, 18 Dec 2019 21:23:36 +0000 (21:23 +0000)]
[TOPI] Fixed nms max_output_size loop (#4541)
One of the loops in hybrid_nms used for
performing the max_output_size reordering
was incorrectly designated as parallel
resulting in incorrect behaviour. This patch
changes that loop to a serial loop.
Change-Id: I97184f5887f5f028d8ab339fa2808eb7630a4017
Haichen Shen [Wed, 18 Dec 2019 21:17:18 +0000 (13:17 -0800)]
[TOPI] Allow batch matmul to be fused into injective ops (#4537)
Takato Yamada [Wed, 18 Dec 2019 17:58:37 +0000 (02:58 +0900)]
[relay][op] add expand op (from ONNX) to relay frontend (#4483)
* Add Expand to onnx.py
* add test function for expand
* Fix a onnx frontend test
* Add tests for the value itself instead of shape only on test_expand
* Cleaned up some unnecessary modifications.
Alex Gladkov [Wed, 18 Dec 2019 17:35:22 +0000 (09:35 -0800)]
Implement 1d deconvolution (#4476)
Tianqi Chen [Wed, 18 Dec 2019 06:17:51 +0000 (22:17 -0800)]
Update legacy places from nnvm to relay. (#4535)
* Update legacy places from nnvm to relay.
This PR prepares the current mainline to remove nnvm compiler dep.
* remove legacy stage
Zhi [Wed, 18 Dec 2019 03:17:55 +0000 (19:17 -0800)]
[Relay] External codegen (#4482)
lhutton1 [Tue, 17 Dec 2019 17:55:32 +0000 (17:55 +0000)]
PIL is depreciated and should be replaced with pillow (a fork of PIL) (#4533)
Change-Id: If2075df5475505f2da87dae7145af5a7ab83d8a4
Liangfu Chen [Mon, 16 Dec 2019 18:26:54 +0000 (02:26 +0800)]
fix crash issue in tsim backend (#4527)
masahi [Mon, 16 Dec 2019 16:11:53 +0000 (01:11 +0900)]
fix onnx shape dtype (#4528)
Cody Yu [Mon, 16 Dec 2019 06:37:43 +0000 (22:37 -0800)]
fix empty config caused KeyError (#4520)
YixinBao [Mon, 16 Dec 2019 05:46:21 +0000 (13:46 +0800)]
add bfloat16 typeflag support (#4525)
Liang ZOU [Sun, 15 Dec 2019 23:09:51 +0000 (07:09 +0800)]
[ir] use DataType instead of Type for readability because Type has been deprecated (#4513)
miheer vaidya [Sun, 15 Dec 2019 23:09:16 +0000 (16:09 -0700)]
Use the best tuner possible (#4397)
* Use the best tuner possible
* Add comment denoting availability of better tuners
* Fix typos and wording
Josh Fromm [Sun, 15 Dec 2019 23:08:35 +0000 (15:08 -0800)]
Fixed extra reshape parameter bug. (#4524)
Ina Dobreva [Sat, 14 Dec 2019 05:15:12 +0000 (05:15 +0000)]
[Bugfix][Frontend][TFlite] Fix wrong function call in TANH tests (#4517)
* Replace sigmoid() with tanh() in tests for TANH
SWu [Fri, 13 Dec 2019 20:09:56 +0000 (15:09 -0500)]
Fix bias_add gradient (#4516)
* Fix bias_add gradient
A change caused collapse_sum_like to reject implicit dimension
broadcasting for bias_add gradient, so switch to explicit sum reduction
on the non-bias axis dimensions.
* Lint fix
Alexander Pivovarov [Fri, 13 Dec 2019 17:42:58 +0000 (09:42 -0800)]
Fix TF resize for dynamic size models (#4510)
Leandro Nunes [Fri, 13 Dec 2019 05:48:29 +0000 (05:48 +0000)]
[CI] Update docker image ci_lint to obtain Python 3.6 from ppa:deadsnakes/ppa (#4505) (#4506)
masahi [Thu, 12 Dec 2019 22:52:06 +0000 (07:52 +0900)]
[Quantization] Fix annotation for multiply op (#4458)
* fix mul rewrite
* register Realize Rewrite for global avg pool and add test
* remove unnecessary check
* improve the test case
Haichen Shen [Thu, 12 Dec 2019 22:33:57 +0000 (14:33 -0800)]
[Hybrid][Fix] Fix hybrid script to support array of tensors (#4494)
* [Fix][Hybrid] Fix hybrid script to support array of tensors
* add test case
* clean up
* trigger ci
Dmitri Makarov [Thu, 12 Dec 2019 16:00:32 +0000 (17:00 +0100)]
Fix build for llvm newer than 9.0 (#4515)
optima2005 [Thu, 12 Dec 2019 06:06:20 +0000 (14:06 +0800)]
[TOPI] implement pool3d op (#4478)
* [TOPI] implement pool3d op
* use PoolInferCorrectLayout for both 2d and 3d pooling
* unify MakeMaxPool and MakeAvgPool
LaiyuanGong [Thu, 12 Dec 2019 01:47:50 +0000 (19:47 -0600)]
[NODE][Serialization]fix serialization precision loss in float (#4503)
* fix serialization precision loss in float
When we want to serialize a tvm.tensor object(like pickle), we will get a precision loss cause by std::to_string()。
For example, a2.value will be 0.0 while a.value=0.
00000001 in the following:
import tvm
import pickle
a = tvm.const(0.
00000001, 'float32')
a2 = pickle.loads(pickle.dumps(a))
* remove line end spaces
Thomas Viehmann [Thu, 12 Dec 2019 01:23:00 +0000 (02:23 +0100)]
add rocm schedules to topi C++ (#4507)
This imports the CUDA schedules to rocm.
Peter Yeh [Thu, 12 Dec 2019 01:19:42 +0000 (17:19 -0800)]
Add AMD codeGen unit tests (#4509)
Ramana Radhakrishnan [Wed, 11 Dec 2019 18:42:15 +0000 (18:42 +0000)]
Refactor bilinear and neighbour implementation in Tensorflow frontend (#4504)
There is significant duplication between functions.
Spotted while looking to move the tensorflow and tflite framework support to later than
1.13.1. The tests barf around resize_nearest_neighbour not ignoring the attribute
'helpful_pixel_centers'.
That upgrade is a separate discussion while this can go in
independently.
Thanks,
Ramana
Liang ZOU [Wed, 11 Dec 2019 17:33:08 +0000 (01:33 +0800)]
[codegen][Build] it's more readable to move the if condition out of the loop (#4501)
MORITA Kazutaka [Wed, 11 Dec 2019 16:39:06 +0000 (08:39 -0800)]
[RUNTIME] Fix compile errors of OpenCL FPGA backend (#4492)
Peter Yeh [Wed, 11 Dec 2019 09:14:36 +0000 (01:14 -0800)]
update rocm intrin rule (#4499)
Liangfu Chen [Wed, 11 Dec 2019 00:53:53 +0000 (08:53 +0800)]
[VTA] Speedup TSIM by Multi-threading (#4491)
This PR tries to increase TSIM performance by introducing multi-threading support.
reminisce [Tue, 10 Dec 2019 22:05:52 +0000 (14:05 -0800)]
Add __float2half_rn for cuda compute capabilities less than 53 (#4489)
* Fix
* clean up
Haichen Shen [Tue, 10 Dec 2019 19:09:23 +0000 (11:09 -0800)]
[Relay][Fix] Fix alter op layout when calling a global var (#4454)
* [Relay][Fix] Fix alter op layout when calling a global var
* add test case
Yizhi Liu [Tue, 10 Dec 2019 18:35:12 +0000 (10:35 -0800)]
[Team] Jared Roesch -> PPMC (#4488)
Liang ZOU [Tue, 10 Dec 2019 17:54:22 +0000 (01:54 +0800)]
[docs] typos in include/tvm/ir.h (#4493)
Tianqi Chen [Mon, 9 Dec 2019 21:22:31 +0000 (13:22 -0800)]
[REFACTOR][RUNTIME] Add LibraryModule that merges systemlib and dso. (#4481)
Historically we have two variations of modules(DSOModule and SystemLibModule)
that both exposes module via symbols.
This PR creates a common implementation for both, and introduce a Library
base class that allows us to have different implementations of GetSymbol.
It paves ways for future library related module enhancements.
Ina Dobreva [Mon, 9 Dec 2019 17:20:55 +0000 (17:20 +0000)]
[Relay][Frontend][TFlite] Add parses support for UNPACK tflite operator (#4447)
* use SPLIT & SQUEEZE = UNPACK as implemented in tensorflow parser
Relay doesn't support UNPACK
* tflite 1.13: UNPACK doesn't work as exepcted -> copies the values from
1st unpacked tensor to the other unpacks
* tflite 1.13: doesn't accept negative axis
Thierry Moreau [Mon, 9 Dec 2019 06:08:21 +0000 (22:08 -0800)]
[VTA] Bringing group convolution support (#4421)
* group conv operator support for VTA
* autotvm tuning script for group conv2d
* lint fix
* lint fix
* lint fix
* addressing comments
Zhi [Sun, 8 Dec 2019 19:57:25 +0000 (11:57 -0800)]
Check function attr for alpha equal (#4479)
Yuanqiang Liu [Sun, 8 Dec 2019 19:56:40 +0000 (03:56 +0800)]
[Codegen] fix bug on LLVM 10.0 (#4480)
Alexander Pivovarov [Sun, 8 Dec 2019 00:29:55 +0000 (16:29 -0800)]
Workaround to make conv2d_transpose compilation for CUDA work (#4472)
Alexander Pivovarov [Fri, 6 Dec 2019 08:35:20 +0000 (00:35 -0800)]
Fix typo in travserse (#4469)
Zhi [Thu, 5 Dec 2019 06:16:37 +0000 (22:16 -0800)]
[relay][op] Add shape func to tile (#4441)
* [relay][op] Add shape func to tile
* retrigger ci
* check dynamic axes
* retrigger ci
Junru Shao [Wed, 4 Dec 2019 23:08:28 +0000 (15:08 -0800)]
[BUGFIX] Fix search path for libtvm_topi.so (#4467)
ziheng [Wed, 4 Dec 2019 18:09:31 +0000 (10:09 -0800)]
[CONTRIB] TFLite Runtime (#4439)
Leo-arm [Wed, 4 Dec 2019 17:45:42 +0000 (17:45 +0000)]
lldb pretty printers for relay (#4453)
* lldb pretty printers for relay
A set of lldb debugger pretty printers that use the relay
PrettyPrinter functionality to display data structures in
the lldb debugger.
* lldb pretty printers for relay
A set of lldb debugger pretty printers that use the relay
PrettyPrinter functionality to display data structures in
the lldb debugger.
- Put the dot.lldbinit file in your home directory as .lldbinit.
- Update the file to point to the pretty printer script tvm.py
- Restart lldb
Liang ZOU [Wed, 4 Dec 2019 08:46:59 +0000 (16:46 +0800)]
[doc] fix typo (#4463)
optima2005 [Wed, 4 Dec 2019 08:44:07 +0000 (16:44 +0800)]
implement conv3d op (#4400)
* implement conv3d op
* add back missed conv2d_output_shape by mistake
* fix typo and docs, add topi test
* rebase to master and merge 2d/3d unification
* use cudnn.conv_forward
Tianqi Chen [Tue, 3 Dec 2019 23:14:07 +0000 (15:14 -0800)]
[RUNTIME][RPC] Update RPC runtime to allow remote module as arg (#4462)
optima2005 [Tue, 3 Dec 2019 21:18:19 +0000 (05:18 +0800)]
[RUNTIME] Add cudnn conv3d (#4418)
* [RUNTIME] Add cudnn conv3d
* add output checking to test_cudnn.verify()
* fix tests failure
* revised per as review comments
* unify conv_output_shape, conv_find_algo and conv_forward
* convert python list to tvm.array in conv_forward
* revise per as comments
* 'pass as reference' for vector args
* add back con2d/3d seperated implementation
* remove unused included header
* remove extra std::vectors
* remove unused header
Tianqi Chen [Tue, 3 Dec 2019 20:34:15 +0000 (12:34 -0800)]
[MEMORY] Fix gcc 4.8 compact (#4461)
Jammy Zhou [Tue, 3 Dec 2019 18:18:52 +0000 (02:18 +0800)]
Fix the Makefile for howto_deploy (#4457)
jmorrill [Tue, 3 Dec 2019 17:52:00 +0000 (09:52 -0800)]
Fix MSVC build error with container.h (#4455)
abergeron [Tue, 3 Dec 2019 17:39:13 +0000 (12:39 -0500)]
[TOPI][Relay][OP] Add a strided_set operation. (#4303)
Yong Wu [Mon, 2 Dec 2019 21:41:44 +0000 (13:41 -0800)]
[Relay] shape func for zeros, zeros_like, ones, ones_like (#4448)
anwang2009 [Mon, 2 Dec 2019 18:40:10 +0000 (10:40 -0800)]
[DOCS] add benchmark log format doc (#4366)
* add benchmark log format doc
* code review changes
* remove runtime_config, add md5 field
* schema edits
Logan Weber [Mon, 2 Dec 2019 18:38:12 +0000 (10:38 -0800)]
[µTVM] Enable AutoTVM for ARM STM32F746XX Boards (#4274)
HarryWu [Mon, 2 Dec 2019 17:09:38 +0000 (01:09 +0800)]
a tiny typo (#4452)
Alexander Pivovarov [Sun, 1 Dec 2019 15:41:50 +0000 (07:41 -0800)]
[TFLite] Add transpose_conv to TFLite parser (#4440)
Wei Chen [Sun, 1 Dec 2019 15:41:00 +0000 (07:41 -0800)]
[Runtime] Make ADTObject POD container type (#4346)
Haichen Shen [Sun, 1 Dec 2019 00:27:15 +0000 (16:27 -0800)]
[Relay][Pass] Fix lambda lift pass for recursive call (#4432)
* Fix lambda lift
* clean up
* lint
* fix
* remove unused import
Ina Dobreva [Sun, 1 Dec 2019 00:16:44 +0000 (00:16 +0000)]
[Relay][Frontend][TFlite] Add test for qnn_mul operator (#4395)
* Add a function to set the qnn output range wrt each elemwise operation.
* Add comments warning for nonsense clamped output in the tflite/tvm results comparison.
Thierry Moreau [Thu, 28 Nov 2019 18:12:49 +0000 (10:12 -0800)]
rpi4b target (#4445)
Liangfu Chen [Thu, 28 Nov 2019 06:45:58 +0000 (14:45 +0800)]
fix multiple transfer issue in loaduop (#4442)