platform/upstream/tvm.git
4 years agoadd rocm schedules to topi C++ (#4507)
Thomas Viehmann [Thu, 12 Dec 2019 01:23:00 +0000 (02:23 +0100)]
add rocm schedules to topi C++ (#4507)

This imports the CUDA schedules to rocm.

4 years agoAdd AMD codeGen unit tests (#4509)
Peter Yeh [Thu, 12 Dec 2019 01:19:42 +0000 (17:19 -0800)]
Add AMD codeGen unit tests (#4509)

4 years agoRefactor bilinear and neighbour implementation in Tensorflow frontend (#4504)
Ramana Radhakrishnan [Wed, 11 Dec 2019 18:42:15 +0000 (18:42 +0000)]
Refactor bilinear and neighbour implementation in Tensorflow frontend (#4504)

There is significant duplication between functions.

Spotted while looking to move the tensorflow and tflite framework support to later than
1.13.1. The tests barf around resize_nearest_neighbour not ignoring the attribute
'helpful_pixel_centers'.

That upgrade is a separate discussion while this can go in
independently.

Thanks,
Ramana

4 years ago[codegen][Build] it's more readable to move the if condition out of the loop (#4501)
Liang ZOU [Wed, 11 Dec 2019 17:33:08 +0000 (01:33 +0800)]
[codegen][Build] it's more readable to move the if condition out of the loop (#4501)

4 years ago[RUNTIME] Fix compile errors of OpenCL FPGA backend (#4492)
MORITA Kazutaka [Wed, 11 Dec 2019 16:39:06 +0000 (08:39 -0800)]
[RUNTIME] Fix compile errors of OpenCL FPGA backend (#4492)

4 years agoupdate rocm intrin rule (#4499)
Peter Yeh [Wed, 11 Dec 2019 09:14:36 +0000 (01:14 -0800)]
update rocm intrin rule (#4499)

4 years ago[VTA] Speedup TSIM by Multi-threading (#4491)
Liangfu Chen [Wed, 11 Dec 2019 00:53:53 +0000 (08:53 +0800)]
[VTA] Speedup TSIM by Multi-threading (#4491)

This PR tries to increase TSIM performance by introducing multi-threading support.

4 years agoAdd __float2half_rn for cuda compute capabilities less than 53 (#4489)
reminisce [Tue, 10 Dec 2019 22:05:52 +0000 (14:05 -0800)]
Add __float2half_rn for cuda compute capabilities less than 53 (#4489)

* Fix

* clean up

4 years ago[Relay][Fix] Fix alter op layout when calling a global var (#4454)
Haichen Shen [Tue, 10 Dec 2019 19:09:23 +0000 (11:09 -0800)]
[Relay][Fix] Fix alter op layout when calling a global var (#4454)

* [Relay][Fix] Fix alter op layout when calling a global var

* add test case

4 years ago[Team] Jared Roesch -> PPMC (#4488)
Yizhi Liu [Tue, 10 Dec 2019 18:35:12 +0000 (10:35 -0800)]
[Team] Jared Roesch -> PPMC (#4488)

4 years ago[docs] typos in include/tvm/ir.h (#4493)
Liang ZOU [Tue, 10 Dec 2019 17:54:22 +0000 (01:54 +0800)]
[docs] typos in include/tvm/ir.h (#4493)

4 years ago[REFACTOR][RUNTIME] Add LibraryModule that merges systemlib and dso. (#4481)
Tianqi Chen [Mon, 9 Dec 2019 21:22:31 +0000 (13:22 -0800)]
[REFACTOR][RUNTIME] Add LibraryModule that merges systemlib and dso. (#4481)

Historically we have two variations of modules(DSOModule and SystemLibModule)
that both exposes module via symbols.

This PR creates a common implementation for both, and introduce a Library
base class that allows us to have different implementations of GetSymbol.

It paves ways for future library related module enhancements.

4 years ago[Relay][Frontend][TFlite] Add parses support for UNPACK tflite operator (#4447)
Ina Dobreva [Mon, 9 Dec 2019 17:20:55 +0000 (17:20 +0000)]
[Relay][Frontend][TFlite] Add parses support for UNPACK tflite operator (#4447)

* use SPLIT & SQUEEZE = UNPACK as implemented in tensorflow parser
  Relay doesn't support UNPACK
* tflite 1.13: UNPACK doesn't work as exepcted -> copies the values from
  1st unpacked tensor to the other unpacks
* tflite 1.13: doesn't accept negative axis

4 years ago[VTA] Bringing group convolution support (#4421)
Thierry Moreau [Mon, 9 Dec 2019 06:08:21 +0000 (22:08 -0800)]
[VTA] Bringing group convolution support  (#4421)

* group conv operator support for VTA

* autotvm tuning script for group conv2d

* lint fix

* lint fix

* lint fix

* addressing comments

4 years agoCheck function attr for alpha equal (#4479)
Zhi [Sun, 8 Dec 2019 19:57:25 +0000 (11:57 -0800)]
Check function attr for alpha equal (#4479)

4 years ago[Codegen] fix bug on LLVM 10.0 (#4480)
Yuanqiang Liu [Sun, 8 Dec 2019 19:56:40 +0000 (03:56 +0800)]
[Codegen] fix bug on LLVM 10.0 (#4480)

4 years agoWorkaround to make conv2d_transpose compilation for CUDA work (#4472)
Alexander Pivovarov [Sun, 8 Dec 2019 00:29:55 +0000 (16:29 -0800)]
Workaround to make conv2d_transpose compilation for CUDA work (#4472)

4 years agoFix typo in travserse (#4469)
Alexander Pivovarov [Fri, 6 Dec 2019 08:35:20 +0000 (00:35 -0800)]
Fix typo in travserse (#4469)

4 years ago[relay][op] Add shape func to tile (#4441)
Zhi [Thu, 5 Dec 2019 06:16:37 +0000 (22:16 -0800)]
[relay][op] Add shape func to tile (#4441)

* [relay][op] Add shape func to tile

* retrigger ci

* check dynamic axes

* retrigger ci

4 years ago[BUGFIX] Fix search path for libtvm_topi.so (#4467)
Junru Shao [Wed, 4 Dec 2019 23:08:28 +0000 (15:08 -0800)]
[BUGFIX] Fix search path for libtvm_topi.so (#4467)

4 years ago[CONTRIB] TFLite Runtime (#4439)
ziheng [Wed, 4 Dec 2019 18:09:31 +0000 (10:09 -0800)]
[CONTRIB] TFLite Runtime (#4439)

4 years agolldb pretty printers for relay (#4453)
Leo-arm [Wed, 4 Dec 2019 17:45:42 +0000 (17:45 +0000)]
lldb pretty printers for relay (#4453)

* lldb pretty printers for relay

A set of lldb debugger pretty printers that use the relay
PrettyPrinter functionality to display data structures in
the lldb debugger.

* lldb pretty printers for relay

A set of lldb debugger pretty printers that use the relay
PrettyPrinter functionality to display data structures in
the lldb debugger.
- Put the dot.lldbinit file in your home directory as .lldbinit.
- Update the file to point to the pretty printer script tvm.py
- Restart lldb

4 years ago[doc] fix typo (#4463)
Liang ZOU [Wed, 4 Dec 2019 08:46:59 +0000 (16:46 +0800)]
[doc] fix typo (#4463)

4 years agoimplement conv3d op (#4400)
optima2005 [Wed, 4 Dec 2019 08:44:07 +0000 (16:44 +0800)]
implement conv3d op (#4400)

* implement conv3d op

* add back missed conv2d_output_shape by mistake

* fix typo and docs, add topi test

* rebase to master and merge 2d/3d unification

* use cudnn.conv_forward

4 years ago[RUNTIME][RPC] Update RPC runtime to allow remote module as arg (#4462)
Tianqi Chen [Tue, 3 Dec 2019 23:14:07 +0000 (15:14 -0800)]
[RUNTIME][RPC] Update RPC runtime to allow remote module as arg (#4462)

4 years ago[RUNTIME] Add cudnn conv3d (#4418)
optima2005 [Tue, 3 Dec 2019 21:18:19 +0000 (05:18 +0800)]
[RUNTIME] Add cudnn conv3d (#4418)

* [RUNTIME] Add cudnn conv3d

* add output checking to test_cudnn.verify()

* fix tests failure

* revised per as review comments

* unify conv_output_shape, conv_find_algo and conv_forward

* convert python list to tvm.array in conv_forward

* revise per as comments

* 'pass as reference' for vector args

* add back con2d/3d seperated implementation

* remove unused included header

* remove extra std::vectors

* remove unused header

4 years ago[MEMORY] Fix gcc 4.8 compact (#4461)
Tianqi Chen [Tue, 3 Dec 2019 20:34:15 +0000 (12:34 -0800)]
[MEMORY] Fix gcc 4.8 compact (#4461)

4 years agoFix the Makefile for howto_deploy (#4457)
Jammy Zhou [Tue, 3 Dec 2019 18:18:52 +0000 (02:18 +0800)]
Fix the Makefile for howto_deploy (#4457)

4 years agoFix MSVC build error with container.h (#4455)
jmorrill [Tue, 3 Dec 2019 17:52:00 +0000 (09:52 -0800)]
Fix MSVC build error with container.h (#4455)

4 years ago[TOPI][Relay][OP] Add a strided_set operation. (#4303)
abergeron [Tue, 3 Dec 2019 17:39:13 +0000 (12:39 -0500)]
[TOPI][Relay][OP] Add a strided_set operation. (#4303)

4 years ago[Relay] shape func for zeros, zeros_like, ones, ones_like (#4448)
Yong Wu [Mon, 2 Dec 2019 21:41:44 +0000 (13:41 -0800)]
[Relay] shape func for zeros, zeros_like, ones, ones_like (#4448)

4 years ago[DOCS] add benchmark log format doc (#4366)
anwang2009 [Mon, 2 Dec 2019 18:40:10 +0000 (10:40 -0800)]
[DOCS] add benchmark log format doc (#4366)

* add benchmark log format doc

* code review changes

* remove runtime_config, add md5 field

* schema edits

4 years ago[µTVM] Enable AutoTVM for ARM STM32F746XX Boards (#4274)
Logan Weber [Mon, 2 Dec 2019 18:38:12 +0000 (10:38 -0800)]
[µTVM] Enable AutoTVM for ARM STM32F746XX Boards (#4274)

4 years agoa tiny typo (#4452)
HarryWu [Mon, 2 Dec 2019 17:09:38 +0000 (01:09 +0800)]
a tiny typo (#4452)

4 years ago[TFLite] Add transpose_conv to TFLite parser (#4440)
Alexander Pivovarov [Sun, 1 Dec 2019 15:41:50 +0000 (07:41 -0800)]
[TFLite] Add transpose_conv to TFLite parser (#4440)

4 years ago[Runtime] Make ADTObject POD container type (#4346)
Wei Chen [Sun, 1 Dec 2019 15:41:00 +0000 (07:41 -0800)]
[Runtime] Make ADTObject POD container type (#4346)

4 years ago[Relay][Pass] Fix lambda lift pass for recursive call (#4432)
Haichen Shen [Sun, 1 Dec 2019 00:27:15 +0000 (16:27 -0800)]
[Relay][Pass] Fix lambda lift pass for recursive call (#4432)

* Fix lambda lift

* clean up

* lint

* fix

* remove unused import

4 years ago[Relay][Frontend][TFlite] Add test for qnn_mul operator (#4395)
Ina Dobreva [Sun, 1 Dec 2019 00:16:44 +0000 (00:16 +0000)]
[Relay][Frontend][TFlite] Add test for qnn_mul operator (#4395)

* Add a function to set the qnn output range wrt each elemwise operation.
* Add comments warning for nonsense clamped output in the tflite/tvm results comparison.

4 years agorpi4b target (#4445)
Thierry Moreau [Thu, 28 Nov 2019 18:12:49 +0000 (10:12 -0800)]
rpi4b target (#4445)

4 years agofix multiple transfer issue in loaduop (#4442)
Liangfu Chen [Thu, 28 Nov 2019 06:45:58 +0000 (14:45 +0800)]
fix multiple transfer issue in loaduop (#4442)

4 years ago[Doc] Fix broken link (#4438)
Neo Chien [Wed, 27 Nov 2019 18:42:54 +0000 (02:42 +0800)]
[Doc] Fix broken link (#4438)

* [Doc] Fix broken link

* [Doc] Fix broken link

* [Doc] Fix broken link

4 years ago[VTA] Enable streamlined GEMM execution (#4392)
Liangfu Chen [Wed, 27 Nov 2019 17:04:19 +0000 (01:04 +0800)]
[VTA] Enable streamlined GEMM execution (#4392)

* disable pipelined adder and enable streamlined gemm execution

* pipeline first layer of adder

* explain difference between pipeadder and adder

* add comment for explaining the hard-coded latency

4 years agoadd DeviceName to ROCm api (#4437)
Thomas Viehmann [Wed, 27 Nov 2019 15:15:37 +0000 (16:15 +0100)]
add DeviceName to ROCm api (#4437)

4 years ago[ARM CPU] Fix infer shape error of depthwise (#4384)
Zhao Wu [Wed, 27 Nov 2019 06:42:20 +0000 (14:42 +0800)]
[ARM CPU] Fix infer shape error of depthwise (#4384)

* [ARM CPU] Fix contrib_spatial_pack error

* PyLint error fix

* diable no-else-return as other files

* Change the test case split OC not be 1 to cover 5D weight layout

4 years ago[VTA][HotFix] Relay->VTA quantization fix (#4433)
Thierry Moreau [Wed, 27 Nov 2019 03:21:56 +0000 (19:21 -0800)]
[VTA][HotFix] Relay->VTA quantization fix (#4433)

* relay -> vta fix

* setting optlevel to 3 for quantization to fold batchnorm

4 years ago[RELEASE] Update copyright message, change notice, remove cma kernel module for now...
Tianqi Chen [Tue, 26 Nov 2019 23:58:13 +0000 (15:58 -0800)]
[RELEASE] Update copyright message, change notice, remove cma kernel module for now (#4431)

4 years ago[DOCS] Update main website to tvm.apache.org (#4429)
Tianqi Chen [Tue, 26 Nov 2019 22:40:26 +0000 (14:40 -0800)]
[DOCS] Update main website to tvm.apache.org (#4429)

* [DOCS] Update main website to tvm.apache.org

* Update jvm pom repo loc

* Change the org to asf

* Update ci addr to new one

4 years agoAllow Array/Map store objects that are not NodeRef (#4430)
Junru Shao [Tue, 26 Nov 2019 22:33:44 +0000 (14:33 -0800)]
Allow Array/Map store objects that are not NodeRef (#4430)

4 years agoTweak debugger result (#4426)
Haichen Shen [Tue, 26 Nov 2019 19:06:47 +0000 (11:06 -0800)]
Tweak debugger result (#4426)

4 years ago[AutoTVM] select model with the most tuned schedules (#4404)
Xingyu Zhou [Tue, 26 Nov 2019 18:17:25 +0000 (10:17 -0800)]
[AutoTVM] select model with the most tuned schedules (#4404)

* select model with the most tuned schedules

* change detect empty map method

* modify model description for load_reference_log

4 years ago[SETUP] Add optional dependencies to extras_require (#4428)
Neo Chien [Tue, 26 Nov 2019 17:48:24 +0000 (01:48 +0800)]
[SETUP] Add optional dependencies to extras_require (#4428)

4 years ago[Fix][Relay] Remove schedule register for nonexisting log1p op (#4425)
Haichen Shen [Tue, 26 Nov 2019 01:11:15 +0000 (17:11 -0800)]
[Fix][Relay] Remove schedule register for nonexisting log1p op (#4425)

4 years agoremoving nnvm dep from VTA sources (#4419)
Thierry Moreau [Tue, 26 Nov 2019 00:18:10 +0000 (16:18 -0800)]
removing nnvm dep from VTA sources (#4419)

4 years agoadd rocm codegen unittest for cross thread reduction (#4423)
Thomas Viehmann [Mon, 25 Nov 2019 15:37:52 +0000 (16:37 +0100)]
add rocm codegen unittest for cross thread reduction (#4423)

4 years ago[Perf] Enhance cudnn and cublas backend and enable TensorCore (#4353)
Siyuan Feng [Mon, 25 Nov 2019 06:01:55 +0000 (22:01 -0800)]
[Perf] Enhance cudnn and cublas backend and enable TensorCore (#4353)

* add half and mix precision support to cublas backend

* add TensorCore support in CuDNN

* enhance CuDNN support

* address comments and fix lint

* fix

* add fp16 test

4 years ago[RUNTIME] rename allocator.make -> allocator.make_object for term consistency (#4416)
Tianqi Chen [Sun, 24 Nov 2019 22:34:17 +0000 (14:34 -0800)]
[RUNTIME] rename allocator.make -> allocator.make_object for term consistency (#4416)

4 years agoFix compilaton of bfloat16 on Windows (#4415)
Philip Hyunsu Cho [Sun, 24 Nov 2019 22:16:29 +0000 (14:16 -0800)]
Fix compilaton of bfloat16 on Windows (#4415)

4 years ago[LICENSE] clarify the blockingqueue license, update version to 0.6.0 (#4414)
Tianqi Chen [Sun, 24 Nov 2019 19:43:21 +0000 (11:43 -0800)]
[LICENSE] clarify the blockingqueue license, update version to 0.6.0 (#4414)

4 years ago[License] move cma_api to 3rdparty. separate BSD 2-clause and 3-clause (#4410)
Yizhi Liu [Sun, 24 Nov 2019 17:44:38 +0000 (09:44 -0800)]
[License] move cma_api to 3rdparty. separate BSD 2-clause and 3-clause (#4410)

* [License] move cma_api to 3rdparty. separate BSD 2-clause and 3-clause

* add zlib license for blockingconcurrentqueue.h

4 years ago[LINT] Remove unnecessary copyright message for files with ASF header (#4409)
Tianqi Chen [Sun, 24 Nov 2019 08:22:55 +0000 (00:22 -0800)]
[LINT] Remove unnecessary copyright message for files with ASF header (#4409)

* [LINT] Improve the check tool to handle ASF copyright message.

* [LINT] Remove unnecessary copyright message as per ASF requirement.

* Fix codegen hybrid

* [LINT] Broaden license checks to include html, xml

* [LINT] Fix rest of the files

* Fix notice

* [LINT] Improve check file type error message

4 years ago[Release] resolve license issues (#4408)
Yizhi Liu [Sun, 24 Nov 2019 03:39:25 +0000 (19:39 -0800)]
[Release] resolve license issues (#4408)

4 years ago[Relay][Legalize] Legalize conv2d_transpose for NHWC (#4399)
Alexander Pivovarov [Sat, 23 Nov 2019 05:59:15 +0000 (21:59 -0800)]
[Relay][Legalize] Legalize conv2d_transpose for NHWC (#4399)

4 years ago[RUNTIME] Move module export to the function level. (#4405)
Tianqi Chen [Sat, 23 Nov 2019 04:32:20 +0000 (20:32 -0800)]
[RUNTIME] Move module export to the function level. (#4405)

4 years ago[TVM][RUNTIME] A minimum example to generate external library wrappers for DSOModule...
Zhi [Fri, 22 Nov 2019 23:31:50 +0000 (15:31 -0800)]
[TVM][RUNTIME] A minimum example to generate external library wrappers for DSOModule (#4280)

4 years ago[LICENSE] add 3rdparty licenses (#4402)
Yizhi Liu [Fri, 22 Nov 2019 22:37:25 +0000 (14:37 -0800)]
[LICENSE] add 3rdparty licenses (#4402)

* [LICENSE] add 3rdparty licenses

* rename license files to .txt

4 years agoAdded tflite frontend support for quantized mean. (#4339)
tristan-arm [Fri, 22 Nov 2019 21:34:40 +0000 (21:34 +0000)]
Added tflite frontend support for quantized mean. (#4339)

4 years ago[DOCS] Mention incubating in readme (#4401)
Tianqi Chen [Fri, 22 Nov 2019 18:28:43 +0000 (10:28 -0800)]
[DOCS] Mention incubating in readme (#4401)

4 years ago[Golang][Doc] improve the samples and doc (#4385)
Neo Chien [Fri, 22 Nov 2019 05:52:24 +0000 (13:52 +0800)]
[Golang][Doc] improve the samples and doc (#4385)

* [Golang][Doc] improve the samples and doc

* [Golang][Doc] add asf header

* [Golang][Doc] Improve the end to end example

* [Golang][Doc] Improve the end to end example

4 years agoupdate_document_after_repository_renamed (#4398)
tripley [Fri, 22 Nov 2019 03:12:05 +0000 (19:12 -0800)]
update_document_after_repository_renamed (#4398)

4 years agoUpdate Jenkinsfile for external runtime (#4396)
Cody Yu [Fri, 22 Nov 2019 00:45:47 +0000 (16:45 -0800)]
Update Jenkinsfile for external runtime (#4396)

4 years ago[Relay][VM] Clean up the VM and VM profiler code (#4391)
Haichen Shen [Fri, 22 Nov 2019 00:01:01 +0000 (16:01 -0800)]
[Relay][VM] Clean up the VM and VM profiler code (#4391)

* [VM] add a few more API to vm

* [VM][Fix] fix vm convert args

* [VM] a few fixes

* rename fields

* update

* update vm profiler

* x

* add doc

* lint

* fix test

* address comments

4 years ago[TOPI] Fix flaky testcase for floor div (#4382)
Yizhi Liu [Thu, 21 Nov 2019 23:39:40 +0000 (15:39 -0800)]
[TOPI] Fix flaky testcase for floor div (#4382)

* [TOPI] Fix flaky testcase for floor div

* avoid check at 0.0

4 years agoAdd Logan to reviewer (#4390)
Haichen Shen [Thu, 21 Nov 2019 20:46:41 +0000 (12:46 -0800)]
Add Logan to reviewer (#4390)

4 years agoUpdate compile_engine.py (#4393)
Huang, Guangtai [Thu, 21 Nov 2019 20:46:00 +0000 (04:46 +0800)]
Update compile_engine.py (#4393)

4 years ago[Relay][Frontend][TF] Fix slice when begin or size is not Const (#4372)
Siyuan Li [Thu, 21 Nov 2019 18:53:37 +0000 (02:53 +0800)]
[Relay][Frontend][TF] Fix slice when begin or size is not Const (#4372)

* fix slice bug when input is param

* use _infer_value rather than _infer_value_simulated

4 years agoadd GPU checking before compilation for rocm (#4394)
Thomas Viehmann [Thu, 21 Nov 2019 14:40:29 +0000 (15:40 +0100)]
add GPU checking before compilation for rocm (#4394)

Previously, we would rely on the later phases to error out
(often for using too much shared memory). This enables the
checks on the IR that already exist for CUDA and OpenCL also
for ROCm.

4 years ago[QNN] Lowering for Depthwise Convolution. (#4351)
Animesh Jain [Thu, 21 Nov 2019 05:22:25 +0000 (21:22 -0800)]
[QNN] Lowering for Depthwise Convolution. (#4351)

4 years ago[fix][pass] Save the function when it is used as a call arg (#4389)
Zhi [Thu, 21 Nov 2019 00:50:01 +0000 (16:50 -0800)]
[fix][pass] Save the function when it is used as a call arg (#4389)

4 years ago[CI] Add more info, per exec ws isolation (#4388)
Tianqi Chen [Wed, 20 Nov 2019 23:43:54 +0000 (15:43 -0800)]
[CI] Add more info, per exec ws isolation (#4388)

4 years ago[ThreadPool] Solve thread transitions issue (#4344)
Zhao Wu [Wed, 20 Nov 2019 20:43:20 +0000 (04:43 +0800)]
[ThreadPool] Solve thread transitions issue (#4344)

* [ThreadPool] Solve thread transitions issue

* Use pthread_atfork to avoid master thread affinity be derived by child.

* Code Format

* comment of exclude_worker0_

* set full cpu affinity

* Redundant blank line

* CPPLint

* CPPLint namespace

* CPPLint

* Fix the wrong logic of bind master thread.

4 years agoCompare all outputs in TFLite test_forward_ssd_mobilenet_v1 (#4373)
Alexander Pivovarov [Wed, 20 Nov 2019 17:36:57 +0000 (09:36 -0800)]
Compare all outputs in TFLite test_forward_ssd_mobilenet_v1 (#4373)

4 years ago[team] add Yizhi's pgp key (#4380)
Yizhi Liu [Wed, 20 Nov 2019 17:31:34 +0000 (09:31 -0800)]
[team] add Yizhi's pgp key (#4380)

4 years agofix build with llvm trunk (#4386)
masahi [Wed, 20 Nov 2019 17:09:29 +0000 (02:09 +0900)]
fix build with llvm trunk (#4386)

4 years ago[doc] fix typo, codege to codegen (#4383)
Liang ZOU [Wed, 20 Nov 2019 13:05:01 +0000 (21:05 +0800)]
[doc] fix typo, codege to codegen (#4383)

4 years ago[CI] Avoid content-length request in test data download (#4375)
Tianqi Chen [Wed, 20 Nov 2019 06:04:42 +0000 (22:04 -0800)]
[CI] Avoid content-length request in test data download (#4375)

4 years ago[nvcc] enable multiple arch in one fatbin (#4377)
Yizhi Liu [Tue, 19 Nov 2019 23:07:29 +0000 (15:07 -0800)]
[nvcc] enable multiple arch in one fatbin (#4377)

4 years ago[Relay][Quantize] Integrate data-aware calibration into quantization (#4295)
Wuwei Lin [Tue, 19 Nov 2019 22:54:57 +0000 (17:54 -0500)]
[Relay][Quantize] Integrate data-aware calibration into quantization (#4295)

* [Relay][Quantize] Integrate data-aware calibration into quantization

* Update _calibrate.py

* trigger ci

* Address comments

* address comments

4 years ago[PERF] Parallelize reduction for CPU (#4158)
Haichen Shen [Tue, 19 Nov 2019 21:56:51 +0000 (13:56 -0800)]
[PERF] Parallelize reduction for CPU (#4158)

* [PERF] parallel reduction in cpu

* fix

* x

* update

* lint

* fix

4 years ago[tutorial][benchmark] nnvm -> relay (#4368)
Yizhi Liu [Tue, 19 Nov 2019 21:51:34 +0000 (13:51 -0800)]
[tutorial][benchmark] nnvm -> relay (#4368)

* [tutorial] nnvm -> relay

* use relay workload

* delete movbilenetv2 option

4 years agoFix TFLite RESHAPE assert (#4320)
Alexander Pivovarov [Tue, 19 Nov 2019 17:15:08 +0000 (09:15 -0800)]
Fix TFLite RESHAPE assert (#4320)

4 years ago[Relay tests] AlterOpLayout - Temporary attr update (#4357)
Animesh Jain [Tue, 19 Nov 2019 04:18:58 +0000 (20:18 -0800)]
[Relay tests] AlterOpLayout - Temporary attr update (#4357)

4 years agoadd rule for clean (#4364)
miheer vaidya [Tue, 19 Nov 2019 04:03:53 +0000 (21:03 -0700)]
add rule for clean (#4364)

* add rule for clean

* Update clean rule

Seems like lib/ directory is not made by the makefile
So don't delete directory, just the contents of it.

4 years agoreminding message for TVM_REGISTER_NODE_TYPE (#4365)
Yizhi Liu [Mon, 18 Nov 2019 23:10:10 +0000 (15:10 -0800)]
reminding message for TVM_REGISTER_NODE_TYPE (#4365)

4 years agofix Android and OpenCL docker install (#4363)
Cody Hao Yu [Mon, 18 Nov 2019 19:24:39 +0000 (11:24 -0800)]
fix Android and OpenCL docker install (#4363)

4 years ago[SOURCE] Add ASF header to __init__.py files (#4359)
Tianqi Chen [Mon, 18 Nov 2019 18:22:25 +0000 (10:22 -0800)]
[SOURCE] Add ASF header to __init__.py files (#4359)

4 years ago[Frontend]Add TensorFlow FloorMod (#4308)
Yao Wang [Mon, 18 Nov 2019 03:54:34 +0000 (19:54 -0800)]
[Frontend]Add TensorFlow FloorMod (#4308)

* Add tf FloorMod

* Add floor_div/mod into topi and relay

* Add to rst

* Fix test

4 years ago[Relay][Frontend][Tensorflow]Add conv2d_transpose (#4300)
optima2005 [Mon, 18 Nov 2019 01:24:44 +0000 (09:24 +0800)]
[Relay][Frontend][Tensorflow]Add conv2d_transpose (#4300)

* [Relay][Frontend][Tensorflow]Add conv2d_transpose

* add transformation from NHWC to NCHW to compatible with TVM conv2d_transpose implementation

* remove 'dilations' paramater to compitable with TF1.3

4 years agoSend list as argument to schedule_conv2d (#4358)
miheer vaidya [Mon, 18 Nov 2019 00:39:36 +0000 (17:39 -0700)]
Send list as argument to schedule_conv2d (#4358)

When getting cuda schedule passing single tensor seem to work but after changing target to "llvm" causes assert.
Sending list on other hand makes both cuda and llvm targets happy.
See https://discuss.tvm.ai/t/solved-simple-example-error-attributeerror-tensorslice-object-has-no-attribute-op/2245/3

4 years agoFix docstring in topi.nn.fifo_buffer (#4349)
Philip Hyunsu Cho [Sat, 16 Nov 2019 16:40:38 +0000 (08:40 -0800)]
Fix docstring in topi.nn.fifo_buffer (#4349)

4 years agoRetain qnn input kernel scales (#4292)
Ramana Radhakrishnan [Sat, 16 Nov 2019 16:39:19 +0000 (16:39 +0000)]
Retain qnn input kernel scales (#4292)

* Add qnn conv2d attributes for input_tensor_scale and
kernel_tensor_scale.

The lowering in the tflite frontend loses the input_tensor_scale
and the kernel_tensor_scale by multiplying it and putting it into
the Requantize operation. This means that any graph partitioning
passes or other passes that need to access this information no longer
have it available in the qnn dialect.

regards
Ramana

* Store input tensor scale and Weight tensor scale for Dense as well

As for conv2d, the tflite frontend drops the input tensor
scale and the weight tensor scale from the relay op. Store
it as separate fields in there.

* Fix unintentional tab

* Rename input_tensor_scale to input_scale and kernel_tensor_scale
to kernel_scale for conv2d.

* input_tensor_scale -> input_scale weight_tensor_scale->weight_scale

* Rework dense testcase

And use input_scale and kernel_scale

* Be consistent in use of input_scale and kernel_scale values

* Fixup qnn conv2d tests for input_scale and kernel_scale

* Make pydoc identical between conv2d and dense for weight_tensor

* Fix up conv2d parameters to be in the same order between C++ and python

* Fix ordering of parameters for dense.

* Add input_scale and output_scale to try and satisfy ci gods

* Delete input_scale and kernel_scale.

nn.conv2d does not contain input_scale and kernel_scale. We need
to delete it when lowering it to nn.conv2d.

* Add input_scale and kernel_scale for qnn.conv2d