platform/upstream/tvm.git
5 years agoVulkan2 Runtime API (#3849)
Andrew Tulloch [Fri, 13 Sep 2019 20:40:43 +0000 (13:40 -0700)]
Vulkan2 Runtime API (#3849)

5 years ago[VTA] RPC path update. (#3924)
Hua Jiang [Fri, 13 Sep 2019 20:33:55 +0000 (13:33 -0700)]
[VTA] RPC path update. (#3924)

Issue:
The RPC path was changed from "pynq_rpc" to "vta_rpc", but the related
documentation still uses the old information.

Solution:
Update RPC path information.

5 years agoAdd AVX512VNNI support for TVM (#3388)
Jianyu Huang [Fri, 13 Sep 2019 20:27:40 +0000 (13:27 -0700)]
Add AVX512VNNI support for TVM (#3388)

5 years agoRefactoring x86 conv2d_NCHWc (#3944)
Animesh Jain [Fri, 13 Sep 2019 18:38:14 +0000 (11:38 -0700)]
Refactoring x86 conv2d_NCHWc (#3944)

5 years agoFix CUDA int8x4 vectorize (#3928)
noituIover [Fri, 13 Sep 2019 01:04:52 +0000 (09:04 +0800)]
Fix CUDA int8x4 vectorize (#3928)

* Fix int8x4 vectorize

* Fix gpu shared/local memory accumulate

* Add test_shared_memory for int8x4

* Adjust test format

* Fix cpplint

5 years agoDo type checking for the input and kernel in the qnn conv2d (#3904)
shoubhik [Thu, 12 Sep 2019 23:34:20 +0000 (16:34 -0700)]
Do type checking for the input and kernel in the qnn conv2d (#3904)

* [QNN] Convolution 2D Implementation.

Rebasing. Empty commit.

Clang-format styling.

* Reformatting code.

* Fixing lint issues.

5 years ago[TOPI][CUDA] Support cuBLAS BatchMatMul (#3936)
Jon Soifer [Thu, 12 Sep 2019 20:04:45 +0000 (13:04 -0700)]
[TOPI][CUDA] Support cuBLAS BatchMatMul (#3936)

* Support cuBLAS BatchMatMul

* Add test and check target name

5 years ago[RFC] [Contrib] Minimal runtime (~12kb .text on ARMv7/x86) for subset of TVM models...
Andrew Tulloch [Thu, 12 Sep 2019 19:32:01 +0000 (12:32 -0700)]
[RFC] [Contrib] Minimal runtime (~12kb .text on ARMv7/x86) for subset of TVM models (#3567)

This is an alternative implementation of a subset of the TVM runtime API (and
graph runtime) that focuses entirely on reducing code size, at the expense of
functionality (no tvm.extern(..) calls via PackedFunc, CPU only, etc). It might
be worth incrementally expanding the surface area if there's interest.

The motivation for this work was seeing what the minimal useful subset of the
TVM runtime is. This is relevant for heavily code-size-constrained
applications, e.g. in embedded/mobile. The current runtime is more like O(100KiB)
or so, so this might be compelling for some users.

The smaller surface area for auditing might make this relevant for
https://github.com/dmlc/tvm/issues/3159, or the use cases I was thinking about in
https://github.com/dmlc/tvm/issues/2523#issuecomment-459165815 re: the Rust
runtime.

The symbols in the tvm::minimalruntime space (i.e. excluding std:: and
picojson::) are about 5KiB, so I think there's a bunch of room here (i.e. we
could replace picojson:: with [`jsmn`](https://zserge.com/jsmn.html) or
something, and we could replace more of the `std::unordered_map` usage, etc.
with custom primitives as well, similar to the `DynArray`).

5 years ago[Relay][Module] Refactor the way we interface between different modules of Relay...
Jared Roesch [Thu, 12 Sep 2019 03:39:56 +0000 (22:39 -0500)]
[Relay][Module] Refactor the way we interface between different modules of Relay. (#3906)

* Module refactor

* Add load module

* Add support for idempotent import

* Tweak load paths

* Move path around

* Expose C++ import functions in Python

* Fix import

* Add doc string

* Fix

* Fix lint

* Fix lint

* Fix test failure

* Add type solver

* Fix lint

5 years ago[Community] Add reviewer Balint Cristian (#3935)
Lianmin Zheng [Wed, 11 Sep 2019 21:32:15 +0000 (14:32 -0700)]
[Community] Add reviewer Balint Cristian (#3935)

5 years ago[Arm] parallel batch axis (#3931)
Yizhi Liu [Wed, 11 Sep 2019 18:10:48 +0000 (02:10 +0800)]
[Arm] parallel batch axis (#3931)

* support LLVM trunk

* guard with USE_LLVM in if condition for c++14

* GREATER_EQUAL -> GREATER

* [Arm] parallel batch axis

5 years ago[TFLite] Support depthwise convolution multiplier greater than 1 (#3922)
Zhao Wu [Wed, 11 Sep 2019 04:09:25 +0000 (12:09 +0800)]
[TFLite] Support depthwise convolution multiplier greater than 1 (#3922)

5 years ago[Relay] fix exponential blowup in interpreter (#3559)
雾雨魔理沙 [Wed, 11 Sep 2019 03:30:46 +0000 (20:30 -0700)]
[Relay] fix exponential blowup in interpreter (#3559)

5 years ago[Relay][Frontend][Keras] Fix ReLU in Keras Converter missed the case (#3917)
Neo Chien [Tue, 10 Sep 2019 17:41:16 +0000 (01:41 +0800)]
[Relay][Frontend][Keras] Fix ReLU in Keras Converter missed the case (#3917)

* [Relay][Frontend][Keras] Fix ReLU in Keras Converter missed the case

* [Relay][Frontend][Keras] Add test case for ReLU in Keras Converter missed the case

* [Relay][Frontend][Keras] Add test case for ReLU in Keras Converter missed the case

5 years ago[CODEGEN] Remove incorrect check for LLVM in C codegen test (#3921)
Pratyush Patel [Tue, 10 Sep 2019 00:43:01 +0000 (17:43 -0700)]
[CODEGEN] Remove incorrect check for LLVM in C codegen test (#3921)

5 years ago[Relay][Training] Add gradient for max. (#3915)
雾雨魔理沙 [Mon, 9 Sep 2019 19:48:04 +0000 (12:48 -0700)]
[Relay][Training] Add gradient for max. (#3915)

* save

* save

5 years ago[VTA][Config] hotfix denano10 (#3918)
Luis Vega [Mon, 9 Sep 2019 17:31:31 +0000 (10:31 -0700)]
[VTA][Config] hotfix denano10 (#3918)

5 years agoNumpy compatible dtype inference for `tvm.convert` and `tvm.const` (#3861)
Xingjian Shi [Mon, 9 Sep 2019 17:26:34 +0000 (10:26 -0700)]
Numpy compatible dtype inference for `tvm.convert` and `tvm.const` (#3861)

* numpy compatible type inference

* update

* try to fix

* fix

* try to fix

* fix lint

* Update nn.h

* cast to int32

* try to fix

* fix again

* retrigger ci

5 years ago[Relay/TOPI][Op] Add erf intrinsic and op (#3702)
Haichen Shen [Mon, 9 Sep 2019 14:54:15 +0000 (07:54 -0700)]
[Relay/TOPI][Op] Add erf intrinsic and op (#3702)

* add more ops

* stop vectorization for erf

* x

* cleanup

* fix

* add whitelist for vectorizable intrin

* add tf converter

* fix dense

* fix

* add missing intrin

* fix mxnet frontend

* fix nvptx

5 years ago[Relay][Training] Add gradient for cast (#3894)
雾雨魔理沙 [Sun, 8 Sep 2019 03:11:47 +0000 (20:11 -0700)]
[Relay][Training] Add gradient for cast (#3894)

save

fix

fix grad

5 years agochange docker install script (#3524)
雾雨魔理沙 [Sun, 8 Sep 2019 00:10:11 +0000 (17:10 -0700)]
change docker install script (#3524)

5 years ago[Fix] Fix blas cmake for mac os (#3898)
Haichen Shen [Sat, 7 Sep 2019 21:34:32 +0000 (14:34 -0700)]
[Fix] Fix blas cmake for mac os (#3898)

* fix cmake for mac os

* rename

5 years agoSupport LLVM trunk (#3907)
Yizhi Liu [Sat, 7 Sep 2019 18:43:29 +0000 (02:43 +0800)]
Support LLVM trunk (#3907)

* support LLVM trunk

* guard with USE_LLVM in if condition for c++14

* GREATER_EQUAL -> GREATER

5 years agoFix a typo (#3913)
noituIover [Sat, 7 Sep 2019 16:44:39 +0000 (00:44 +0800)]
Fix a typo (#3913)

5 years agoAdd .hsaco save/load for ROCm target (#3852)
Peter Yeh [Sat, 7 Sep 2019 03:41:35 +0000 (20:41 -0700)]
Add .hsaco save/load for ROCm target (#3852)

fix lld

5 years agoadd luis as reviewer (#3909)
Haichen Shen [Sat, 7 Sep 2019 00:12:56 +0000 (17:12 -0700)]
add luis as reviewer (#3909)

5 years ago[VTA] Support TLPP in function simulator. (#3555)
Hua Jiang [Sat, 7 Sep 2019 00:03:51 +0000 (17:03 -0700)]
[VTA] Support TLPP in function simulator. (#3555)

* [VTA] Support TLPP in function simulator.
Issue:
Currently the VTA function simulator only executes instructions
serially, so the dependency logic of the runtime ISA, which is used for
task-level pipeline parallelism (TLPP), cannot be verified by the
function simulator.

Solution:
Make the simulator driver multi-threaded and add TLPP support.

Benefit:
TLPP support in the VTA function simulator makes testing, debugging,
and changing VTA logic easier.

replace boost lockfree queue

add configure control for simulator tlpp enable or disable.

change code style into google style.

Wrap queue read/write and sync logic to make function calls simpler.

Add some comments.

Remove MT logic, change into Single thread mode.

address review comments.

code style change to match google code style and add comments.

add cmake macro to enable/disable simulator tlpp logic.

submodule update.

correct file name mentioned in comments.

* remove USE_VTA_FSIM_TLPP.
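
The dependency logic that the TLPP commit above wants the function simulator to check can be pictured with a small toy model. This is only a sketch under stated assumptions, not the simulator's actual code: the three stage names, the `Insn` fields (`pop_prev`, `pop_next`, `push_prev`, `push_next`), and the `run` helper are hypothetical names chosen for illustration. Each stage may execute its next instruction only when the dependency tokens it waits on have been pushed by a neighbouring stage, and the loop is single-threaded round-robin.

```python
from collections import deque
from dataclasses import dataclass

# Hypothetical three-stage pipeline; names are illustrative only.
STAGES = ["load", "compute", "store"]


@dataclass
class Insn:
    name: str
    pop_prev: bool = False   # wait for a token from the previous stage
    pop_next: bool = False   # wait for a token from the next stage
    push_prev: bool = False  # send a token to the previous stage when done
    push_next: bool = False  # send a token to the next stage when done


def run(programs):
    """Execute per-stage instruction lists round-robin.

    Returns the execution order, or raises RuntimeError if no stage
    can make progress (a dependency deadlock).
    """
    queues = {s: deque(programs.get(s, [])) for s in STAGES}
    # tokens[(src, dst)] counts tokens sent from src to dst.
    tokens = {}
    for a, b in zip(STAGES, STAGES[1:]):
        tokens[(a, b)] = 0
        tokens[(b, a)] = 0
    trace = []
    while any(queues.values()):
        progressed = False
        for i, stage in enumerate(STAGES):
            if not queues[stage]:
                continue
            insn = queues[stage][0]
            prev_stage = STAGES[i - 1] if i > 0 else None
            next_stage = STAGES[i + 1] if i + 1 < len(STAGES) else None
            needed = []
            if insn.pop_prev and prev_stage:
                needed.append((prev_stage, stage))
            if insn.pop_next and next_stage:
                needed.append((next_stage, stage))
            if any(tokens[key] == 0 for key in needed):
                continue  # blocked: a required token has not arrived yet
            for key in needed:
                tokens[key] -= 1
            if insn.push_prev and prev_stage:
                tokens[(stage, prev_stage)] += 1
            if insn.push_next and next_stage:
                tokens[(stage, next_stage)] += 1
            queues[stage].popleft()
            trace.append(f"{stage}:{insn.name}")
            progressed = True
        if not progressed:
            raise RuntimeError("deadlock: every pending instruction waits on a token")
    return trace


programs = {
    "load": [Insn("ld0", push_next=True),
             Insn("ld1", pop_next=True, push_next=True)],
    "compute": [Insn("gemm0", pop_prev=True, push_prev=True, push_next=True),
                Insn("gemm1", pop_prev=True, push_next=True)],
    "store": [Insn("st0", pop_prev=True), Insn("st1", pop_prev=True)],
}
trace = run(programs)
print(trace)
```

A purely serial simulator never exercises the blocked branch, which is why wrong dependency flags can go unnoticed there; in this toy model a missing push shows up as the `RuntimeError`.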

5 years ago[TOPI] Intel graphics conv2d autotvm template added (#3839)
Leyuan Wang [Sat, 7 Sep 2019 00:01:29 +0000 (17:01 -0700)]
[TOPI] Intel graphics conv2d autotvm template added (#3839)

* update lint

* lint fixed

* lint updated

* lint fixed

* lint fixed

* lint fixed

* updates

* add intel graphics as a package

* remove print info

* depthwise conv2d schedule added for intel graphics

* asdf

* fix lint

* fix lint

* fix ci

* add channels

5 years agosave (#3901)
雾雨魔理沙 [Fri, 6 Sep 2019 22:17:37 +0000 (15:17 -0700)]
save (#3901)

5 years ago[Relay][Op] Make Type Relation catch more errors (#3899)
雾雨魔理沙 [Fri, 6 Sep 2019 18:51:27 +0000 (11:51 -0700)]
[Relay][Op] Make Type Relation catch more errors (#3899)

* save

* init

* move type_relations

5 years ago[Relay] Add ADTs to text format (#3863)
Logan Weber [Fri, 6 Sep 2019 18:04:34 +0000 (11:04 -0700)]
[Relay] Add ADTs to text format (#3863)

* Getting closer to having ADT defs

* ADT defs working probly

* Match parsing basically done

* came to earth in a silver chrome UFO

* match finished?

* All tests but newest are passing

* ADT constructors work

now cleanup?

* Cleanup round 1

* Cleanup round 2

* Cleanup round 3

* Cleanup round 4

* Cleanup round 6

* Cleanup round 7

* Lil grammar fix

* Remove ANTLR Java files

* Lint roller

* Lint roller

* Address feedback

* Test completeness in match test

* Remove unused imports

* Lint roller

* Switch to Rust-style ADT syntax

* Lil fix

* Add dummy `extern type` handler

* Add type arg to test

* Update prelude semantic version

* Repair test

* Fix graph var handling in match

* Revert 's/graph_equal/is_unifiable' change

5 years ago[bugfix] remove duplicate resize (#3902)
Yong Wu [Fri, 6 Sep 2019 15:30:04 +0000 (08:30 -0700)]
[bugfix] remove duplicate resize (#3902)

5 years agoAdd another MKL name alias for MKL (#3853)
Jason Knight [Fri, 6 Sep 2019 13:30:13 +0000 (06:30 -0700)]
Add another MKL name alias for MKL (#3853)

Installed through pypi

5 years ago[schedule] Improve ceil_divide in tile/split (#3842)
Yizhi Liu [Fri, 6 Sep 2019 13:29:31 +0000 (21:29 +0800)]
[schedule] Improve ceil_divide in tile/split (#3842)

5 years ago[PYTHON/FFI] Search PATH for DLLs (#3888)
Jon Soifer [Thu, 5 Sep 2019 23:42:29 +0000 (16:42 -0700)]
[PYTHON/FFI] Search PATH for DLLs (#3888)

* Search PATH for DLLs

* Fix lint issue

5 years ago[Relay] add Tuple pattern (#3596)
雾雨魔理沙 [Thu, 5 Sep 2019 23:41:44 +0000 (16:41 -0700)]
[Relay] add Tuple pattern (#3596)

* implement tuple pattern

* add tuple pattern

* lint;

* lint

* lint

* fix error

* fix

* add test

5 years agoFix int32 range overflow by using int64 (#3870)
kice [Thu, 5 Sep 2019 23:21:54 +0000 (19:21 -0400)]
Fix int32 range overflow by using int64 (#3870)

5 years ago[Relay] Fix operator fusion for multiple output (#3871)
雾雨魔理沙 [Thu, 5 Sep 2019 21:39:13 +0000 (14:39 -0700)]
[Relay] Fix operator fusion for multiple output (#3871)

* save

* add test

* refactor

* fix indent

* save

* refactor

5 years ago[DOC] Fix doc rendering (#3897)
Haibin Lin [Thu, 5 Sep 2019 18:48:57 +0000 (11:48 -0700)]
[DOC] Fix doc rendering (#3897)

* Update from_source.rst

* Update deploy_ssd_gluoncv.py

5 years ago[Test] enable NHWC of `relay.testing.mobilenet` (#3886)
黎明灰烬 [Thu, 5 Sep 2019 18:32:21 +0000 (02:32 +0800)]
[Test] enable NHWC of `relay.testing.mobilenet` (#3886)

* [Relay] enable NHWC of `relay.testing.mobilenet`

In this way, we can play around with NHWC inside TVM regardless of
the frontends.

* [Test] test for NHWC of relay.testing.mobilenet

5 years ago[VTA][TOPI] Conv2d transpose (deconvolution) operator support (#3777)
Thierry Moreau [Thu, 5 Sep 2019 18:29:42 +0000 (11:29 -0700)]
[VTA][TOPI] Conv2d transpose (deconvolution) operator support (#3777)

* initial conv2d_transpose

* correct select operator

* cleanup

* fix

* fix correctness check

* conv2d transpose declaration fix

* autotvm conv2d_transpose tuning script

* ir pass fix

* fix tuning script

* deriving params from env, adding bias

* removing bias comp from deconvolution

* lint

* fix

* lint

* lint

* turning off cpu

* lint, ops

* lint

* import fix

* removing hard coded values

* lint

5 years ago[VTA][Relay] Extending Vision model coverage compilation for VTA (#3740)
Thierry Moreau [Thu, 5 Sep 2019 18:17:09 +0000 (11:17 -0700)]
[VTA][Relay] Extending Vision model coverage compilation for VTA (#3740)

* adding support for graphpack over multiply op

* increasing resnet model coverage

* fix indentation

* lint

* moving recursion limit fix into graphpack pass

* moving recursionlimit to relay init

* pooling on NCHWnc format

* adding more models

* deploy_resnet_on_vta.py

* trailing line

* generalizing to vision models

* merge conflicts

* fix, apply quantization to VTA only

* improving comments

* trimming models that have runtime issues for the moment

* lint

* lint

* lint

5 years ago[Relay][Training] Small refactoring (#3893)
雾雨魔理沙 [Thu, 5 Sep 2019 18:13:07 +0000 (11:13 -0700)]
[Relay][Training] Small refactoring (#3893)

* init

* fix

5 years ago[QNN] Add - Refactoring to C++ (#3736)
Animesh Jain [Thu, 5 Sep 2019 17:22:45 +0000 (10:22 -0700)]
[QNN] Add - Refactoring to C++ (#3736)

5 years ago[VTA] de10-nano driver (#3394)
Liangfu Chen [Thu, 5 Sep 2019 16:52:10 +0000 (00:52 +0800)]
[VTA] de10-nano driver (#3394)

* rework;

* `de10-nano` -> `de10nano`;

* fix compilation error;

* bug fix;

* Update install.md

* Update install.md

* Update install.md

* update with current runtime;

* add debug messages;

* bug fix in cma kernel module;

5 years agoReveal hidden code snippets by inserting newline (#3892)
miheer vaidya [Thu, 5 Sep 2019 04:24:00 +0000 (22:24 -0600)]
Reveal hidden code snippets by inserting newline (#3892)

5 years ago[VTA][Chisel] add ISA BitPat generation (#3891)
Luis Vega [Wed, 4 Sep 2019 17:36:21 +0000 (10:36 -0700)]
[VTA][Chisel] add ISA BitPat generation (#3891)

5 years ago[QNN] Convolution 2D Implementation. (#3580)
Animesh Jain [Wed, 4 Sep 2019 17:05:22 +0000 (11:05 -0600)]
[QNN] Convolution 2D Implementation. (#3580)

Rebasing. Empty commit.

Clang-format styling.

5 years ago[TENSORFLOW] Convert scalar Const into tvm.relay.const (#3885)
lixiaoquan [Wed, 4 Sep 2019 16:57:20 +0000 (00:57 +0800)]
[TENSORFLOW] Convert scalar Const into tvm.relay.const (#3885)

* [TENSORFLOW] Convert scalar Const into tvm.relay.const

* use _get_num_param() and _get_list_param()

5 years ago[Relay] Add grads (#3857)
SWu [Wed, 4 Sep 2019 07:07:39 +0000 (03:07 -0400)]
[Relay] Add grads (#3857)

* Add gradient implementations

* Add docstrings to fix lint errors

5 years ago[Relay][Frontend][darknet] Solve tvm parsing darknet resnext failure bug (#3778)
youluexx [Wed, 4 Sep 2019 05:46:29 +0000 (13:46 +0800)]
[Relay][Frontend][darknet] Solve tvm parsing darknet resnext failure bug (#3778)

* test_darkent_bug

* test_darkent

* add resnext tests

5 years ago[VTA][Chisel] add scalafmt and format existing scala codebase (#3880)
Luis Vega [Wed, 4 Sep 2019 05:19:01 +0000 (22:19 -0700)]
[VTA][Chisel] add scalafmt and format existing scala codebase (#3880)

* [VTA][Chisel] add scalafmt and format existing scala codebase

* change column width to 100

* add scalafmt conf file as a valid file type

* add asf header to scalafmt conf file and rerun formatter

5 years ago[VTA] Fix TSIM compile error in Linux (add missing -fPIC flag) (#3876)
Liangfu Chen [Tue, 3 Sep 2019 16:31:32 +0000 (00:31 +0800)]
[VTA] Fix TSIM compile error in Linux (add missing -fPIC flag) (#3876)

* [VTA] Fix TSIM compile error in Linux (add missing -fPIC flag);

* [VTA] Fix TSIM compile error in Linux (add missing -fPIC flag);

* fix indentation problem;

5 years agoRevert "[Runtime] Allow parameter sharing between modules (#3489)" (#3884)
Tianqi Chen [Tue, 3 Sep 2019 07:31:04 +0000 (15:31 +0800)]
Revert "[Runtime] Allow parameter sharing between modules (#3489)" (#3884)

This reverts commit 224cc243b4e54a77d011644fe7d81bdee8e8116b.

5 years agoONNX frontend operator support: And (#3878)
Neo Chien [Tue, 3 Sep 2019 04:02:52 +0000 (12:02 +0800)]
ONNX frontend operator support: And (#3878)

5 years ago[Runtime] Allow parameter sharing between modules (#3489)
Yong Sun [Tue, 3 Sep 2019 03:53:42 +0000 (20:53 -0700)]
[Runtime] Allow parameter sharing between modules (#3489)

Because GraphRuntime does not provide control-flow logic, we have to
split our model into two parts, and we need to share parameters between
them to reduce memory usage.

Solution:
1) add "lazy_init_input" in graph's attributes
   "attrs": {
     ... ...
     "lazy_init_input": [
       "list_str",
       [
         "p0"
       ]
     ]
    }
2) allow un-allocated NDArray entry in SetupStorage
3) utilize "set_input_zero_copy" function to set parameters
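
As a rough illustration of step 1, the snippet below reads the "lazy_init_input" attribute out of a graph JSON string shaped like the fragment above. It is a minimal sketch, assuming the attribute is a two-element list of a type tag and a list of names as shown in the commit message; the `lazy_inputs` helper is a hypothetical name and is not part of the TVM API.

```python
import json

# Graph JSON fragment in the shape shown in the commit message; other
# fields of a real graph are omitted here.
graph_json = """
{
  "attrs": {
    "lazy_init_input": ["list_str", ["p0"]]
  }
}
"""


def lazy_inputs(graph_json_text):
    """Return the input names marked for lazy initialization (hypothetical helper)."""
    attrs = json.loads(graph_json_text).get("attrs", {})
    entry = attrs.get("lazy_init_input")
    if entry is None:
        return []
    type_tag, names = entry
    if type_tag != "list_str":
        raise ValueError("unexpected type tag: " + str(type_tag))
    return list(names)


print(lazy_inputs(graph_json))
```

Under the scheme described in the commit, a runtime would skip storage allocation for the returned names and expect them to be supplied later through "set_input_zero_copy".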

5 years agoRemove extern C wrapper for cuBLAS (#3877)
kice [Tue, 3 Sep 2019 02:35:14 +0000 (22:35 -0400)]
Remove extern C wrapper for cuBLAS (#3877)

5 years ago[VTA][Chisel] rename USE_TSIM macro with USE_VTA64 and cleanup runtime (#3872)
Luis Vega [Mon, 2 Sep 2019 18:08:53 +0000 (11:08 -0700)]
[VTA][Chisel] rename USE_TSIM macro with USE_VTA64 and cleanup runtime (#3872)

5 years ago[Relay] [Parser] fix parser for cast. (#3873)
雾雨魔理沙 [Mon, 2 Sep 2019 15:22:11 +0000 (08:22 -0700)]
[Relay] [Parser] fix parser for cast. (#3873)

* fix

* lint

5 years ago[WIP][µTVM] Add OpenOCD Low-Level Device (RISC-V Support) (#3756)
Logan Weber [Mon, 2 Sep 2019 07:32:52 +0000 (00:32 -0700)]
[WIP][µTVM] Add OpenOCD Low-Level Device (RISC-V Support) (#3756)

5 years ago[QNN] Requantize - Optimize lowering for some corner cases. (#3864)
Animesh Jain [Mon, 2 Sep 2019 02:58:38 +0000 (20:58 -0600)]
[QNN] Requantize - Optimize lowering for some corner cases. (#3864)

5 years agoImplementation of tile for TFLite (#3814)
Neo Chien [Sun, 1 Sep 2019 01:56:39 +0000 (09:56 +0800)]
Implementation of tile for TFLite (#3814)

5 years ago[Relay][Any] Add shape func for dynamic shape (#3606)
Haichen Shen [Sun, 1 Sep 2019 01:50:22 +0000 (18:50 -0700)]
[Relay][Any] Add shape func for dynamic shape (#3606)

* init shape func in interpreter and vm compiler

* Update interpreter

* fix

* lint

* lint

* fix

* remove hack

* update

* fix

* fix

* update

* address comments & update for shape_of

* fix lint

* update

* fix hybrid

* lint

* fix bug & add take shape func

* lint

* lint

* update

* fix flaky test

* add todo

5 years ago[NNVM][FRONTEND][ONNX] Fix PReLU conversion (#3813)
Alexey Golunov [Sun, 1 Sep 2019 00:52:42 +0000 (03:52 +0300)]
[NNVM][FRONTEND][ONNX] Fix PReLU conversion (#3813)

5 years ago[Relay] Bitserial ops (#3844)
Josh Fromm [Sun, 1 Sep 2019 00:51:51 +0000 (17:51 -0700)]
[Relay] Bitserial ops (#3844)

* Added arm_cpu NHWC schedules.

* Fixed kernel shape legalization.

* Added bitserial ops to relay.

* Snapshot and more missing files.

* Added dense testing.

* Added tests

* Added ASF header to new files.

* cc lint

* Pylint change.

* pylint fixes.

* Change arm legalize test.

* Added assert check to arm legalize.

* Added better documentation, fixed some bad style

* Reverted arm conv2d nhwc changes.

5 years agoAdd not operator for the frontend/onnx.py (#3836)
Neo Chien [Sun, 1 Sep 2019 00:50:36 +0000 (08:50 +0800)]
Add not operator for the frontend/onnx.py (#3836)

5 years ago[VTA][TSIM] add virtual memory support to tsim example (#3868)
Luis Vega [Sun, 1 Sep 2019 00:49:17 +0000 (17:49 -0700)]
[VTA][TSIM] add virtual memory support to tsim example (#3868)

* [VTA][TSIM] add virtual memory support to tsim example

* fix indentation

* remove USE_TSIM macro and use 32-bit addr instead

5 years agoImprove numerical gradient check (#3856)
SWu [Sat, 31 Aug 2019 06:25:16 +0000 (02:25 -0400)]
Improve numerical gradient check (#3856)

5 years agoAdd more cases to keras _convert_reshape (#3846)
Alexander Pivovarov [Sat, 31 Aug 2019 04:30:59 +0000 (21:30 -0700)]
Add more cases to keras _convert_reshape (#3846)

5 years ago[QNN] Concat - Refactoring to C++ (#3819)
Animesh Jain [Sat, 31 Aug 2019 04:30:18 +0000 (21:30 -0700)]
[QNN] Concat - Refactoring to C++ (#3819)

5 years ago[Relay][QNN] Moving Conv, Dense, Concatenate InferTypes to header for sharing. (...
Animesh Jain [Fri, 30 Aug 2019 16:12:03 +0000 (09:12 -0700)]
[Relay][QNN] Moving Conv, Dense, Concatenate InferTypes to header for sharing. (#3783)

5 years ago[Relay][QNN] QNNtoRelay & QNNLegalize Pass utility using Relay Legalize API. (#3838)
Animesh Jain [Fri, 30 Aug 2019 16:10:25 +0000 (09:10 -0700)]
[Relay][QNN] QNNtoRelay & QNNLegalize Pass utility using Relay Legalize API. (#3838)

5 years agocodegen_spirv support Call::reinterpret (#3795)
Andrew Tulloch [Fri, 30 Aug 2019 00:25:07 +0000 (17:25 -0700)]
codegen_spirv support Call::reinterpret (#3795)

5 years ago[VTA] Infinite recursive device_api.ext_dev call fix. (#3843)
Hua Jiang [Thu, 29 Aug 2019 21:26:39 +0000 (14:26 -0700)]
[VTA] Infinite recursive device_api.ext_dev call fix. (#3843)

Issue:
When running VTA on an FPGA board, device_api.ext_dev recurses
infinitely, which overflows the stack and makes VTA fail.

Analysis:
The device_api.ext_dev function in rpc_server.py is used to load the
VTA library. Once the VTA library is loaded, it replaces
device_api.ext_dev with the VTA function; VTA's device_api.cc does
this work. Because of a logic issue in VTA.cmake, that file was not
compiled, so VTA kept failing in rpc_server.py.

Solution:
Fix the logic issue in VTA.cmake.
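
The failure mode in the analysis above can be reproduced with a toy function registry. This is only a sketch under stated assumptions: the `registry` dict, `load_vta_library`, `make_placeholder`, and `resolve` are hypothetical stand-ins and do not reflect the real rpc_server.py or the TVM function registry. The placeholder loads the library and then calls the registered function again; if loading does not replace the registration, the placeholder ends up calling itself.

```python
# Toy global function registry (hypothetical stand-in).
registry = {}


def load_vta_library(registers_device_api):
    """Stand-in for loading the VTA shared library.

    When the file that registers the device API is part of the build,
    loading the library overrides the placeholder registration.
    """
    if registers_device_api:
        registry["device_api.ext_dev"] = lambda: "vta device api"


def make_placeholder(registers_device_api):
    def ext_dev():
        # Load the library, then dispatch to whatever is registered now.
        load_vta_library(registers_device_api)
        return registry["device_api.ext_dev"]()
    return ext_dev


def resolve(registers_device_api):
    registry["device_api.ext_dev"] = make_placeholder(registers_device_api)
    return registry["device_api.ext_dev"]()


print(resolve(True))  # the library replaced the placeholder
try:
    resolve(False)    # nothing replaced it, so the placeholder recurses
except RecursionError:
    print("placeholder called itself until the stack limit was hit")
```

In this model the fix corresponds to making sure the registering file is always built, so that the `registers_device_api=False` case cannot occur.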

5 years ago[VTA] Fix RewriteForceSerial Function logic issue. (#3854)
Hua Jiang [Thu, 29 Aug 2019 17:48:42 +0000 (10:48 -0700)]
[VTA] Fix RewriteForceSerial Function logic issue. (#3854)

Issue:
RewriteForceSerial is a debug function that forces instructions to
run serially instead of in parallel. This lets us isolate
parallelism problems or compare performance between parallel and
serial execution. However, once it is enabled by setting the debug
flag, VTA gets stuck when running on a PYNQ board.

Analysis:
Once RewriteForceSerial is enabled, the dependency logic differs from
the default, but the same logic is still used to generate FINISH
and other logic, which causes a deadlock.

Solution:
Use different dependency settings when RewriteForceSerial is enabled.

5 years ago[Relay] Conv2d grad (#3636)
Wuwei Lin [Thu, 29 Aug 2019 17:36:06 +0000 (13:36 -0400)]
[Relay] Conv2d grad (#3636)

* [Relay] Conv2d grad

* Fix test

* Fix first order gradient

5 years agoAdd build_create_shared_func to tvm/contrib/cc.py (#3840)
Alexander Pivovarov [Thu, 29 Aug 2019 16:59:40 +0000 (09:59 -0700)]
Add build_create_shared_func to tvm/contrib/cc.py (#3840)

5 years agoImprovements in conda recipe (#3791)
Marcelo Duarte Trevisani [Thu, 29 Aug 2019 03:14:04 +0000 (04:14 +0100)]
Improvements in conda recipe (#3791)

5 years ago[TEST] Not assuming HOME in tvm/download.py (#3803)
Neo Chien [Thu, 29 Aug 2019 03:12:28 +0000 (11:12 +0800)]
[TEST] Not assuming HOME in tvm/download.py (#3803)

* Not assuming HOME in tvm/download.py

* Trigger notification

5 years ago[runtime] reduce set_input and set_input_zero_copy overhead (#3805)
hlu1 [Thu, 29 Aug 2019 03:08:21 +0000 (20:08 -0700)]
[runtime] reduce set_input and set_input_zero_copy overhead (#3805)

5 years ago[TensorFlow] Fix limitation that depth_mult can only be 1 for DepthwiseConv2dNative...
lixiaoquan [Thu, 29 Aug 2019 02:59:06 +0000 (10:59 +0800)]
[TensorFlow] Fix limitation that depth_mult can only be 1 for DepthwiseConv2dNative (#3676)

* [TensorFlow] Fix limitation that depth_mult can only be 1 for DepthwiseConv2dNative

* Improve code readability

5 years agoSupport MKL on Windows (#3837)
Jon Soifer [Thu, 29 Aug 2019 02:44:43 +0000 (19:44 -0700)]
Support MKL on Windows (#3837)

5 years ago[AutoTVM] Fix database APIs (#3821)
Cody Hao Yu [Wed, 28 Aug 2019 17:04:12 +0000 (10:04 -0700)]
[AutoTVM] Fix database APIs (#3821)

* [AutoTVM] Fix database APIs

* Refactor the byte conversion

5 years ago[Relay][Keras] Dot (#3668)
Yong Wu [Wed, 28 Aug 2019 06:16:48 +0000 (23:16 -0700)]
[Relay][Keras] Dot (#3668)

* [Relay][Keras] Dot

* fix reshape

* fix comments

5 years ago[Bugfix][Keras] axis of softmax (#3834)
Yong Wu [Wed, 28 Aug 2019 02:15:51 +0000 (19:15 -0700)]
[Bugfix][Keras] axis of softmax (#3834)

5 years ago[VTA] Parameterization and bug fix in TensorLoad module (#3841)
Liangfu Chen [Tue, 27 Aug 2019 06:13:18 +0000 (14:13 +0800)]
[VTA] Parameterization and bug fix in TensorLoad module (#3841)

5 years agoupdate docs for installation for CUDA (#3832)
Yixing Jiang [Mon, 26 Aug 2019 18:41:27 +0000 (11:41 -0700)]
update docs for installation for CUDA (#3832)

5 years agoFix inconsistent python/cpp API behavior for if_then_else, power (#3829)
Xingjian Shi [Mon, 26 Aug 2019 18:31:10 +0000 (11:31 -0700)]
Fix inconsistent python/cpp API behavior for if_then_else, power (#3829)

* fix inconsistent python/cpp APIs for if_then_else

* fix error message

* fix power consistency

* fix

* fix bug

* add test

5 years ago[VTA][TSIM] Introduce Virtual Memory for TSIM Driver (#3686)
Liangfu Chen [Mon, 26 Aug 2019 16:36:23 +0000 (00:36 +0800)]
[VTA][TSIM] Introduce Virtual Memory for TSIM Driver (#3686)

* initial virtual memory;

* initial integration;

* include the header file in cmake;

* implement allocation with virtual to logical address mapping;

* virtual memory for tsim_driver;

* implement the missing memory release function;

* readability improvement;

* readability improvement;

* address review comments;

* improved robustness in virtual memory allocation;

* remove VTA_TSIM_USE_VIRTUAL_MEMORY macro and use virtual memory for tsim by default;

* link tvm against vta library;

* merge with master

* build virtual memory system without linking tvm against vta;

* minor change;

* reuse VTA_PAGE_BYTES;

* using DRAM class from sim_driver as VirtualMemoryManager;

* satisfy linter;

* add comments in code;

* undo changes to Makefile

* undo changes to Makefile

* retrigger ci;

* retrigger ci;

* directly call into VirtualMemoryManager::Global()

5 years agoFix code comment of operators (#3830)
Neo Chien [Sat, 24 Aug 2019 20:32:55 +0000 (04:32 +0800)]
Fix code comment of operators (#3830)

5 years agoFixed repo change for llvm-9 to resolve missing dependency issue when building images...
tristan-arm [Fri, 23 Aug 2019 13:06:54 +0000 (14:06 +0100)]
Fixed repo change for llvm-9 to resolve missing dependency issue when building images with llvm enabled (#3826)

5 years agoUpdate hybrid_script.rst (#3799)
Haibin Lin [Fri, 23 Aug 2019 04:50:59 +0000 (21:50 -0700)]
Update hybrid_script.rst (#3799)

5 years ago[Legalize][QNN] Pass out_types to Legalize. Update QNN requantize to read from out_ty...
Animesh Jain [Fri, 23 Aug 2019 04:50:00 +0000 (21:50 -0700)]
[Legalize][QNN] Pass out_types to Legalize. Update QNN requantize to read from out_types. (#3782)

5 years ago[CODE] Halide attributions (#3824)
Tianqi Chen [Fri, 23 Aug 2019 04:21:49 +0000 (12:21 +0800)]
[CODE] Halide attributions (#3824)

5 years ago[Relay] Fix typo in parser (#3785)
雾雨魔理沙 [Fri, 23 Aug 2019 03:16:15 +0000 (20:16 -0700)]
[Relay] Fix typo in parser (#3785)

5 years ago[TVM] Fix warnings (#3817)
lixiaoquan [Thu, 22 Aug 2019 23:05:35 +0000 (07:05 +0800)]
[TVM] Fix warnings (#3817)

transform.h:118:3: warning: 'const' type qualifier on return type has no
effect
attrs.h:68:3: note: expanded from macro 'TVM_DECLARE_ATTRS'
node.h:244:3: note: expanded from macro 'TVM_DECLARE_NODE_TYPE_INFO'

transform.h:95:3: warning: extra ';' after member function definition
attrs.h:68:62: note: expanded from macro 'TVM_DECLARE_ATTRS'

5 years ago[TOPI][Relay][TensorFlow] Add OneHot operator (#3781)
Jon Soifer [Thu, 22 Aug 2019 20:45:45 +0000 (13:45 -0700)]
[TOPI][Relay][TensorFlow] Add OneHot operator (#3781)

* Add one-hot to Relay

* topi implementation

* Working

* add topi test

* Add TF test

* Fix check

* fix linting issues

* fix documentation

* Fix documentation

* Add support for on_value, off_value, axis, dtype

* Add full support for axis

* Fix compute and update test_forward

* Move on_value and off_value to inputs

* Add topi test

* Update tests

* Update docs

* Fix style

* re-enable tests

* Add one_hot to mxnet converter

5 years agoChanged topi cc resize to python implementation with new features. (#3788)
Josh Fromm [Thu, 22 Aug 2019 03:39:09 +0000 (20:39 -0700)]
Changed topi cc resize to python implementation with new features. (#3788)

5 years ago[TOPI] Use cblas for dense and batch_matmul when "cblas" is in the target libraries...
Jon Soifer [Wed, 21 Aug 2019 05:24:10 +0000 (22:24 -0700)]
[TOPI] Use cblas for dense and batch_matmul when "cblas" is in the target libraries (#3787)

* Support cblas library in dense

* start to add support for generic batch_matmul compute

* Add x86 override for batch_matmul

* Fix linting

* reset file

* Fix typos

* dummy change to re-trigger CI

5 years ago[Relay][VM]VM Profiler (#3727)
Wei Chen [Wed, 21 Aug 2019 04:28:05 +0000 (21:28 -0700)]
[Relay][VM]VM Profiler (#3727)

* [Relay][VM]VM debugger

* Report mean/min/max for op duration

* Typos

* Lint

* Lint

* Lint

* Support build debug VM in CMake

* Lint

* Enable VM debug in unit test

* Disable debug vm test until new docker image is built

* Add device sync code

* Fix qnn unit test

* Disable vm debug by default

* Rename files

* Rename classes

* Fix comment

* Fix comment