Neo Chien [Fri, 20 Sep 2019 19:17:11 +0000 (03:17 +0800)]
[Relay][Frontend][ONNX] operator support: Tile (#3941)
* [Relay][Frontend][ONNX] operator support: Tile
* Trigger notification
Tianqi Chen [Fri, 20 Sep 2019 17:17:04 +0000 (10:17 -0700)]
[ARITH] Add Lowering rule for FloorDiv/Mod (#3976)
* [ARITH] Add Lowering rule for FloorDiv/Mod
* add comment about constant folding
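
A minimal sketch of how floor division and floor modulo can be expressed with truncating (C-style) division. The commit message does not spell out the rule, so this shows the standard identity, not necessarily the exact lowering in TVM. The function names are hypothetical.

```python
def trunc_div(a: int, b: int) -> int:
    """C-style integer division: the quotient is rounded toward zero."""
    q = abs(a) // abs(b)
    return q if (a < 0) == (b < 0) else -q


def floor_div_via_trunc(a: int, b: int) -> int:
    """Floor division built from truncating division plus a correction."""
    q = trunc_div(a, b)
    r = a - q * b
    # Truncation and floor differ only when the remainder is non-zero
    # and its sign is opposite to the sign of the divisor.
    if r != 0 and (r < 0) != (b < 0):
        q -= 1
    return q


def floor_mod_via_trunc(a: int, b: int) -> int:
    """Floor modulo, defined so that a == floor_div * b + floor_mod."""
    return a - floor_div_via_trunc(a, b) * b
```

When both operands are known to be non-negative, the correction never fires and the truncating quotient can be used directly.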
Alex Gladkov [Fri, 20 Sep 2019 03:49:34 +0000 (20:49 -0700)]
Add support for MXNet pad operator. (#3739)
MXNet pad is described at:
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.pad
Add support for parameter 'None' in MXNet slice operator.
MXNet 'slice' is described at
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.slice
Add support for MXNet cos, sin, arctan
MXNet 'cos' is described at
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.cos
MXNet 'sin' is described at
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.sin
MXNet arctan is described at
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.arctan
Add support for MXNet 1D Convolution and 1D Deconvolution
MXNet convolution is described at:
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.Convolution
MXNet Deconvolution is described at:
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.Deconvolution
Animesh Jain [Fri, 20 Sep 2019 00:06:44 +0000 (17:06 -0700)]
[QNN] Renaming tests to follow the Relay nomenclature. (#3975)
Cody Hao Yu [Thu, 19 Sep 2019 21:23:20 +0000 (14:23 -0700)]
[TOPI] Add proper scheduling for dense on CUDA (#3923)
* add proper scheduling for dense on CUDA
* add fallback config and fix unit test
* fix corner cases
* refactoring
* fix bias and add testcase
* let fusion happen
Meghan Cowan [Thu, 19 Sep 2019 17:36:30 +0000 (10:36 -0700)]
Remove GTest cmake flag from install docs (#3953)
Ina Dobreva [Thu, 19 Sep 2019 17:33:26 +0000 (18:33 +0100)]
adjust pylint output (#3973)
adjust pylint output to show file location to make it possible to locate errors
Animesh Jain [Thu, 19 Sep 2019 04:54:01 +0000 (21:54 -0700)]
[Relay] Legalize and AlterOpLayout for Int8 Intel. (#3961)
Tianqi Chen [Thu, 19 Sep 2019 04:02:30 +0000 (21:02 -0700)]
[ARITH] Introduce base-class IRMutatorWithAnalyzer for scope dependent analysis (#3969)
Ligeng Zhu [Wed, 18 Sep 2019 23:03:09 +0000 (19:03 -0400)]
[Relay] Add shape check for ConcatenateRel and StackRel (#3699)
* [Relay] add shape check for concat
* [Relay] add shape check for stack
* add test case for shape mismatch
* [typo] add the missing assert
* fix lint errors.
* replace int with size_t.
* statically cast param->axis to size_t.
* switch to run_infer_type.
* fix checking for negative index
* add static_cast for param->axis
* merge to latest tvm
* fix lint error
* Fix an error with negative index.
* Update transform.h
* Update transform.cc
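
A rough sketch of the kind of shape check the bullets above describe for concatenation, including the negative-axis handling. This is plain Python on shape tuples for illustration, not the C++ type relation in transform.cc; `concat_shape` is a hypothetical name.

```python
from typing import List, Sequence


def concat_shape(shapes: Sequence[Sequence[int]], axis: int) -> List[int]:
    """Return the concatenated shape, or raise ValueError on a mismatch."""
    if not shapes:
        raise ValueError("need at least one input shape")
    ndim = len(shapes[0])
    if not -ndim <= axis < ndim:
        raise ValueError(f"axis {axis} out of range for {ndim} dimensions")
    # Normalize a negative axis before using it as an index.
    if axis < 0:
        axis += ndim
    out = list(shapes[0])
    for shape in shapes[1:]:
        if len(shape) != ndim:
            raise ValueError("all inputs must have the same number of dimensions")
        for i, (expected, actual) in enumerate(zip(shapes[0], shape)):
            # Every dimension except the concatenation axis must match.
            if i != axis and expected != actual:
                raise ValueError(f"dimension {i} mismatch: {expected} vs {actual}")
        out[axis] += shape[axis]
    return out
```

For example, `concat_shape([(2, 3), (2, 5)], axis=-1)` gives `[2, 8]`, while `concat_shape([(2, 3), (4, 3)], axis=1)` raises `ValueError`.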
Neo Chien [Wed, 18 Sep 2019 15:12:32 +0000 (23:12 +0800)]
[TVM][AutoTVM] cast filepath arguments to string (#3968)
Josh Fromm [Wed, 18 Sep 2019 07:40:31 +0000 (00:40 -0700)]
[Relay] Keras frontend upsample and 1 channel conv2d fixes (#3937)
* Fix upsample layout in keras frontend.
* Fixed group conv being used instead of conv when channels=1
* Add new conv2d test to catch bugs when channels=1.
shoubhik [Tue, 17 Sep 2019 23:39:34 +0000 (16:39 -0700)]
Adding support to check if an attribute is present or not without having to get the value (#3957)
* Adding support to check if an attribute is present or not without having to get the value.
* Renaming the method to a more appropriate name.
Andrew Tulloch [Tue, 17 Sep 2019 16:34:33 +0000 (09:34 -0700)]
[Vulkan] Minor optimization for deferred token lookups. (#3960)
Use a hash map keyed on the descriptor set to avoid bad asymptotic behaviour.
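
The idea can be sketched in Python as follows. This is only an illustration of keying a hash map on the descriptor set; the class and method names are hypothetical and do not come from the Vulkan runtime.

```python
from collections import defaultdict
from typing import DefaultDict, Hashable, List


class DeferredTokens:
    """Groups deferred tokens by descriptor set for constant-time lookup."""

    def __init__(self) -> None:
        self._by_set: DefaultDict[Hashable, List[str]] = defaultdict(list)

    def add(self, descriptor_set: Hashable, token: str) -> None:
        self._by_set[descriptor_set].append(token)

    def lookup(self, descriptor_set: Hashable) -> List[str]:
        # One hash lookup on average, instead of scanning every
        # deferred token and comparing its descriptor set.
        return list(self._by_set.get(descriptor_set, []))
```

With a flat list, each lookup scans all deferred tokens, so N lookups cost on the order of N squared comparisons; the map brings each lookup to expected constant time.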
Junru Shao [Tue, 17 Sep 2019 16:33:30 +0000 (09:33 -0700)]
More friendly error msg; Fix Android Demo LLVM ver (#3962)
Animesh Jain [Mon, 16 Sep 2019 21:52:28 +0000 (14:52 -0700)]
[TOPI] Setting up AutoTVM template for Intel Int8 conv2D (#3955)
Yuwei Hu [Mon, 16 Sep 2019 20:03:32 +0000 (13:03 -0700)]
[TOPI] Improve conv2d_transpose schedule on X86 and CUDA (#3948)
* improve conv2d_transpose x86 performance by reusing conv2d schedule
* parallelize across batches to make large-batch conv2d and conv2d_transpose faster
* improve doc for autotvm.task.space.FallbackConfigEntity.fallback_with_reference_log
* add fallback schedule for schedule_conv2d_transpose_nchw_cuda
* fix pylint
* fix pylint
* unify conv2d_transpose declaration in topi.nn and topi.x86
Yao Wang [Mon, 16 Sep 2019 18:22:00 +0000 (11:22 -0700)]
[Graph Tuner] Fix benchmark layout in graph tuner (#3926)
* Fix graph tuner benchmarking layout transform
* Add test
Zhi [Mon, 16 Sep 2019 18:07:40 +0000 (11:07 -0700)]
[tvm][codegen] Make buffer auto broadcast independent to the order of input args (#3956)
* [tvm][codegen] Make buffer auto broadcast independent to the order of the input arg
* fix indent
Neo Chien [Mon, 16 Sep 2019 17:42:47 +0000 (01:42 +0800)]
[TOPI] operator support: logical_and, logical_or, logical_not (#3929)
* [TOPI] operator support: logical_and, logical_or, logical_not
* [TOPI] operator support: logical_and, logical_or, logical_not
* [TOPI] fix test cases for operator support: logical_and, logical_or, logical_not
* [TOPI] fix test cases for operator support: logical_not
Animesh Jain [Mon, 16 Sep 2019 17:35:48 +0000 (10:35 -0700)]
[QNN] Legalization for Intel x86 QNN Conv2D (#3896)
* QNNLegalize for conv2d
* [QNN] Legalization for Intel x86 QNN Conv2D
Peter Yeh [Sun, 15 Sep 2019 21:52:31 +0000 (14:52 -0700)]
Enable miopen transpose convolution and fp16 support (#3952)
* Enable miopen transpose convolution and fp16 support
* linter
Jon Soifer [Sun, 15 Sep 2019 20:03:19 +0000 (13:03 -0700)]
[Relay][TensorFlow] Add support for SquaredDifference (#3930)
* Add support for SquaredDifference and StopGradient; minor fix in BatchMatMul
* Remove stopgradient change
* Resolve PR comment
* Dummy change to retrigger CI
* dummy change to retrigger CI
Cody Hao Yu [Sun, 15 Sep 2019 00:37:17 +0000 (17:37 -0700)]
[AutoTVM] Enhance tuning space of split (#3949)
* Refine policies for define_split
- Rename policy "all" to "factors"
- Add policy "verbose" and "power2"
* Refine search space
* add doc
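
A sketch of what the three policy names could mean for the candidate split sizes of a single axis. The commit message only names the policies, so the semantics here are an assumption: "factors" tries the divisors of the axis length, "power2" tries powers of two up to the length, and "verbose" tries the union of both. `split_candidates` is a hypothetical helper, not the AutoTVM API.

```python
import math
from typing import List


def factors(n: int) -> List[int]:
    """All positive divisors of n, in ascending order."""
    small = [i for i in range(1, math.isqrt(n) + 1) if n % i == 0]
    return sorted(set(small + [n // i for i in small]))


def powers_of_two(n: int) -> List[int]:
    """Powers of two that are less than or equal to n."""
    out, p = [], 1
    while p <= n:
        out.append(p)
        p *= 2
    return out


def split_candidates(extent: int, policy: str) -> List[int]:
    """Candidate inner sizes for splitting an axis of the given extent."""
    if policy == "factors":
        return factors(extent)
    if policy == "power2":
        return powers_of_two(extent)
    if policy == "verbose":
        return sorted(set(factors(extent)) | set(powers_of_two(extent)))
    raise ValueError(f"unknown policy: {policy}")
```

For an axis of length 12, "factors" yields 1, 2, 3, 4, 6, 12, "power2" yields 1, 2, 4, 8, and "verbose" adds 8 to the factor list.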
Junru Shao [Sat, 14 Sep 2019 19:45:37 +0000 (12:45 -0700)]
trivial (#3954)
Umang Yadav [Fri, 13 Sep 2019 20:42:36 +0000 (16:42 -0400)]
1) Add EQ op to the deduce_bound and add unittests for the same (#3775)
2) Add EQ support in the loop partition and add test for the same
3) Change typo truc to trunc
Andrew Tulloch [Fri, 13 Sep 2019 20:40:43 +0000 (13:40 -0700)]
Vulkan2 Runtime API (#3849)
Hua Jiang [Fri, 13 Sep 2019 20:33:55 +0000 (13:33 -0700)]
[VTA] RPC path update. (#3924)
Issue:
The RPC path was changed from "pynq_rpc" to "vta_rpc", but the related
documentation still uses the old information.
Solution:
Update RPC path information.
Jianyu Huang [Fri, 13 Sep 2019 20:27:40 +0000 (13:27 -0700)]
Add AVX512VNNI support for TVM (#3388)
Animesh Jain [Fri, 13 Sep 2019 18:38:14 +0000 (11:38 -0700)]
Refactoring x86 conv2d_NCHWc (#3944)
noituIover [Fri, 13 Sep 2019 01:04:52 +0000 (09:04 +0800)]
Fix CUDA int8x4 vectorize (#3928)
* Fix int8x4 vectorize
* Fix gpu shared/local memory accumulate
* Add test_shared_memory for int8x4
* Adjust test format
* Fix cpplint
shoubhik [Thu, 12 Sep 2019 23:34:20 +0000 (16:34 -0700)]
Do type checking for the input and kernel in the qnn conv2d (#3904)
* [QNN] Convolution 2D Implementation.
Rebasing. Empty commit.
Clang-format styling.
* Reformatting code.
* Fixing lint issues.
Jon Soifer [Thu, 12 Sep 2019 20:04:45 +0000 (13:04 -0700)]
[TOPI][CUDA] Support cuBLAS BatchMatMul (#3936)
* Support cuBLAS BatchMatMul
* Add test and check target name
Andrew Tulloch [Thu, 12 Sep 2019 19:32:01 +0000 (12:32 -0700)]
[RFC] [Contrib] Minimal runtime (~12kb .text on ARMv7/x86) for subset of TVM models (#3567)
This is an alternative implementation of a subset of the TVM runtime API (and
graph runtime) that focuses entirely on reducing code size, at the expense of
functionality (no tvm.extern(..) calls via PackedFunc, CPU only, etc). It might
be worth incrementally expanding the surface area if there's interest.
The motivation for this work was seeing what the minimal useful subset of the
TVM runtime is. This is relevant for e.g. super code-size constrained
applications in e.g. embedded/mobile. The current runtime is more like O(100KiB)
or so, so this might be compelling for some users.
The smaller surface area for auditing might make this relevant for
https://github.com/dmlc/tvm/issues/3159, or the usecases I was thinking about in
https://github.com/dmlc/tvm/issues/2523#issuecomment-459165815 re: the Rust
runtime.
The symbols in the tvm::minimalruntime space (i.e. excluding std:: and
picojson::) are about 5KiB, so I think there's a bunch of room here (i.e. we
could replace picojson:: with [`jsmn`](https://zserge.com/jsmn.html) or
something, and we could replace more of the `std::unordered_map` usage, etc.
with custom primitives as well, similar to the `DynArray`).
Jared Roesch [Thu, 12 Sep 2019 03:39:56 +0000 (22:39 -0500)]
[Relay][Module] Refactor the way we interface between different modules of Relay. (#3906)
* Module refactor
* Add load module
* Add support for idempotent import
* Tweak load paths
* Move path around
* Expose C++ import functions in Python
* Fix import
* Add doc string
* Fix
* Fix lint
* Fix lint
* Fix test failure
* Add type solver
* Fix lint
Lianmin Zheng [Wed, 11 Sep 2019 21:32:15 +0000 (14:32 -0700)]
[Community] Add reviewer Balint Cristian (#3935)
Yizhi Liu [Wed, 11 Sep 2019 18:10:48 +0000 (02:10 +0800)]
[Arm] parallel batch axis (#3931)
* support LLVM trunk
* guard with USE_LLVM in if condition for c++14
* GREATER_EQUAL -> GREATER
* [Arm] parallel batch axis
Zhao Wu [Wed, 11 Sep 2019 04:09:25 +0000 (12:09 +0800)]
[TFLite] Support depthwise convolution multiplier greater than 1 (#3922)
雾雨魔理沙 [Wed, 11 Sep 2019 03:30:46 +0000 (20:30 -0700)]
[Relay] fix exponential blowup in interpreter (#3559)
Neo Chien [Tue, 10 Sep 2019 17:41:16 +0000 (01:41 +0800)]
[Relay][Frontend][Keras] Fix ReLU in Keras Converter missed the case (#3917)
* [Relay][Frontend][Keras] Fix ReLU in Keras Converter missed the case
* [Relay][Frontend][Keras] Add test case for ReLU in Keras Converter missed the case
* [Relay][Frontend][Keras] Add test case for ReLU in Keras Converter missed the case
Pratyush Patel [Tue, 10 Sep 2019 00:43:01 +0000 (17:43 -0700)]
[CODEGEN] Remove incorrect check for LLVM in C codegen test (#3921)
雾雨魔理沙 [Mon, 9 Sep 2019 19:48:04 +0000 (12:48 -0700)]
[Relay][Training] Add gradient for max. (#3915)
* save
* save
Luis Vega [Mon, 9 Sep 2019 17:31:31 +0000 (10:31 -0700)]
[VTA][Config] hotfix denano10 (#3918)
Xingjian Shi [Mon, 9 Sep 2019 17:26:34 +0000 (10:26 -0700)]
Numpy compatible dtype inference for `tvm.convert` and `tvm.const` (#3861)
* numpy compatible type inference
* update
* try to fix
* fix
* try to fix
* fix lint
* Update nn.h
* cast to int32
* try to fix
* fix again
* retrigger ci
Haichen Shen [Mon, 9 Sep 2019 14:54:15 +0000 (07:54 -0700)]
[Relay/TOPI][Op] Add erf intrinsic and op (#3702)
* add more ops
* stop vectorization for erf
* x
* cleanup
* fix
* add whitelist for vectorizable intrin
* add tf converter
* fix dense
* fix
* add missing intrin
* fix mxnet frontend
* fix nvptx
雾雨魔理沙 [Sun, 8 Sep 2019 03:11:47 +0000 (20:11 -0700)]
[Relay][Training] Add gradient for cast (#3894)
save
fix
fix grad
雾雨魔理沙 [Sun, 8 Sep 2019 00:10:11 +0000 (17:10 -0700)]
change docker install script (#3524)
Haichen Shen [Sat, 7 Sep 2019 21:34:32 +0000 (14:34 -0700)]
[Fix] Fix blas cmake for mac os (#3898)
* fix cmake for mac os
* rename
Yizhi Liu [Sat, 7 Sep 2019 18:43:29 +0000 (02:43 +0800)]
Support LLVM trunk (#3907)
* support LLVM trunk
* guard with USE_LLVM in if condition for c++14
* GREATER_EQUAL -> GREATER
noituIover [Sat, 7 Sep 2019 16:44:39 +0000 (00:44 +0800)]
Fix a typo (#3913)
Peter Yeh [Sat, 7 Sep 2019 03:41:35 +0000 (20:41 -0700)]
Add .hsaco save/load for ROCm target (#3852)
fix lld
Haichen Shen [Sat, 7 Sep 2019 00:12:56 +0000 (17:12 -0700)]
add luis as reviewer (#3909)
Hua Jiang [Sat, 7 Sep 2019 00:03:51 +0000 (17:03 -0700)]
[VTA] Support TLPP in function simulator. (#3555)
* [VTA] Support TLPP in function simulator.
Issue:
Currently the VTA function simulator only executes instructions
serially, so the dependency logic of the runtime ISA, which is used for
task-level pipeline parallelism (TLPP), cannot be verified by the
function simulator.
Solution:
Make the simulator driver multi-threaded and add TLPP support.
Benefit:
TLPP support in the VTA function simulator makes VTA logic testing,
debugging, and changes easier.
replace boost lockfree queue
add configure control for simulator tlpp enable or disable.
change code style into google style.
Wrap queue read/write and sync logic to make function call more simple.
Add some comments.
Remove MT logic, change into Single thread mode.
address review comments.
code style change to match google code style and add comments.
add cmake macro to enable/disable simulator tlpp logic.
submodule update.
correct file name mentioned in comments.
* remove USE_VTA_FSIM_TLPP.
Leyuan Wang [Sat, 7 Sep 2019 00:01:29 +0000 (17:01 -0700)]
[TOPI] Intel graphics conv2d autotvm template added (#3839)
* update lint
* lint fixed
* lint updated
* lint fixed
* lint fixed
* lint fixed
* updates
* add intel graphics as a package
* remove print info
* depthwise conv2d schedule added for intel graphics
* asdf
* fix lint
* fix lint
* fix ci
* add channels
雾雨魔理沙 [Fri, 6 Sep 2019 22:17:37 +0000 (15:17 -0700)]
save (#3901)
雾雨魔理沙 [Fri, 6 Sep 2019 18:51:27 +0000 (11:51 -0700)]
[Relay][Op] Make Type Relation catch more errors (#3899)
* save
* init
* move type_relations
Logan Weber [Fri, 6 Sep 2019 18:04:34 +0000 (11:04 -0700)]
[Relay] Add ADTs to text format (#3863)
* Getting closer to having ADT defs
* ADT defs working probably
* Match parsing basically done
* came to earth in a silver chrome UFO
* match finished?
* All tests but newest are passing
* ADT constructors work
now cleanup?
* Cleanup round 1
* Cleanup round 2
* Cleanup round 3
* Cleanup round 4
* Cleanup round 6
* Cleanup round 7
* Lil grammar fix
* Remove ANTLR Java files
* Lint roller
* Lint roller
* Address feedback
* Test completeness in match test
* Remove unused imports
* Lint roller
* Switch to Rust-style ADT syntax
* Lil fix
* Add dummy `extern type` handler
* Add type arg to test
* Update prelude semantic version
* Repair test
* Fix graph var handling in match
* Revert 's/graph_equal/is_unifiable' change
Yong Wu [Fri, 6 Sep 2019 15:30:04 +0000 (08:30 -0700)]
[bugfix] remove duplicate resize (#3902)
Jason Knight [Fri, 6 Sep 2019 13:30:13 +0000 (06:30 -0700)]
Add another MKL name alias for MKL (#3853)
Installed through pypi
Yizhi Liu [Fri, 6 Sep 2019 13:29:31 +0000 (21:29 +0800)]
[schedule] Improve ceil_divide in tile/split (#3842)
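
For context, a minimal sketch of the ceiling division that a tile or split needs in order to compute the outer loop extent. This shows the common integer formula, not the expression the commit introduces; the function names are hypothetical.

```python
from typing import Tuple


def ceil_div(a: int, b: int) -> int:
    """Smallest integer q such that q * b >= a, for a >= 0 and b > 0."""
    if a < 0 or b <= 0:
        raise ValueError("expected a >= 0 and b > 0")
    return (a + b - 1) // b


def split_extents(extent: int, factor: int) -> Tuple[int, int]:
    """Outer and inner loop extents when splitting an axis by `factor`."""
    return ceil_div(extent, factor), factor
```

Splitting an axis of extent 10 by a factor of 4 gives an outer extent of 3, so the last outer iteration is only partially filled and needs a bound check.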
Jon Soifer [Thu, 5 Sep 2019 23:42:29 +0000 (16:42 -0700)]
[PYTHON/FFI] Search PATH for DLLs (#3888)
* Search PATH for DLLs
* Fix lint issue
雾雨魔理沙 [Thu, 5 Sep 2019 23:41:44 +0000 (16:41 -0700)]
[Relay] add Tuple pattern (#3596)
* implement tuple pattern
* add tuple pattern
* lint;
* lint
* lint
* fix error
* fix
* add test
kice [Thu, 5 Sep 2019 23:21:54 +0000 (19:21 -0400)]
Fix int32 range overflow by using int64 (#3870)
雾雨魔理沙 [Thu, 5 Sep 2019 21:39:13 +0000 (14:39 -0700)]
[Relay] Fix operator fusion for multiple output (#3871)
* save
* add test
* refactor
* fix indent
* save
* refactor
Haibin Lin [Thu, 5 Sep 2019 18:48:57 +0000 (11:48 -0700)]
[DOC] Fix doc rendering (#3897)
* Update from_source.rst
* Update deploy_ssd_gluoncv.py
黎明灰烬 [Thu, 5 Sep 2019 18:32:21 +0000 (02:32 +0800)]
[Test] enable NHWC of `relay.testing.mobilenet` (#3886)
* [Relay] enable NHWC of `relay.testing.mobilenet`
In this way, we can play around NHWC inside TVM regardless of
the frontends.
* [Test] test for NHWC of relay.testing.mobilenet
Thierry Moreau [Thu, 5 Sep 2019 18:29:42 +0000 (11:29 -0700)]
[VTA][TOPI] Conv2d transpose (deconvolution) operator support (#3777)
* initial conv2d_transpose
* correct select operator
* cleanup
* fix
* fix correctness check
* conv2d transpose declaration fix
* autotvm conv2d_transpose tuning script
* ir pass fix
* fix tuning script
* deriving params from env, adding bias
* removing bias comp from deconvolution
* lint
* fix
* lint
* lint
* turning off cpu
* lint, ops
* lint
* import fix
* removing hard coded values
* lint
Thierry Moreau [Thu, 5 Sep 2019 18:17:09 +0000 (11:17 -0700)]
[VTA][Relay] Extending Vision model coverage compilation for VTA (#3740)
* adding support for graphpack over multiply op
* increasing resnet model coverage
* fix indentation
* lint
* moving recursion limit fix into graphpack pass
* moving recursionlimit to relay init
* pooling on NCHWnc format
* adding more models
* deploy_resnet_on_vta.py
* trailing line
* generalizing to vision models
* merge conflicts
* fix, apply quantization to VTA only
* improving comments
* trimming models that have runtime issues for the moment
* lint
* lint
* lint
雾雨魔理沙 [Thu, 5 Sep 2019 18:13:07 +0000 (11:13 -0700)]
[Relay][Training] Small refactoring (#3893)
* init
* fix
Animesh Jain [Thu, 5 Sep 2019 17:22:45 +0000 (10:22 -0700)]
[QNN] Add - Refactoring to C++ (#3736)
Liangfu Chen [Thu, 5 Sep 2019 16:52:10 +0000 (00:52 +0800)]
[VTA] de10-nano driver (#3394)
* rework;
* `de10-nano` -> `de10nano`;
* fix compilation error;
* bug fix;
* Update install.md
* Update install.md
* Update install.md
* update with current runtime;
* add debug messages;
* bug fix in cma kernel module;
miheer vaidya [Thu, 5 Sep 2019 04:24:00 +0000 (22:24 -0600)]
Reveal hidden code snippets by inserting newline (#3892)
Luis Vega [Wed, 4 Sep 2019 17:36:21 +0000 (10:36 -0700)]
[VTA][Chisel] add ISA BitPat generation (#3891)
Animesh Jain [Wed, 4 Sep 2019 17:05:22 +0000 (11:05 -0600)]
[QNN] Convolution 2D Implementation. (#3580)
Rebasing. Empty commit.
Clang-format styling.
lixiaoquan [Wed, 4 Sep 2019 16:57:20 +0000 (00:57 +0800)]
[TENSORFLOW] Convert scalar Const into tvm.relay.const (#3885)
* [TENSORFLOW] Convert scalar Const into tvm.relay.const
* use _get_num_param() and _get_list_param()
SWu [Wed, 4 Sep 2019 07:07:39 +0000 (03:07 -0400)]
[Relay] Add grads (#3857)
* Add gradient implementations
* Add docstrings to fix lint errors
youluexx [Wed, 4 Sep 2019 05:46:29 +0000 (13:46 +0800)]
[Relay][Frontend][darknet] Solve tvm parsing darknet resnext failure bug (#3778)
* test_darkent_bug
* test_darkent
* add resnext tests
Luis Vega [Wed, 4 Sep 2019 05:19:01 +0000 (22:19 -0700)]
[VTA][Chisel] add scalafmt and format existing scala codebase (#3880)
* [VTA][Chisel] add scalafmt and format existing scala codebase
* change column width to 100
* add scalafmt conf file as a valid file type
* add asf header to scalafmt conf file and rerun formatter
Liangfu Chen [Tue, 3 Sep 2019 16:31:32 +0000 (00:31 +0800)]
[VTA] Fix TSIM compile error in Linux (add missing -fPIC flag) (#3876)
* [VTA] Fix TSIM compile error in Linux (add missing -fPIC flag);
* [VTA] Fix TSIM compile error in Linux (add missing -fPIC flag);
* fix indentation problem;
Tianqi Chen [Tue, 3 Sep 2019 07:31:04 +0000 (15:31 +0800)]
Revert "[Runtime] Allow parameter sharing between modules (#3489)" (#3884)
This reverts commit 224cc243b4e54a77d011644fe7d81bdee8e8116b.
Neo Chien [Tue, 3 Sep 2019 04:02:52 +0000 (12:02 +0800)]
ONNX frontend operator support: And (#3878)
Yong Sun [Tue, 3 Sep 2019 03:53:42 +0000 (20:53 -0700)]
[Runtime] Allow parameter sharing between modules (#3489)
As GraphRuntime does not provide control-flow logic, we have to split
our model into two parts. We also need to share parameters between them
to reduce memory usage.
Solution:
1) add "lazy_init_input" in graph's attributes
   "attrs": {
     ... ...
     "lazy_init_input": [
       "list_str",
       [
         "p0"
       ]
     ]
   }
2) allow un-allocated NDArray entry in SetupStorage
3) utilize "set_input_zero_copy" function to set parameters
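
Step 1 above could be applied to a graph JSON string roughly as follows. This is a sketch using only the standard `json` module and the attribute layout shown in the commit message; `mark_lazy_inputs` is a hypothetical helper, not part of TVM.

```python
import json
from typing import Iterable


def mark_lazy_inputs(graph_json: str, names: Iterable[str]) -> str:
    """Return graph JSON with a "lazy_init_input" entry added to "attrs"."""
    graph = json.loads(graph_json)
    attrs = graph.setdefault("attrs", {})
    # Same layout as in the commit message: a type tag followed by the list.
    attrs["lazy_init_input"] = ["list_str", list(names)]
    return json.dumps(graph)
```

Calling `mark_lazy_inputs(graph_json, ["p0"])` leaves the other attributes untouched and adds the entry for parameter "p0".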
kice [Tue, 3 Sep 2019 02:35:14 +0000 (22:35 -0400)]
Remove extern C wrapper for cuBLAS (#3877)
Luis Vega [Mon, 2 Sep 2019 18:08:53 +0000 (11:08 -0700)]
[VTA][Chisel] rename USE_TSIM macro with USE_VTA64 and cleanup runtime (#3872)
雾雨魔理沙 [Mon, 2 Sep 2019 15:22:11 +0000 (08:22 -0700)]
[Relay] [Parser] fix parser for cast. (#3873)
* fix
* lint
Logan Weber [Mon, 2 Sep 2019 07:32:52 +0000 (00:32 -0700)]
[WIP][µTVM] Add OpenOCD Low-Level Device (RISC-V Support) (#3756)
Animesh Jain [Mon, 2 Sep 2019 02:58:38 +0000 (20:58 -0600)]
[QNN] Requantize - Optimize lowering for some corner cases. (#3864)
Neo Chien [Sun, 1 Sep 2019 01:56:39 +0000 (09:56 +0800)]
Implementation of tile for TFLite (#3814)
Haichen Shen [Sun, 1 Sep 2019 01:50:22 +0000 (18:50 -0700)]
[Relay][Any] Add shape func for dynamic shape (#3606)
* init shape func in interpreter and vm compiler
* Update interpreter
* fix
* lint
* lint
* fix
* remove hack
* update
* fix
* fix
* update
* address comments & update for shape_of
* fix lint
* update
* fix hybrid
* lint
* fix bug & add take shape func
* lint
* lint
* update
* fix flaky test
* add todo
Alexey Golunov [Sun, 1 Sep 2019 00:52:42 +0000 (03:52 +0300)]
[NNVM][FRONTEND][ONNX] Fix PReLU conversion (#3813)
Josh Fromm [Sun, 1 Sep 2019 00:51:51 +0000 (17:51 -0700)]
[Relay] Bitserial ops (#3844)
* Added arm_cpu NHWC schedules.
* Fixed kernel shape legalization.
* Added bitserial ops to relay.
* Snapshot and more missing files.
* Added dense testing.
* Added tests
* Added ASF header to new files.
* cc lint
* Pylint change.
* pylint fixes.
* Change arm legalize test.
* Added assert check to arm legalize.
* Added better documentation, fixed some bad style
* Reverted arm conv2d nhwc changes.
Neo Chien [Sun, 1 Sep 2019 00:50:36 +0000 (08:50 +0800)]
Add not operator for the frontend/onnx.py (#3836)
Luis Vega [Sun, 1 Sep 2019 00:49:17 +0000 (17:49 -0700)]
[VTA][TSIM] add virtual memory support to tsim example (#3868)
* [VTA][TSIM] add virtual memory support to tsim example
* fix indentation
* remove USE_TSIM macro and use 32-bit addr instead
SWu [Sat, 31 Aug 2019 06:25:16 +0000 (02:25 -0400)]
Improve numerical gradient check (#3856)
Alexander Pivovarov [Sat, 31 Aug 2019 04:30:59 +0000 (21:30 -0700)]
Add more cases to keras _convert_reshape (#3846)
Animesh Jain [Sat, 31 Aug 2019 04:30:18 +0000 (21:30 -0700)]
[QNN] Concat - Refactoring to C++ (#3819)
Animesh Jain [Fri, 30 Aug 2019 16:12:03 +0000 (09:12 -0700)]
[Relay][QNN] Moving Conv, Dense, Concatenate InferTypes to header for sharing. (#3783)
Animesh Jain [Fri, 30 Aug 2019 16:10:25 +0000 (09:10 -0700)]
[Relay][QNN] QNNtoRelay & QNNLegalize Pass utility using Relay Legalize API. (#3838)
Andrew Tulloch [Fri, 30 Aug 2019 00:25:07 +0000 (17:25 -0700)]
codegen_spirv support Call::reinterpret (#3795)
Hua Jiang [Thu, 29 Aug 2019 21:26:39 +0000 (14:26 -0700)]
[VTA] Infinite recursive device_api.ext_dev call fix. (#3843)
Issue:
When running VTA on an FPGA board, an infinitely recursive
device_api.ext_dev call causes a stack overflow and VTA fails.
Analysis:
The device_api.ext_dev function in rpc_server.py is used to load the
VTA library. Once the VTA library is loaded, it replaces
device_api.ext_dev with the VTA function; VTA's device_api.cc does
this work. Because of a logic issue in VTA.cmake, that file was not
compiled, so VTA kept failing in rpc_server.py.
Solution:
Fix the logic issue in VTA.cmake.