review.tizen.org Git - platform/upstream/tvm.git/log

Animesh Jain [Wed, 13 Nov 2019 19:18:49 +0000 (11:18 -0800)]

[QNN][Legalize] Specialize for Platforms without any fast Int8 arithmetic units. (#4307)

Zhao Wu [Wed, 13 Nov 2019 06:11:38 +0000 (14:11 +0800)]

[TOPI][OP] Support Faster-RCNN Proposal OP on CPU (#4297)

* Support Proposal operator on CPU.

* PyLint space issue

* PyLint space issue

* Pylint singleton-comparison issue

Eric Platon [Tue, 12 Nov 2019 23:52:24 +0000 (00:52 +0100)]

Fix the TF tutorial to run against TF2.0 and TF1.x (#4104)

* WIP Run the TF tutorial on TF2

* Remove debugger statement.

* Complete the support for TF2.0's `resize`.

TF2.0 adds a `half_pixel_centers` attribute to the `resize` function in
the image API. This commit completes the hooks in Relay's TF frontend.

As of this commit, there is no new test yet. Also, this commit
addresses only the `resize` change. Other commits address other
changes in TF2.0.

* Support TF2.0 in the tutorial by using the compat API.

This looks cleaner than trying to detect the TF version.

* Use the TF compat API, so as to support TF2.0.

This is a direct change, relying on the compat API provided by the TF
team.

This code will work as long as the compat API exists, so
"proper" support for TF1.x and 2.x will require more work in the
future.

* Partial support for EXPLICIT padding introduced in TF2.0.

Explicit padding is a special case in TF2.0 (see reference linked
below). Some models are serialized with that mode, and break TF support
in TVM.

Support is *partial* as EXPLICIT falls back to setting padding on the
Relay op, which only supports 2 values. At some point, padding may need
to be extended to support 4 values, but that is out of scope for this
commit.

Reference on EXPLICIT padding: https://github.com/tensorflow/tensorflow/commit/ec81825aaf7e848d9f8ddffdf1e0d20aebe9172c#diff-1d1c0bb0a880f85b6164f71dbb2f446e

* Guard on checking for optional TF2.0 attribute.

* Do not expect Relay to implement TF-specific attributes.

The `half_pixel_centers` attribute is a new feature in TF2.0. Earlier
commits of mine mistakenly introduced it in the Relay API. This is
probably not what Relay is expected to support, and the semantics of
`half_pixel_centers` are unclear (to me, at least) at this point.

* Remove unclear comment.

CR https://github.com/dmlc/tvm/pull/4104#discussion_r338705742

Addresses #4104

* Changes after review.

Complying without understanding the rationale for now.

* Fix the arguments set mistakenly.

An argument was ignored for the wrong operation.
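
The partial EXPLICIT padding support described in this commit can be pictured with a small sketch. It assumes the TF `explicit_paddings` attribute is a flat list of eight integers for an NHWC tensor (a before/after pair for each of the four dimensions). The helper name `explicit_to_symmetric_padding` is hypothetical, and this is not the code in Relay's TF frontend; it only illustrates why a 2-value padding cannot express every 4-value case.

```python
def explicit_to_symmetric_padding(explicit_paddings):
    """Reduce NHWC explicit paddings to a 2-value (pad_h, pad_w) pair.

    Hypothetical helper for illustration only. It raises when the padding
    is asymmetric, because a 2-value padding cannot express different
    before/after amounts.
    """
    if len(explicit_paddings) != 8:
        raise ValueError("expected 8 values for a 4-D NHWC tensor")
    # Pairs are (before, after) for N, H, W, C in that order (assumption).
    pairs = [tuple(explicit_paddings[i:i + 2]) for i in range(0, 8, 2)]
    n_pad, h_pad, w_pad, c_pad = pairs
    if n_pad != (0, 0) or c_pad != (0, 0):
        raise ValueError("padding on batch or channel is not supported")
    if h_pad[0] != h_pad[1] or w_pad[0] != w_pad[1]:
        raise NotImplementedError("asymmetric padding needs 4 values")
    return (h_pad[0], w_pad[0])
```

A symmetric input such as `[0, 0, 1, 1, 2, 2, 0, 0]` reduces cleanly, while `[0, 0, 0, 1, 0, 1, 0, 0]` is rejected, which is the case that would need 4-value padding.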

Wei Chen [Tue, 12 Nov 2019 20:36:28 +0000 (12:36 -0800)]

[Relay][Op][TF] Complete tensor array unstack with all ranks support (#4309)

Ina Dobreva [Tue, 12 Nov 2019 20:23:04 +0000 (20:23 +0000)]

Add test for the qnn_add operator (#4282)

* Add test for the qnn_add operator

The tests use the fake-quant approach, so the tensors remain in float32 up to the TF session.
The test data has to be passed in uint8 because of how the tflite/tvm comparison works.
An absolute tolerance of up to 1 is allowed for the qnn results. For now, input_stats are hardcoded,
assuming the tests for the other qnn ops will pass the input data in the same range.

* Separate qnn uint8 test function from the fp32 elemwise tests

Isolate qnn uint8 elemwise tests
Remove blank lines
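
The tolerance rule in this commit (an absolute difference of up to 1 between quantized results) can be sketched as follows. The affine quantization formula, the scale and zero point, and both function names are assumptions chosen for illustration; they are not taken from the TVM test suite.

```python
def quantize_uint8(values, scale, zero_point):
    """Affine-quantize floats to uint8: q = round(x / scale) + zero_point."""
    out = []
    for x in values:
        q = int(round(x / scale)) + zero_point
        out.append(max(0, min(255, q)))  # clamp to the uint8 range
    return out


def allclose_abs(actual, expected, atol=1):
    """True when both sequences match element-wise within atol."""
    return len(actual) == len(expected) and all(
        abs(a - e) <= atol for a, e in zip(actual, expected)
    )
```

Two uint8 outputs that differ by a single quantization step then compare as equal, which absorbs rounding differences between the two runtimes.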

Haichen Shen [Tue, 12 Nov 2019 19:54:56 +0000 (11:54 -0800)]

add (#4311)

Xingyu Zhou [Tue, 12 Nov 2019 18:30:04 +0000 (10:30 -0800)]

[Relay][Frontend][Keras] batch_norm op params not handling well (#4310)

* Relay Keras frontend batch_norm op params not handled well

* add unit test for Relay Frontend Keras batch_norm

jmorrill [Tue, 12 Nov 2019 16:50:06 +0000 (08:50 -0800)]

Fix incorrect call to Unicode Win32 InetPton (#4306)

* Fix incorrect call to Unicode Win32

* Removed inet_pton call. Win32 already has it.

Neo Chien [Tue, 12 Nov 2019 04:34:50 +0000 (12:34 +0800)]

[Relay][Frontend][Tensorflow] Fix type assignment for operator 'tf.range' (#4294)

Yao Wang [Mon, 11 Nov 2019 23:46:29 +0000 (15:46 -0800)]

Add More Shape Functions (#4179)

* Add shape functions

* Fix get_const_tuple

* Fix cpplint

* Fix pylint

* Fix pylint

* rebase and fix

* Check Any for infer type

* Fix expand_dim shape func for zero rank input

* Fix pooling infer type

* Address comment

* Register layout transform attr

Wei Chen [Mon, 11 Nov 2019 19:22:14 +0000 (11:22 -0800)]

[TF][Relay][Op] Pass module when infer shape (#4287)

* [TF][Relay][Op] Pass module when infer shape

* Fix lint

* Improve style

* Add test

Tianqi Chen [Mon, 11 Nov 2019 18:09:29 +0000 (10:09 -0800)]

[RUNTIME][REFACTOR] Use object protocol to support runtime::Module (#4289)

Previously runtime::Module was supported using shared_ptr.
This PR refactors the codebase to use the Object protocol.

It will open the door to easier interoperation between
Object containers and modules in the future.

Yong Wu [Mon, 11 Nov 2019 17:24:52 +0000 (09:24 -0800)]

[TF][TEST] add test_forward_reduce_any back (#4301)

the test case was removed in #4181 for some reason
@tqchen @soiferj @zhiics

Yao Wang [Mon, 11 Nov 2019 16:23:23 +0000 (08:23 -0800)]

Fix tf reshape (#4285)

* Fix tf reshape

* Fix test

* Fix pylint

* Fix pylint

Zhi [Mon, 11 Nov 2019 05:57:43 +0000 (21:57 -0800)]

[tutorial] Relay pass infra tutorial (#4083)

* Add pass manager tutorial

* fix some examples

* retrigger ci

* Update tutorials/dev/relay_pass_infra.py

Co-Authored-By: 雾雨魔理沙 <lolisa@marisa.moe>

* Add ToANormalForm link

Animesh Jain [Mon, 11 Nov 2019 03:09:16 +0000 (19:09 -0800)]

[TOPI][AlterOpLayout][ARM] Enabling NHWC to NCHW layout transformation. (#4249)

Zhao Wu [Sun, 10 Nov 2019 22:56:44 +0000 (06:56 +0800)]

[RUNTIME] Support C++ RPC (#4281)

Zhao Wu [Sun, 10 Nov 2019 19:45:10 +0000 (03:45 +0800)]

[TFLite] Support PRelu (#4298)

Wei Chen [Sun, 10 Nov 2019 18:31:20 +0000 (10:31 -0800)]

[Test][TF][Relay] Fix argument preparation for vm test mode (#4296)

Yizhi Liu [Sun, 10 Nov 2019 06:16:34 +0000 (22:16 -0800)]

[Codegen][cuda-fp16] fallback to fp32 simulation when cuda arch < sm53 (#4268)

Yizhi Liu [Sun, 10 Nov 2019 02:20:33 +0000 (18:20 -0800)]

Rename ml.dmlc.tvm to org.apache.tvm (#4290)

Minmin Sun (孙敏敏) [Sat, 9 Nov 2019 21:01:36 +0000 (05:01 +0800)]

Auto TensorCore CodeGen (#4234)

* Add Auto TensorCore Unit Test

* Rebase to tvm master branch & Add auto tensor core

* Code Refine

* Add tensor core switch by pragma

* Add pragma in tensor core example code

* Get real tile size to replace hard coded 16

* support more than 2 dimensions (e.g. batchmatmul) for buffer bind scope

* support batch matmul

* Move cuda env check to tensor_core.cc

* Code refine for tensor_core.cc

* Refine comments

* Some refinements of code and comment

* Update TensorCore UT to pass the CPU test

* remove redundant code

* matmul's storage align for different layout

* Add support for different positions of type cast

* Add formal tutorial for auto tensorcore codegen

* move tensorcore check up to tutorial code

* code and doc refine

* comment out tune_and_evaluate in tutorial

* fix cpplint error

peike [Fri, 8 Nov 2019 08:23:15 +0000 (19:23 +1100)]

Update tvm_runtime.h (#4278)

fix the problem that android_rpc compilation failed

Cody Hao Yu [Fri, 8 Nov 2019 05:44:35 +0000 (21:44 -0800)]

[TOPI][CUDA] Fix Winograd Kernel Size Support (#4276)

* fix_winograd_cuda_kernel_size

* add unit test

Jon Soifer [Thu, 7 Nov 2019 22:10:30 +0000 (14:10 -0800)]

[Relay][Frontend][ONNX] Add support for broadcasting to Where and MatMul (#4267)

Josh Fromm [Thu, 7 Nov 2019 00:07:09 +0000 (16:07 -0800)]

[AutoTVM] Add batch_matmul to tunable operations (#4242)

* Batch matmul tuning running but with errors.

* Default x86 schedule as good as before.

* Code Cleanup

* Remove unused argument.

* improved template documentation.

* Silly lint fix

* Removed leftover comment.

* Moved cfg declaration to schedule for batch_matmul

* Moved x86 dense cfg declaration to schedule.

* lint fix

* Removed duplicate cfg declaration in dense.

* Reverted changes to dense.

Cody Hao Yu [Wed, 6 Nov 2019 23:02:54 +0000 (15:02 -0800)]

[TOPI] Fix bug in Winograd on CUDA (#4260)

* fix winograd

* move get padding after kernel transform

Neo Chien [Wed, 6 Nov 2019 18:39:40 +0000 (02:39 +0800)]

[Contrib] Fix error message at callback_get_section_size() (#4221)

* [Contrib] Fix error message at callback_get_section_size()

* Trigger notification

Liangfu Chen [Wed, 6 Nov 2019 17:19:22 +0000 (01:19 +0800)]

[VTA] Hotfix for padded load test in Chisel VTA (#4264)

* Update TensorUtil.scala

* Update test_vta_insn.py

Tianqi Chen [Wed, 6 Nov 2019 00:03:04 +0000 (16:03 -0800)]

[DOCS] Update link loc (#4257)

zhuochen [Tue, 5 Nov 2019 17:51:36 +0000 (01:51 +0800)]

workaround typing.Deque import error for Python 3.5 (#4254)
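
For context, `typing.Deque` was only added in Python 3.5.4 (and 3.6.1), so earlier 3.5 releases fail at import time. One possible shape of such a workaround is a guarded import. The fallback alias below is a hypothetical choice for annotation purposes only and is not necessarily what this commit does.

```python
import collections

try:
    # Available from Python 3.5.4 / 3.6.1 onwards.
    from typing import Deque
except ImportError:
    # Hypothetical fallback for older 3.5 releases: a looser alias that
    # is only good enough for type annotations.
    from typing import MutableSequence as Deque


def make_queue(items):
    # type: (list) -> Deque
    """Build a deque; the annotation works with either import above."""
    return collections.deque(items)
```

The runtime object is always a `collections.deque`; only the name used in annotations changes with the interpreter version.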

Thomas Viehmann [Tue, 5 Nov 2019 10:25:18 +0000 (11:25 +0100)]

Require LLVM >= 9 for AMDGPU backend (#4253)

LLVM 8 will crash when loading the bitcodes

This is a runtime check as the file will be compiled in even when
USE_ROCM OFF is used in the configuration if ROCM is installed
in the default location.

Fixes: #4087

Tianqi Chen [Mon, 4 Nov 2019 22:03:33 +0000 (14:03 -0800)]

CI trigger after repo move (#4252)

Trevor Morris [Mon, 4 Nov 2019 18:37:41 +0000 (10:37 -0800)]

[Relay][Frontend][Tensorflow] Fix GatherV2, Add StopGradient (#4238)

* Add StopGradient. Add batch_dims attr to ignore list for GatherV2

* Trigger CI

Kim [Mon, 4 Nov 2019 16:04:02 +0000 (00:04 +0800)]

remove PEP498 f-string new feature for support python3.5 (#4250)

XFPlus [Mon, 4 Nov 2019 16:03:42 +0000 (00:03 +0800)]

Fix typo in err msg (#4251)

Hua Jiang [Sat, 2 Nov 2019 03:29:54 +0000 (20:29 -0700)]

[VTA] Performance optimize, remove unnecessary contiguous memory use. (#4246)

* [VTA] Performance optimize, remove unnecessary contiguous memory use.

Issue:
Uop maintains a cache vector to copy uop data into contiguous DRAM memory for
FPGA/Simulator use, but this cache vector is not cleared after the FPGA/Simulator
core runs. In the Resnet18 case, printing the cache size in UopQueue::ReadBarrier
shows that the cache size keeps increasing, which causes
useless data copies and unnecessary contiguous DRAM memory allocation.

Analysis:
This issue is caused by not clearing the cache_ vector when calling
uop_queue_.Reset().

Solution:
Override the BaseQueue Reset function in UopQueue and add cache_ clearing
logic.

* address review comments, remove spacing.
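
The fix described in this commit can be modelled in a few lines. This is a Python stand-in for the C++ classes, written only to show the pattern; the method names `push` and `reset` and the `dram_buffer` attribute are assumptions, and only `BaseQueue`, `UopQueue` and the cache come from the commit message.

```python
class BaseQueue:
    """Simplified model of the base queue: owns a DRAM staging buffer."""

    def __init__(self):
        self.dram_buffer = []

    def reset(self):
        self.dram_buffer.clear()


class UopQueue(BaseQueue):
    """Model of the fix: reset() also clears the uop cache."""

    def __init__(self):
        super().__init__()
        self.cache = []  # stands in for the C++ cache_ vector

    def push(self, kernel):
        self.cache.append(kernel)
        self.dram_buffer.append(kernel)

    def reset(self):
        # Without this override the cache keeps growing across runs.
        super().reset()
        self.cache.clear()
```

After each `reset()` the cache starts empty, so its size tracks only the uops of the current run instead of accumulating across runs.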

Yao Wang [Sat, 2 Nov 2019 03:10:21 +0000 (20:10 -0700)]

Support reshape for dynamic shape in tf converter (#4185)

* Support reshape for dynamic shape in tf converter

* Only allow reshape directly after shape function for symbolic input shape

* Fix lint

Tianqi Chen [Fri, 1 Nov 2019 23:34:42 +0000 (16:34 -0700)]

[NODE][REFACTOR] Rename IRFunctor->NodeFunctor, use func pointer (#4247)

* [NODE][REFACTOR] Rename IRFunctor->NodeFunctor, use function pointer for dispatching.

Previously we used std::function for the functor dispatching.
It introduces additional overhead and problems during DLL destruction (of std::function).

This PR changes the std::function to function pointers.
This change adds some restrictions around set_dispatch that we can work around,
but will improve the general efficiency by removing one level of indirection in the std::function.
We also no longer need special macros to register functions to the Functor.

Jared Roesch [Fri, 1 Nov 2019 21:28:23 +0000 (16:28 -0500)]

Implement explicit IR representation of memory allocation (#3560)

Wei Chen [Fri, 1 Nov 2019 20:37:58 +0000 (13:37 -0700)]

[Relay][Prelude] Add more dtypes to tensor_t (#4233)

Wuwei Lin [Fri, 1 Nov 2019 17:36:36 +0000 (13:36 -0400)]

[Relay][Pass] Avoid FoldConstant folding some ops (#4245)

* [Relay][Pass] Avoid FoldConstant folding some ops

* rename

Kim [Fri, 1 Nov 2019 15:54:33 +0000 (23:54 +0800)]

[ Relay ][ Frontend ][ Tensorflow ]add op add_n to relay/frontend/tensorflow.py (#4181)

Sergei Grechanik [Fri, 1 Nov 2019 15:51:43 +0000 (18:51 +0300)]

[ARITH] Fix lowering of FloorMod (#4236)
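
The log does not say what was wrong with the lowering, so the following is only background: a sketch of how a floor-style modulo can be derived from a truncating one (the behaviour of C's `%`). Both function names are hypothetical and this is not the TVM lowering code.

```python
def truncmod(a, b):
    """Remainder that takes the sign of the dividend, as C's % does."""
    q = abs(a) // abs(b)
    if (a < 0) != (b < 0):
        q = -q  # truncating division rounds the quotient toward zero
    return a - q * b


def floormod_via_truncmod(a, b):
    """Floor-style modulo: the result takes the sign of the divisor."""
    r = truncmod(a, b)
    # Adjust only when the remainder is non-zero and its sign differs
    # from the divisor's sign.
    if r != 0 and (r < 0) != (b < 0):
        r += b
    return r
```

Python's own `%` on integers is already a floor modulo, which makes it a convenient reference when checking the adjustment step.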

autumnqin [Fri, 1 Nov 2019 14:53:47 +0000 (22:53 +0800)]

Fix the problem that android_rpc compilation failed. (#4244)

Signed-off-by: qinqiuping <autumnqin@126.com>

Tianqi Chen [Thu, 31 Oct 2019 18:13:48 +0000 (11:13 -0700)]

[BUILD] Disable utvm standalone runtime by default (#4240)

Tianqi Chen [Thu, 31 Oct 2019 18:13:32 +0000 (11:13 -0700)]

[CUDA] Fix fp16 intrin, disable bad fp16 vecadd test for now (#4239)

Tianqi Chen [Thu, 31 Oct 2019 18:11:46 +0000 (11:11 -0700)]

[CI] Update GPU docker to cuda10 (#4228)

* [CI] Update the ci-gpu to use cuda10

* [CI] Enforce tensorcore gpu for unittest

KoolKoffee [Thu, 31 Oct 2019 16:15:57 +0000 (16:15 +0000)]

Fix typo in get_output doc-string (#4237)

Tianqi Chen [Thu, 31 Oct 2019 05:24:52 +0000 (22:24 -0700)]

[CI] Move gpu docker binary to cuda10 (#4229)

* [CI] Move gpu docker binary to cuda10

* Fix the gcn tutorial

Wei Chen [Thu, 31 Oct 2019 02:52:12 +0000 (19:52 -0700)]

[Doc] Update ANTLR instruction (#4231)

* [Doc] Update ANTLR instruction

* Update docs/install/from_source.rst

Wei Chen [Wed, 30 Oct 2019 22:54:56 +0000 (15:54 -0700)]

[Relay] Install Relay Prelude program in package install (#4227)

Tianqi Chen [Wed, 30 Oct 2019 22:33:10 +0000 (15:33 -0700)]

[CI] use llvm9 for the gpu tests (#4224)

* [CI] use llvm9 for the gpu tests

* Update Docker script to support new nvidia docker

Jon Soifer [Wed, 30 Oct 2019 18:43:09 +0000 (11:43 -0700)]

[Relay][Topi][TensorFlow][ONNX][Lang] Add support for Any op (#4205)

* Add support for Any op

* Support ONNX frontend

* Add doc

* Add to relay docs

* Dummy change to retrigger CI

Josh Fromm [Wed, 30 Oct 2019 18:24:47 +0000 (11:24 -0700)]

[Relay][Frontend][ONNX] New Operators and Opsets to Support BERT (#4197)

* Added slice v10

* Added constantofshape operation and small refactor.

* Finished one_hot implementation.

* Reshape working across all bert layers.

* Fixed constantofshape and removed code duplication.

* onnx model fully ingested.

* Working on improving onnx tests.

* Changed onnx testing to use onnxruntime instead of caffe2, also formatted.

* Add arbitrary output nodes to onnx frontend.

* Added v6 tiling for bert squad 8 support.

* Small syntax fixes

* Reduced code duplication in split opset versions.

* Added batch matmul test

* Added unstack split testing.

* Added onehot test, needs a little cleanup probably.

* Replaced deprecated constant fill with constantofshape and updated tests accordingly.

* Added tests for new opset version of slice and tile.

* lint clean up

* Lint fixes

* Changed onnx dependency

* Went back to caffe2 runtime for CI integration.

* Rebase and small typo/syntax changes.

* Added hard casting of onehot attributes to int.

Tianqi Chen [Wed, 30 Oct 2019 18:17:33 +0000 (11:17 -0700)]

[PYTHON] Add __init__ to the generated grammar so that it can be installed properly (#4223)

Sergei Grechanik [Wed, 30 Oct 2019 16:43:53 +0000 (19:43 +0300)]

[ARITH] Fix the rule y < x && x <= y (#4220)
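
As background for the rule named in this title: over integers, `x <= y` is the negation of `y < x`, so the conjunction can never hold. The log does not say what the rule rewrote to before the fix. The brute-force check below is only a sanity sketch with a hypothetical function name, not TVM's rewrite simplifier.

```python
from itertools import product


def rule_always_false(lo, hi):
    """Check that (y < x and x <= y) is False for all x, y in [lo, hi]."""
    values = range(lo, hi + 1)
    return all(not (y < x and x <= y) for x, y in product(values, repeat=2))
```

Calling `rule_always_false(-5, 5)` exercises all 121 pairs in that range, including the `x == y` boundary where an off-by-one in the rule would show up.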

shoubhik [Wed, 30 Oct 2019 16:12:14 +0000 (09:12 -0700)]

Improve the lowering of Qnn Dense (#4213)

* [QNN] Improving Dense lowering.

* - Moving get_shape method to util
- Finalizing the test cases and the code structure for optimized dense computation.

* - Fixing cpplint.

* - Addressing review comments.

* - Renaming the variables correctly.

* - Renaming the variables correctly.

Bohan Hou [Wed, 30 Oct 2019 09:13:35 +0000 (17:13 +0800)]

Fix typo in packed_func.h (#4219)

Logan Weber [Wed, 30 Oct 2019 04:51:20 +0000 (21:51 -0700)]

[Relay] Add Python type functor and tests (#4209)

* Add Python type functor and tests

* Lint roller

LiangHao [Tue, 29 Oct 2019 18:45:02 +0000 (02:45 +0800)]

Optimizing autotvm task extraction speed (#4138)

* Optimize task extraction speed

* correct pylint errors

* Delete unused function

* remove unnecessary argument

* resolve code review comments

* correct cpp lint errors

* remove one more graph_json return value

* fix test bugs

Wuwei Lin [Tue, 29 Oct 2019 12:23:23 +0000 (08:23 -0400)]

Update have_int8 condition to run on compute capability 7.x devices (#4214)
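
A capability check of this kind might look like the sketch below. It assumes the condition accepts compute capability 6.1 as well as 7.x and later; the exact condition in the repository is not shown in this log, and the helper `parse_compute_version` is a hypothetical name.

```python
def parse_compute_version(compute_version):
    """Split a string such as "7.5" into (major, minor) integers."""
    major, minor = compute_version.split(".")
    return int(major), int(minor)


def have_int8(compute_version):
    """Return True if the device is assumed to have fast int8 support.

    Assumption for this sketch: capability 6.1, or any 7.x and later.
    """
    major, minor = parse_compute_version(compute_version)
    if major == 6 and minor == 1:
        return True
    return major >= 7
```

With this shape, `"7.0"` and `"7.5"` pass the check, while `"6.0"` does not.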

Wuwei Lin [Tue, 29 Oct 2019 04:51:29 +0000 (00:51 -0400)]

[Relay][Quantize] Use fixed point multiplications (#4160)
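
For readers unfamiliar with the technique named in the title, here is a minimal sketch of multiplying an integer by a positive float scale using only integer arithmetic. It follows the common scheme of a 31-bit significand plus a power-of-two exponent. The function names are hypothetical, and nothing here is taken from the Relay quantization pass itself.

```python
import math


def to_fixed_point(multiplier):
    """Split a positive float into (significand, exponent).

    multiplier ~= significand / 2**31 * 2**exponent, where significand
    is an integer that fits in 31 bits.
    """
    if multiplier <= 0:
        raise ValueError("this sketch only handles positive multipliers")
    frac, exponent = math.frexp(multiplier)  # 0.5 <= frac < 1
    significand = int(round(frac * (1 << 31)))
    if significand == (1 << 31):  # rounding pushed frac up to 1.0
        significand //= 2
        exponent += 1
    return significand, exponent


def fixed_point_multiply(value, significand, exponent):
    """Compute round(value * multiplier) with integer operations only."""
    shift = 31 - exponent
    product = value * significand
    if shift <= 0:
        return product << (-shift)
    # Add half of the divisor before shifting to round to nearest.
    return (product + (1 << (shift - 1))) >> shift
```

For example, `fixed_point_multiply(7, *to_fixed_point(3.0))` stays entirely in integers; for fractional scales the result is the product rounded to the nearest integer.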

Xingyu Zhou [Mon, 28 Oct 2019 18:34:56 +0000 (11:34 -0700)]

[Relay][Op] Enhance Upsample Operator to support float scales (#4206)

* add scale2 for upsample

* update unit test for upsampling

* support latest upsample op for multiple frontend

* fix lint

* fix lint

* fix lint

* fix lint

* update scale description and rebase

Tianqi Chen [Mon, 28 Oct 2019 18:28:00 +0000 (14:28 -0400)]

[TOPI] Fix flaky testcase for check round (#4211)

Animesh Jain [Mon, 28 Oct 2019 18:26:20 +0000 (11:26 -0700)]

[Relay] Setting Legalize opt_level to 1. (#4198)

Logan Weber [Mon, 28 Oct 2019 15:03:32 +0000 (08:03 -0700)]

Fix type var docs (#4208)

Tianqi Chen [Sun, 27 Oct 2019 20:41:20 +0000 (16:41 -0400)]

[RUNTIME] Separate runtime related contrib into runtime/contrib (#4207)

Benjamin Tu [Sun, 27 Oct 2019 00:06:49 +0000 (17:06 -0700)]

[VTA][Chisel] TSIM VTA Source Refactor (#4163)

* app init push

* fix on readme

* change name, add bit serial explanation

* rm serialLoadMM, change doc

* syntax change for readme

* add parallel test functionality

* fix readme

* add python doc

* syntax

* init commit

* fix empty line

* fix typo

Jon Soifer [Sun, 27 Oct 2019 00:05:22 +0000 (17:05 -0700)]

[Relay][Frontend][ONNX] Add support for op Where (#4184)

* Add support for op Where

* Update impl version

Jared Roesch [Sun, 27 Oct 2019 00:05:03 +0000 (17:05 -0700)]

[Relay][Params] Add APIs for storing and retrieving parameters from individual functions. (#4194)

* Add support for attaching params

* Fix types

* Fix test

Altan Haan [Sun, 27 Oct 2019 00:04:42 +0000 (17:04 -0700)]

[Relay][Training] Add checkpoint annotation for checkpointing memory optimization (#4146)

* add checkpoint annotation for checkpointing memory optimization

* add alpha-equivalence checkpoint test and fix gradient type issue

* fix build issues

* ignore checkpoint annotation when checking missing gradients

* refactor, fix checkpoint compute for tuple and add tests

Zhi [Fri, 25 Oct 2019 22:47:45 +0000 (15:47 -0700)]

[hotfix] missing include headers (#4204)

雾雨魔理沙 [Fri, 25 Oct 2019 14:54:29 +0000 (07:54 -0700)]

[Relay] crossentropy_with_logits and its gradient (#4075)

* save

* lint

Animesh Jain [Fri, 25 Oct 2019 06:24:25 +0000 (23:24 -0700)]

[TOPI][x86] Legalize - Support int8xint8 convolution to use VNNI instructions. (#4196)

Tianqi Chen [Thu, 24 Oct 2019 22:49:24 +0000 (15:49 -0700)]

hotfix the ci (#4199)

Tianqi Chen [Thu, 24 Oct 2019 20:40:04 +0000 (13:40 -0700)]

[NODE][REFACTOR] Refactor reflection system in node. (#4189)

* [NODE][REFACTOR] Refactor reflection system in node.

- Removed the old Node, Node is now just an alias of runtime::Object
- Introduce ReflectionVTable, a new columnar dispatcher to support reflection
  - This allows us to remove vtable from most node objects
  - The VisitAttrs are registered via TVM_REGISTER_NODE_TYPE,
    they are no longer virtual.
- Consolidated serialization and reflection features into node.

* Explicit type qualification when calling destructor.

* Fix SPIRV, more comments

Siyuan Feng [Thu, 24 Oct 2019 19:04:37 +0000 (12:04 -0700)]

TensorCore Support using Intrinsic (#4136)

* add tensor core support

* avoid memory bank conflict

* fix thread sync & better performance

* better performance

* add schedule test for conv2d

* extend into BatchMatMul

* support config fragment shape and layout using intrinsic

* add TensorCore tutorial

* add int support and fix lint

* address comment

* add 32*16*8 TensorCore test

* fix wmma include logic

Cody Hao Yu [Thu, 24 Oct 2019 19:03:15 +0000 (12:03 -0700)]

[TOPI] Tunable Template for Conv2D HWCN on CUDA (#4168)

* support conv2d HWCN in AutoTVM and Relay

* fix lint

* fix comments and unit tests

雾雨魔理沙 [Thu, 24 Oct 2019 18:50:25 +0000 (11:50 -0700)]

[Relay] Fix memory leak in the interpreter (#4155)

* save

lint

* address reviewer comment

Ina Dobreva [Thu, 24 Oct 2019 16:58:22 +0000 (17:58 +0100)]

Add parser support for SUM tflite operator (#4182)

Zhi [Thu, 24 Oct 2019 16:56:58 +0000 (09:56 -0700)]

[relay] use time_evaluator for measurement (#4191)

optima2005 [Thu, 24 Oct 2019 16:56:00 +0000 (00:56 +0800)]

[Documentation]Fix example code in comment of tvm.build_module.build() (#4195)

* Fix example code in comment of tvm.build_module.build()

* Update build_module.py

Yao Wang [Thu, 24 Oct 2019 15:37:56 +0000 (08:37 -0700)]

Split adaptive_pool2d_avg into sum and div (#4186)
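
The idea in the title, computing adaptive average pooling as a window sum followed by a division, can be sketched on a single 2-D plane with plain lists. The window bounds use the usual floor/ceil convention for adaptive pooling, which is an assumption here, and all function names are hypothetical rather than TOPI's.

```python
def adaptive_window(out_index, in_size, out_size):
    """Return [start, end) of the input window for one output index."""
    start = (out_index * in_size) // out_size
    end = -((-(out_index + 1) * in_size) // out_size)  # ceil division
    return start, end


def adaptive_pool2d_sum(data, out_h, out_w):
    """Stage 1: sum the elements in each adaptive window."""
    in_h, in_w = len(data), len(data[0])
    out = []
    for i in range(out_h):
        h0, h1 = adaptive_window(i, in_h, out_h)
        row = []
        for j in range(out_w):
            w0, w1 = adaptive_window(j, in_w, out_w)
            row.append(sum(data[h][w]
                           for h in range(h0, h1) for w in range(w0, w1)))
        out.append(row)
    return out


def adaptive_pool2d_avg(data, out_h, out_w):
    """Stage 2: divide each window sum by its element count."""
    in_h, in_w = len(data), len(data[0])
    sums = adaptive_pool2d_sum(data, out_h, out_w)
    result = []
    for i in range(out_h):
        h0, h1 = adaptive_window(i, in_h, out_h)
        row = []
        for j in range(out_w):
            w0, w1 = adaptive_window(j, in_w, out_w)
            row.append(sums[i][j] / ((h1 - h0) * (w1 - w0)))
        result.append(row)
    return result
```

Keeping the sum and the division as separate stages means the reduction and the element-wise scaling can be scheduled independently.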

Jon Soifer [Thu, 24 Oct 2019 05:16:08 +0000 (22:16 -0700)]

[cmake][ANTLR] Support setting path to ANTLR jar (#4176)

* Support setting path to ANTLR jar

* Update comment

Tianqi Chen [Thu, 24 Oct 2019 04:59:58 +0000 (21:59 -0700)]

Revert "[Relay][QNN] Add unit test for int8 (#4159)" (#4192)

This reverts commit 6f9d028b80f9e41fd577b5c6a7229cafcfc72173.

Jon Soifer [Thu, 24 Oct 2019 03:12:47 +0000 (20:12 -0700)]

[DOCS] Add TensorFlow frontend docs (#4154)

* Start to update TF frontend docs

* Add rst

* Remove markdown

* Update wording

* Resolve comments

Bjarke Hammersholt Roune [Wed, 23 Oct 2019 20:53:03 +0000 (13:53 -0700)]

Add support and testing for tf.assert (as no-op) and tf.no_op to TF Relay frontend. (#4172)

Yizhi Liu [Wed, 23 Oct 2019 18:02:34 +0000 (02:02 +0800)]

[rpc] use callback func to do send & recv (#4147)

* [rpc] use callback func to do send & recv. don't get fd from sock as it is deprecated in java

* fix java build

* fix min/max macro define in windows

* keep the old rpc setup for py

* add doc for CallbackChannel

Wei Chen [Wed, 23 Oct 2019 02:13:55 +0000 (19:13 -0700)]

[Pass] Remove dead code (#4177)

Jon Soifer [Tue, 22 Oct 2019 23:37:53 +0000 (16:37 -0700)]

[Relay][Frontend][TF] Fix Size operator (#4175)

* [Relay][Frontend][TF] Fix Size operator

* Uncomment tests

mbarrett97 [Tue, 22 Oct 2019 20:49:17 +0000 (21:49 +0100)]

[TOPI] Added support for Mali Bifrost target (#4047)

Cody Hao Yu [Tue, 22 Oct 2019 20:26:04 +0000 (13:26 -0700)]

merge extract_from_program and extract_from_multiple_program (#4173)

Altan Haan [Tue, 22 Oct 2019 06:13:55 +0000 (23:13 -0700)]

add missing gradient check to gradient pass (#4169)

Zhi [Tue, 22 Oct 2019 06:13:35 +0000 (23:13 -0700)]

[relay][vm] Reuse allocated device memory (#4170)

Zhi [Mon, 21 Oct 2019 23:16:45 +0000 (16:16 -0700)]

[Relay][QNN] Add unit test for int8 (#4159)

* [bugfix][codegen] fix casting bug in llvm codegen

* update example

* retrigger ci

* check llvm version

Haichen Shen [Mon, 21 Oct 2019 20:40:55 +0000 (13:40 -0700)]

[Relay][Pass] Count MAC for BatchMatMul (#4157)

* count MAC for BatchMatMul

* update doc
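
As a rough illustration of what a MAC count for a batched matrix multiply involves, the sketch below assumes inputs shaped `(batch, M, K)` and `(batch, N, K)` producing a `(batch, M, N)` output, so each output element costs K multiply-accumulates. The function name is hypothetical and this is not the code of the Relay pass.

```python
def batch_matmul_macs(x_shape, y_shape):
    """Count multiply-accumulates for x (B, M, K) times y (B, N, K)^T."""
    batch, m, k = x_shape
    batch_y, n, k_y = y_shape
    if batch != batch_y or k != k_y:
        raise ValueError("batch and reduction dimensions must match")
    # One MAC per (batch, row, column, reduction index) combination.
    return batch * m * n * k
```

For shapes `(2, 3, 4)` and `(2, 5, 4)` this gives 2 * 3 * 5 * 4 = 120 MACs.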

Monkeyking [Mon, 21 Oct 2019 16:05:26 +0000 (00:05 +0800)]

Fix misspelling (#4166)

Replace "After connecting he usb" with "After connecting the usb"

ekalda [Mon, 21 Oct 2019 14:07:39 +0000 (15:07 +0100)]

Add support for quantized multiply to Relay (#4141)

This patch adds multiply operator for quantized tensors.
The details of the quantized multiplication are outlined
in the code.

This builds on pull request 3927 and includes the changes
Animesh mentions in the comments on that request.

Change-Id: I555715b53d0266a91d5c03dc3dfe8fc31e7ce4e1

Tianqi Chen [Mon, 21 Oct 2019 03:29:32 +0000 (20:29 -0700)]

[CI] Move golang tests to the end (#4164)
