review.tizen.org Git - platform/upstream/llvm.git/commit

author	aartbik <ajcbik@google.com>
	Thu, 23 Jul 2020 06:47:33 +0000 (23:47 -0700)
committer	aartbik <ajcbik@google.com>
	Thu, 23 Jul 2020 06:47:36 +0000 (23:47 -0700)
commit	1485fd295b2ab7fde3bf8d3ce33912c8a5fe1105
tree	84d58d8e09a2314cbe2c454832187e55c51579ee	tree \| snapshot
parent	dab898f9ab62275f55eb2a7e76ea0cd01696f68e	commit \| diff

[mlir] [VectorOps] Improve scatter/gather CPU performance

Replaced the linearized address with the proper LLVM way of
defining vector of base + indices in SIMD style. This yields
much better code. Some prototype results with microbencmarking
sparse matrix x vector with 50% sparsity (about 2-3x faster):

         LINEARIZED     IMPROVED
GFLOPS  sdot  saxpy     sdot saxpy
16x16    1.6   1.4       4.4  2.1
32x32    1.7   1.6       5.8  5.9
64x64    1.7   1.7       6.4  6.4
128x128  1.7   1.7       5.9  5.9
256x256  1.6   1.6       6.1  6.0
512x512  1.4   1.4       4.9  4.7

Reviewed By: nicolasvasilache

Differential Revision: https://reviews.llvm.org/D84368

mlir/include/mlir/Dialect/Vector/VectorOps.td		diff \| blob \| history
mlir/lib/Conversion/VectorToLLVM/ConvertVectorToLLVM.cpp		diff \| blob \| history
mlir/test/Conversion/VectorToLLVM/vector-to-llvm.mlir		diff \| blob \| history