review.tizen.org Git - platform/upstream/llvm.git/commit

projects / platform / upstream / llvm.git / commit

author	thomasraoux <thomasraoux@google.com>
	Thu, 27 May 2021 15:58:11 +0000 (08:58 -0700)
committer	thomasraoux <thomasraoux@google.com>
	Thu, 27 May 2021 16:13:51 +0000 (09:13 -0700)
commit	b44007bec2470db0d9f100c6a9216d8e05cef608
tree	0522516af3f06cf0a93906acf22eb7a096d9b309	tree \| snapshot
parent	6d2c0950205f50f926ba5e362e845faff22582b7	commit \| diff

[mlir][gpu] Relax restriction on MMA store op to allow chain of mma ops.

In order to allow large matmul operations using the MMA ops we need to chain
operations this is not possible unless "DOp" and "COp" type have matching
layout so remove the "DOp" layout and force accumulator and result type to
match.
Added a test for the case where the MMA value is accumulated.

Differential Revision: https://reviews.llvm.org/D103023

Domain: System / Toolchain;

RSS Atom

mlir/include/mlir/Dialect/GPU/GPUDialect.h		diff \| blob \| history
mlir/include/mlir/Dialect/GPU/GPUOps.td		diff \| blob \| history
mlir/lib/Conversion/GPUToNVVM/LowerGpuOpsToNVVMOps.cpp		diff \| blob \| history
mlir/lib/Conversion/GPUToNVVM/WmmaOpsToNvvm.cpp		diff \| blob \| history
mlir/lib/Dialect/GPU/IR/GPUDialect.cpp		diff \| blob \| history
mlir/test/Conversion/GPUToNVVM/wmma-ops-to-nvvm.mlir		diff \| blob \| history
mlir/test/Dialect/GPU/invalid.mlir		diff \| blob \| history
mlir/test/Integration/GPU/CUDA/TensorCore/wmma-matmul-f16.mlir		diff \| blob \| history
mlir/test/Integration/GPU/CUDA/TensorCore/wmma-matmul-f32.mlir		diff \| blob \| history