[MLIR][GPU] Make max flat work group size for ROCDL kernels configurable

While the default value for the amdgpu-flat-work-group-size attribute,
"1, 256", matches the defaults from Clang, some users of the ROCDL dialect,
namely TensorFlow, use larger workgroups, such as 1024. Therefore,
instead of hardcoding this value, we add a rocdl.max_flat_work_group_size
attribute that can be set on GPU kernels to override the default value.
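
As a rough illustration of the behavior described above, the following standalone sketch builds the value string for amdgpu-flat-work-group-size from an optional override. It is not the code from this change: the helper name flatWorkGroupSizeValue and the use of std::optional are assumptions made for illustration, and it assumes the override only replaces the upper bound while the lower bound stays at 1.

```cpp
#include <cassert>
#include <cstdint>
#include <optional>
#include <string>

// Hypothetical helper (not part of the actual patch): returns the value for
// the "amdgpu-flat-work-group-size" function attribute.
// Assumption: an override, when present, replaces only the maximum; the
// minimum stays at 1, matching the "1, 256" default quoted in the message.
std::string flatWorkGroupSizeValue(std::optional<std::uint32_t> maxOverride) {
  constexpr std::uint32_t kDefaultMax = 256;  // default named in the commit message
  const std::uint32_t maxSize = maxOverride.value_or(kDefaultMax);
  assert(maxSize >= 1 && "maximum flat work group size must be at least 1");
  return "1, " + std::to_string(maxSize);
}
```

Under these assumptions, a kernel without the override keeps the Clang-compatible default, while a kernel that sets rocdl.max_flat_work_group_size to 1024 would get "1, 1024".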

Reviewed By: whchung

Differential Revision: https://reviews.llvm.org/D115741

author	Krzysztof Drewniak <Krzysztof.Drewniak@amd.com>
	Tue, 14 Dec 2021 18:47:09 +0000 (18:47 +0000)
committer	Krzysztof Drewniak <Krzysztof.Drewniak@amd.com>
	Tue, 14 Dec 2021 20:12:23 +0000 (20:12 +0000)
commit	c57b2a0635df9eae0b1d699f83b9b158d5a89135
tree	64e1e8d61b62043cf2990417e0039ac2a05c8e5d
parent	100863ccd8d41091f90749ba76d91f6dfafdde57

mlir/lib/Target/LLVMIR/Dialect/ROCDL/ROCDLToLLVMIRTranslation.cpp
mlir/test/Target/LLVMIR/rocdl.mlir