review.tizen.org Git - platform/upstream/llvm.git/commit

projects / platform / upstream / llvm.git / commit

author	Jay Foad <jay.foad@amd.com>
	Mon, 13 Jun 2022 15:35:44 +0000 (16:35 +0100)
committer	Jay Foad <jay.foad@amd.com>
	Mon, 13 Jun 2022 20:12:11 +0000 (21:12 +0100)
commit	bfcfd53b9244874b9807409a01407fd9e1d5d3e3
tree	080d7360a13416999bd8fe9804189c8b06e9f2ba	tree \| snapshot
parent	be232979bccee6e0257bce246f0126ef26460f48	commit \| diff

[AMDGPU] Add GFX11 llvm.amdgcn.permlane64 intrinsic

Compared to permlane16, permlane64 has no BC input because it has no
boundary conditions, no fi input because the instruction acts as if FI
were always enabled, and no OLD input because it always writes to every
active lane.

Also use the new intrinsic in the atomic optimizer pass.

Differential Revision: https://reviews.llvm.org/D127662

Domain: System / Toolchain;

RSS Atom

llvm/include/llvm/IR/IntrinsicsAMDGPU.td		diff \| blob \| history
llvm/lib/Target/AMDGPU/AMDGPUAtomicOptimizer.cpp		diff \| blob \| history
llvm/lib/Target/AMDGPU/AMDGPUInstCombineIntrinsic.cpp		diff \| blob \| history
llvm/lib/Target/AMDGPU/AMDGPURegisterBankInfo.cpp		diff \| blob \| history
llvm/lib/Target/AMDGPU/VOP1Instructions.td		diff \| blob \| history
llvm/test/CodeGen/AMDGPU/atomic_optimizations_local_pointer.ll		diff \| blob \| history
llvm/test/CodeGen/AMDGPU/llvm.amdgcn.permlane64.ll	[new file with mode: 0644]	blob
llvm/test/Transforms/InstCombine/AMDGPU/permlane64.ll	[new file with mode: 0644]	blob