[AMDGPU] Update VMEM scalar write hazard mitigation sequence
authorCarl Ritson <carl.ritson@amd.com>
Thu, 16 Jul 2020 02:07:26 +0000 (11:07 +0900)
committerCarl Ritson <carl.ritson@amd.com>
Thu, 16 Jul 2020 02:37:45 +0000 (11:37 +0900)
commit5bf2a9dd40dbba6dacbbe61f843d4d3a6f54f294
tree4650a370688a90bd8f446875ac8e2f8f8724923d
parentfd802cc4dea4ed1a233ff725f98c686dc2836bf3
[AMDGPU] Update VMEM scalar write hazard mitigation sequence

Using s_waitcnt_depctr 0xffe3 is potentially faster than v_nop.

Reviewed By: rampitec, foad

Differential Revision: https://reviews.llvm.org/D83872
llvm/lib/Target/AMDGPU/GCNHazardRecognizer.cpp
llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.div.fmas.ll
llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.end.cf.i32.ll
llvm/test/CodeGen/AMDGPU/atomic_optimizations_local_pointer.ll
llvm/test/CodeGen/AMDGPU/atomic_optimizations_pixelshader.ll
llvm/test/CodeGen/AMDGPU/cc-update.ll
llvm/test/CodeGen/AMDGPU/vgpr-descriptor-waterfall-loop-idom-update.ll
llvm/test/CodeGen/AMDGPU/vgpr-tuple-allocation.ll
llvm/test/CodeGen/AMDGPU/vmem-to-salu-hazard.mir
llvm/test/CodeGen/AMDGPU/wave32.ll