AMDGPU: Re-organize the outer loop of SILoadStoreOptimizer
authorNicolai Haehnle <nhaehnle@gmail.com>
Tue, 28 Nov 2017 08:42:46 +0000 (08:42 +0000)
committerNicolai Haehnle <nhaehnle@gmail.com>
Tue, 28 Nov 2017 08:42:46 +0000 (08:42 +0000)
commitb4f28deda0b3e9a3bf2c092622d5fc3ecc846201
tree6dadd1d916dd950c45164d7b4b220ef10cc07dc6
parent39980dac0b1724cea2a64a2ee76ce35df6dbf0b6
AMDGPU: Re-organize the outer loop of SILoadStoreOptimizer

Summary:
The entire algorithm operates per basic-block, so for cache locality
it should be better to re-optimize a basic-block immediately rather than
in a separate loop.

I don't have performance measurements.

Change-Id: I85106570bd623c4ff277faaa50ee43258e1ddcc5

Reviewers: arsenm, rampitec

Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, llvm-commits, t-tye

Differential Revision: https://reviews.llvm.org/D40344

llvm-svn: 319156
llvm/lib/Target/AMDGPU/SILoadStoreOptimizer.cpp