aco: don't optimize cross-lane instructions across p_wqm
author Daniel Schürmann <daniel@schuermann.dev>
Mon, 4 Sep 2023 13:32:24 +0000 (15:32 +0200)
committer Marge Bot <emma+marge@anholt.net>
Thu, 14 Sep 2023 09:25:22 +0000 (09:25 +0000)
commit 1275981df8c3762c208a49c8cde7deaa15489b24
tree 01fbae6d8bbf67b781c37f42a02ae38a9fc3fa29
parent 0907b53740a456d769d7ad26f071d7610abb9918

We will use p_wqm as a marker in the next step.
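
To illustrate the idea (this is not the code from this commit), here is a minimal sketch of a value-numbering pass that refuses to merge cross-lane instructions across a marker. The toy IR, the names Kind, Instr and value_number, and the "epoch" counter are all hypothetical; the real change in aco_opt_value_numbering.cpp may be implemented quite differently.

```cpp
#include <cassert>
#include <map>
#include <string>
#include <tuple>
#include <vector>

// Hypothetical toy IR for illustration only; not ACO's real data structures.
enum class Kind { Plain, CrossLane, WqmMarker };

struct Instr {
    Kind kind;
    std::string opcode;
    int operand; // id of the single source value
    int def;     // id of the value this instruction defines
};

// Returns, for each instruction, the value id that represents its result
// after value numbering (its own def if it was not merged with an earlier one).
std::vector<int> value_number(const std::vector<Instr>& block)
{
    // Key: (opcode, canonical operand, epoch). Plain instructions always use
    // epoch 0, so they can still be merged across a marker. Cross-lane
    // instructions use the current epoch, so they only match instructions
    // that sit on the same side of every marker.
    std::map<std::tuple<std::string, int, unsigned>, int> table;
    std::map<int, int> rename; // merged def -> surviving def
    unsigned epoch = 0;
    std::vector<int> result;

    for (const Instr& in : block) {
        if (in.kind == Kind::WqmMarker) {
            // Assumption: the marker itself is never merged and starts a
            // new epoch for all cross-lane instructions that follow it.
            ++epoch;
            result.push_back(in.def);
            continue;
        }
        auto renamed = rename.find(in.operand);
        int src = renamed != rename.end() ? renamed->second : in.operand;
        unsigned key_epoch = in.kind == Kind::CrossLane ? epoch : 0;

        auto [it, inserted] = table.emplace(std::make_tuple(in.opcode, src, key_epoch), in.def);
        if (!inserted)
            rename[in.def] = it->second; // reuse the earlier, equivalent value
        result.push_back(it->second);
    }
    return result;
}
```

In this sketch, two identical cross-lane instructions separated by the marker keep distinct results, while identical plain ALU instructions on either side of it are still merged.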

Totals from 8846 (11.55% of 76572) affected shaders: (GFX11)

Instrs: 7031274 -> 7072729 (+0.59%); split: -0.02%, +0.60%
CodeSize: 37060272 -> 37355244 (+0.80%); split: -0.01%, +0.80%
VGPRs: 402660 -> 398724 (-0.98%); split: -0.99%, +0.01%
Latency: 62231926 -> 62322311 (+0.15%); split: -0.01%, +0.15%
InvThroughput: 10341361 -> 10392589 (+0.50%); split: -0.00%, +0.50%
VClause: 105344 -> 105368 (+0.02%); split: -0.03%, +0.05%
SClause: 218330 -> 218469 (+0.06%); split: -0.07%, +0.14%
Copies: 378609 -> 377644 (-0.25%); split: -0.42%, +0.17%
Branches: 97218 -> 97207 (-0.01%); split: -0.01%, +0.00%
PreSGPRs: 307654 -> 307644 (-0.00%); split: -0.08%, +0.08%
PreVGPRs: 314744 -> 308650 (-1.94%)
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25038>
src/amd/compiler/aco_opt_value_numbering.cpp
src/amd/compiler/tests/test_d3d11_derivs.cpp