aco/insert_exec_mask: optimize top-level transition to exact before demote
authorRhys Perry <pendingchaos02@gmail.com>
Wed, 23 Feb 2022 17:29:25 +0000 (17:29 +0000)
committerMarge Bot <emma+marge@anholt.net>
Tue, 8 Mar 2022 12:49:59 +0000 (12:49 +0000)
commitd068eb53e84ca1e44ad96c31dab63476880b3c72
treeb8c933592201ca9e9fab99d477715da46d619517
parent42a5be975adc51d1029f25719b979bf2b3bae8a9
aco/insert_exec_mask: optimize top-level transition to exact before demote

fossil-db (Sienna Cichlid):
Totals from 5767 (3.55% of 162293) affected shaders:
Instrs: 3264949 -> 3257527 (-0.23%); split: -0.23%, +0.00%
CodeSize: 17835692 -> 17806004 (-0.17%); split: -0.17%, +0.00%
Latency: 45990060 -> 45987924 (-0.00%); split: -0.00%, +0.00%
InvThroughput: 7643850 -> 7643835 (-0.00%); split: -0.00%, +0.00%
Copies: 193641 -> 186219 (-3.83%); split: -3.84%, +0.01%

Signed-off-by: Rhys Perry <pendingchaos02@gmail.com>
Reviewed-by: Daniel Schürmann <daniel@schuermann.dev>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15244>
src/amd/compiler/aco_insert_exec_mask.cpp