[libomptarget] [amdgpu] Change default number of teams per computation unit
authorDhruva Chakrabarti <Dhruva.Chakrabarti@amd.com>
Tue, 29 Jun 2021 22:07:57 +0000 (15:07 -0700)
committerDhruva Chakrabarti <Dhruva.Chakrabarti@amd.com>
Tue, 29 Jun 2021 22:34:35 +0000 (15:34 -0700)
commite0b713a0357aa31ef906111115d4e881503e56ba
tree3b8742299d74d65c486c9c8b1148369d62d49947
parent21e013303bb7d0dbb9106283af0fb966fe45af42
[libomptarget] [amdgpu] Change default number of teams per computation unit

This patch is related to https://reviews.llvm.org/D98832. Based on discussions there, I decided to separate out the teams default as this patch. This change is to increase the number of teams per computation unit so as to provide more wavefronts for hiding latency. This change improves performance for some programs, including 20-50% for some Stream benchmarks.

Reviewed By: JonChesterfield

Differential Revision: https://reviews.llvm.org/D99003
openmp/libomptarget/plugins/amdgpu/src/rtl.cpp