Use 2D thread distribution for small GEMMs.
authorTim Moon <moon13@llnl.gov>
Tue, 3 Oct 2017 20:43:39 +0000 (13:43 -0700)
committerTim Moon <moon13@llnl.gov>
Tue, 3 Oct 2017 20:43:39 +0000 (13:43 -0700)
commit860dcfc7037bdb022083c4cda39da8f72628f8bf
tree593348a15f8778879c8902d23b73616f525dada3
parenta89d6711c6b2472e2a7a32824e326c222aee6e89
Use 2D thread distribution for small GEMMs.

Allows maximum use of available cores if one of M and N is small and the other is large.
driver/level3/level3_thread.c