review.tizen.org Git - platform/upstream/openblas.git/commit

author	Marius Hillenbrand <mhillen@linux.ibm.com>
	Tue, 12 May 2020 12:40:30 +0000 (14:40 +0200)
committer	Marius Hillenbrand <mhillen@linux.ibm.com>
	Tue, 12 May 2020 13:59:51 +0000 (15:59 +0200)
commit	71b6eaf459e55e7b5fe5047052c39c49f16c3680
tree	57e9b7d1e45b10be4a68fc4d72e4fe4f1ba086ac
parent	43c0d4f312ba3cd1a0ff8f389e6eded98113c0dd

s390x: Use new sgemm kernel also for strmm on Z14 and newer

Employ the newly added SGEMM kernel for STRMM on Z14 as well. The
implementation in C with vector intrinsics exploits FP32 SIMD operations
and thereby gains performance over the existing assembly code. Extend
the implementation to handle triangular matrix multiplication
accordingly. As an added benefit, the more flexible C code enables us to
adjust register blocking in the subsequent commit.
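
To illustrate how a GEMM-style block kernel can be reused for triangular
multiplication, here is a minimal scalar sketch. It is not the code in
kernel/zarch/gemm_vec.c: the names block_kernel and trmm_lower_sketch, the
row-major layout, and the restriction to the left-side, lower-triangular,
non-transposed case are assumptions made for illustration. The actual kernel
uses vector intrinsics and is not reproduced here. The sketch also assumes
that the triangular operand stores explicit zeros above the diagonal, so each
row block only needs the k range up to the end of its diagonal block.

```c
#include <stddef.h>

/* Hypothetical block kernel: C[mr x nr] = A[mr x k_len] * B[k_len x nr].
 * All matrices are row-major with leading dimensions lda, ldb, ldc. */
static void block_kernel(size_t mr, size_t nr, size_t k_len,
                         const float *a, size_t lda,
                         const float *b, size_t ldb,
                         float *c, size_t ldc)
{
    for (size_t i = 0; i < mr; i++) {
        for (size_t j = 0; j < nr; j++) {
            float acc = 0.0f;
            for (size_t k = 0; k < k_len; k++)
                acc += a[i * lda + k] * b[k * ldb + j];
            c[i * ldc + j] = acc;
        }
    }
}

/* C = L * B, where L is m x m lower triangular (explicit zeros above the
 * diagonal, an assumption of this sketch) and B is m x n. Rows are
 * processed in blocks of at most mr_max (must be nonzero). */
void trmm_lower_sketch(size_t m, size_t n,
                       const float *l, const float *b, float *c,
                       size_t mr_max)
{
    for (size_t i0 = 0; i0 < m; i0 += mr_max) {
        size_t mr = (m - i0 < mr_max) ? (m - i0) : mr_max;
        /* Columns of L to the right of this diagonal block are zero,
         * so the GEMM loop over k can stop at i0 + mr instead of m. */
        size_t k_len = i0 + mr;
        block_kernel(mr, n, k_len, l + i0 * m, m, b, n, c + i0 * n, n);
    }
}
```

The only difference from a plain GEMM driver is the shortened k range per row
block. The row-block height mr_max plays the role of a register-blocking
parameter here, and changing it does not affect the result.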

Tested via make -C test / ctest / utest and by a couple of additional
unit tests that exercise blocking.

Signed-off-by: Marius Hillenbrand <mhillen@linux.ibm.com>

kernel/zarch/KERNEL.Z14
kernel/zarch/gemm_vec.c