ARM Compute Library
17.04
|
Basic function to execute GEMM on OpenCL. More...
#include <CLGEMM.h>
Public Member Functions | |
CLGEMM () | |
Default constructor. More... | |
void | configure (const ICLTensor *a, const ICLTensor *b, const ICLTensor *c, ICLTensor *output, float alpha, float beta) |
Initialise the kernel's inputs and output. More... | |
void | run () override |
Run the kernels contained in the function. More... | |
Public Member Functions inherited from IFunction | |
virtual | ~IFunction ()=default |
Destructor. More... | |
Basic function to execute GEMM on OpenCL.
Data types supported: F32, F16. This function calls the following OpenCL kernels:
CLGEMM | ( | ) |
Default constructor.
void configure | ( | const ICLTensor * | a, |
const ICLTensor * | b, | ||
const ICLTensor * | c, | ||
ICLTensor * | output, | ||
float | alpha, | ||
float | beta | ||
) |
Initialise the kernel's inputs and output.
[in] | a | First input tensor (Matrix or Vector A). Data types supported: F32, F16 |
[in] | b | Second input tensor (Matrix B). Data type supported: same as a . |
[in] | c | Third input tensor (Matrix C). It can be a nullptr if just the multiplication between a and b is needed. Data type supported: same as a . |
[out] | output | Output tensor. Data type supported: same as a |
[in] | alpha | Weight of the matrix product |
[in] | beta | Weight of matrix C |
|
overridevirtual |
Run the kernels contained in the function.
For NEON kernels:
For OpenCL kernels:
Implements IFunction.