ARM Compute Library
17.03.1
|
Basic function to execute GEMM on NEON. More...
#include <NEGEMM.h>
Public Member Functions | |
NEGEMM () | |
Constructor. More... | |
void | configure (const ITensor *a, const ITensor *b, const ITensor *c, ITensor *d, float alpha, float beta) |
Initialise the kernel's inputs, output. More... | |
void | run () override |
Run the kernels contained in the function. More... | |
Public Member Functions inherited from IFunction | |
virtual | ~IFunction ()=default |
Destructor. More... | |
Basic function to execute GEMM on NEON.
This function calls the following NEON kernels:
NEGEMM | ( | ) |
Constructor.
void configure | ( | const ITensor * | a, |
const ITensor * | b, | ||
const ITensor * | c, | ||
ITensor * | d, | ||
float | alpha, | ||
float | beta | ||
) |
Initialise the kernel's inputs, output.
[in] | a | First input tensor (Matrix A or Vector A). Data type supported: F32, F16. |
[in] | b | Second input tensor (Matrix B). Data type supported: same as a |
[in] | c | Third input tensor (Matrix C). It can be a nullptr if just the multiplication between a and b is needed. Data type supported: same as a |
[out] | d | Output tensor. Data type supported: same as a |
[in] | alpha | Weight of the matrix product |
[in] | beta | Weight of matrix C |
|
overridevirtual |
Run the kernels contained in the function.
For NEON kernels:
For OpenCL kernels:
Implements IFunction.