ARM Compute Library
17.04
Basic function to execute GEMMLowp on OpenCL.
#include <CLGEMMLowp.h>
Public Member Functions
CLGEMMLowp ()
Constructor.
void | configure (const ICLTensor *a, const ICLTensor *b, ICLTensor *output, int32_t a_offset, int32_t b_offset, int32_t output_offset, int32_t output_mult_int, int32_t shift)
Initialise the kernel's inputs and output.
void | run () override
Run the kernels contained in the function.
Public Member Functions inherited from IFunction
virtual | ~IFunction ()=default
Destructor.
Basic function to execute GEMMLowp on OpenCL.
This function calls the following OpenCL kernels:
Definition at line 46 of file CLGEMMLowp.h.
CLGEMMLowp ()
Constructor.
void configure (const ICLTensor *a, const ICLTensor *b, ICLTensor *output, int32_t a_offset, int32_t b_offset, int32_t output_offset, int32_t output_mult_int, int32_t shift)
Initialise the kernel's inputs and output.
[in] | a | First input tensor (Matrix A). Data types supported: U8. |
[in] | b | Second input tensor (Matrix B). Data types supported: same as a. |
[out] | output | Output tensor. Data types supported: same as a. |
[in] | a_offset | Offset to be added to each element of the matrix A. |
[in] | b_offset | Offset to be added to each element of the matrix B. |
[in] | output_offset | Offset to be added to each element of the output matrix. |
[in] | output_mult_int | Value by which each element of the output matrix is multiplied. |
[in] | shift | Number of bits by which the result is shifted right. |
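The parameter descriptions above suggest a per-element computation that can be sketched on the CPU for checking results. The following is a minimal reference sketch, not the library's kernel: the function name `gemmlowp_reference`, the row-major layout, the explicit `M`, `K`, `N` dimensions, the order of the requantization steps (add `output_offset`, multiply by `output_mult_int`, shift right by `shift`), and the saturation to the U8 range are all assumptions made for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical CPU reference, not part of ARM Compute Library.
// a is M x K and b is K x N, both row-major U8; the result is M x N row-major U8.
std::vector<uint8_t> gemmlowp_reference(const std::vector<uint8_t> &a,
                                        const std::vector<uint8_t> &b,
                                        std::size_t M, std::size_t K, std::size_t N,
                                        int32_t a_offset, int32_t b_offset,
                                        int32_t output_offset, int32_t output_mult_int,
                                        int32_t shift)
{
    assert(a.size() == M * K && b.size() == K * N);
    assert(shift >= 0 && shift < 32);

    std::vector<uint8_t> out(M * N);
    for (std::size_t i = 0; i < M; ++i) {
        for (std::size_t j = 0; j < N; ++j) {
            // Accumulate in 32 bits after adding the input offsets.
            int32_t acc = 0;
            for (std::size_t k = 0; k < K; ++k) {
                const int32_t av = static_cast<int32_t>(a[i * K + k]) + a_offset;
                const int32_t bv = static_cast<int32_t>(b[k * N + j]) + b_offset;
                acc += av * bv;
            }
            // Assumed requantization order: offset, multiply, shift right.
            // Right-shifting a negative value is arithmetic on common compilers
            // (and guaranteed from C++20 onwards).
            const int32_t scaled = ((acc + output_offset) * output_mult_int) >> shift;
            // Assumed saturation to the U8 range [0, 255].
            out[i * N + j] = static_cast<uint8_t>(
                std::min<int32_t>(255, std::max<int32_t>(0, scaled)));
        }
    }
    return out;
}
```

With all offsets set to 0, `output_mult_int` set to 1 and `shift` set to 0, this reduces to a plain saturated integer matrix product, which makes it easy to sanity-check small inputs by hand.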
void run () override virtual
Run the kernels contained in the function.
For NEON kernels:
For OpenCL kernels:
Implements IFunction.
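The relationship between a function object and the IFunction interface it implements can be illustrated with a simplified stand-in. This sketch does not reproduce the library's headers: the `sketch` namespace, `CountingFunction`, its `configure(int)` signature and `run_all` are hypothetical names, and treating `run()` as pure virtual in the base class is an assumption. It only shows the configure-once, run-many pattern through a base-class pointer.

```cpp
#include <memory>
#include <vector>

namespace sketch {

// Simplified stand-in for the interface described above; not the library header.
class IFunction {
public:
    virtual void run() = 0;           // assumed pure virtual
    virtual ~IFunction() = default;   // virtual destructor, as listed above
};

// Hypothetical function: configure() stores state once, run() reuses it.
class CountingFunction : public IFunction {
public:
    void configure(int step) { _step = step; }
    void run() override { _total += _step; }
    int total() const { return _total; }

private:
    int _step{0};
    int _total{0};
};

// Runs every function in order through the base-class interface.
inline void run_all(const std::vector<std::unique_ptr<IFunction>> &functions)
{
    for (const auto &f : functions) {
        f->run();
    }
}

} // namespace sketch
```

Holding functions behind the base interface lets a caller build a pipeline once and call `run()` on each stage repeatedly without knowing the concrete types.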