Basic function to execute GEMMLowp on OpenCL. More...

#include <CLGEMMLowp.h>

Collaboration diagram for CLGEMMLowp:

Public Member Functions
	CLGEMMLowp ()
	Constructor. More...

void	configure (const ICLTensor a, const ICLTensor b, ICLTensor *output, int32_t a_offset, int32_t b_offset, int32_t output_offset, int32_t output_mult_int, int32_t shift)
	Initialise the kernel's inputs, output. More...

void	run () override
	Run the kernels contained in the function. More...

Public Member Functions inherited from IFunction
virtual	~IFunction ()=default
	Destructor. More...

Detailed Description

Basic function to execute GEMMLowp on OpenCL.

This function calls the following OpenCL kernels:

Definition at line 46 of file CLGEMMLowp.h.

Constructor & Destructor Documentation

CLGEMMLowp ( )

Constructor.

Initialise the kernel's inputs, output.

Note: GEMM_LOWP: low precision matrix multiply kernel This kernel performs the following computation:

Parameters

[in]	a	First input tensor (Matrix A). Data types supported: U8.
[in]	b	Second input tensor (Matrix B). Data types supported: same as `a`.
[out]	output	Output tensor. Data types supported: same as `a`.
[in]	a_offset	Offset to be added to each element of the matrix A.
[in]	b_offset	Offset to be added to each element of the matrix B.
[in]	output_offset	Offset to be added to each element of the output matrix
[in]	output_mult_int	Multiplied with each element of the output matrix
[in]	shift	Number of bits to shift right the result.

void run ( )

overridevirtual

Run the kernels contained in the function.

For NEON kernels:

Note: CPPScheduler::force_number_of_threads() can be used to manually set the number of threads

For OpenCL kernels:

Note: The function will not block until the kernels are executed. It is the user's responsibility to wait.

Implements IFunction.

The documentation for this class was generated from the following file: