Basic function to execute GEMM on OpenCL. More...

#include <CLGEMM.h>

Collaboration diagram for CLGEMM:

Public Member Functions
	CLGEMM ()
	Default constructor. More...

void	configure (const ICLTensor a, const ICLTensor b, const ICLTensor c, ICLTensor output, float alpha, float beta)
	Initialise the kernel's inputs and output. More...

void	run () override
	Run the kernels contained in the function. More...

Public Member Functions inherited from IFunction
virtual	~IFunction ()=default
	Destructor. More...

Detailed Description

Basic function to execute GEMM on OpenCL.

Data types supported: F32, F16. This function calls the following OpenCL kernels:

Definition at line 47 of file CLGEMM.h.

Constructor & Destructor Documentation

CLGEMM ( )

Default constructor.

Initialise the kernel's inputs and output.

Note: GEMM: General Matrix Multiply - [alpha * A * B + beta * C].; All tensors must have the same data type. Data types supported: F32, F16; Whilst the first input tensor can be a vector, the second input tensor must be at least a matrix

Parameters

[in]	a	First input tensor (Matrix or Vector A). Data types supported: F32, F16
[in]	b	Second input tensor (Matrix B). Data type supported: same as `a`.
[in]	c	Third input tensor (Matrix C). It can be a nullptr if just the multiplication between `a` and `b` is needed. Data type supported: same as `a`.
[out]	output	Output tensor. Data type supported: same as `a`
[in]	alpha	Weight of the matrix product
[in]	beta	Weight of matrix C

void run ( )

overridevirtual

Run the kernels contained in the function.

For NEON kernels:

Note: CPPScheduler::force_number_of_threads() can be used to manually set the number of threads

For OpenCL kernels:

Note: The function will not block until the kernels are executed. It is the user's responsibility to wait.

Implements IFunction.

The documentation for this class was generated from the following file: