Basic function to execute GEMM on NEON.

#include <NEGEMM.h>

Public Member Functions
	NEGEMM ()
	Constructor.

void	configure (const ITensor *a, const ITensor *b, const ITensor *c, ITensor *d, float alpha, float beta)
	Initialise the kernel's inputs and output.

void	run () override
	Run the kernels contained in the function.

Public Member Functions inherited from IFunction
virtual	~IFunction ()=default
	Destructor.

Detailed Description

Basic function to execute GEMM on NEON.

This function calls the following NEON kernels:

Definition at line 45 of file NEGEMM.h.

Constructor & Destructor Documentation

NEGEMM ( )

Constructor.

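Member Function Documentation

void configure (const ITensor *a, const ITensor *b, const ITensor *c, ITensor *d, float alpha, float beta)

Initialise the kernel's inputs and output.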

Note: GEMM: General Matrix Multiply - [alpha * A * B + beta * C].

Note: The tensors a, b, c and d must all have the same data type, either F32 or F16. Do not mix data types when calling this function.

Parameters

[in]	a	First input tensor (Matrix A or Vector A). Data type supported: F32, F16.
[in]	b	Second input tensor (Matrix B). Data type supported: same as `a`
[in]	c	Third input tensor (Matrix C). It can be a nullptr if just the multiplication between `a` and `b` is needed. Data type supported: same as `a`
[out]	d	Output tensor. Data type supported: same as `a`
[in]	alpha	Weight of the matrix product
[in]	beta	Weight of matrix C
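
The arithmetic described by the parameters above can be illustrated with a plain C++ reference, independent of the library. This is only a sketch for checking expected values: `reference_gemm` is a hypothetical helper, not part of NEGEMM, and it assumes dense row-major float buffers with A of size M x K, B of size K x N, and C and D of size M x N. It mirrors the documented behaviour that `c` may be a nullptr when only the product of `a` and `b` is needed.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical reference helper (not a library function):
// computes D = alpha * A * B + beta * C on row-major float buffers.
// A is m x k, B is k x n, C (optional) and D are m x n.
std::vector<float> reference_gemm(const std::vector<float> &a,
                                  const std::vector<float> &b,
                                  const std::vector<float> *c, // may be nullptr, like the optional `c`
                                  std::size_t m, std::size_t n, std::size_t k,
                                  float alpha, float beta)
{
    // Shape checks for the assumed row-major layout.
    assert(a.size() == m * k);
    assert(b.size() == k * n);
    assert(c == nullptr || c->size() == m * n);

    std::vector<float> d(m * n, 0.0f);
    for (std::size_t row = 0; row < m; ++row)
    {
        for (std::size_t col = 0; col < n; ++col)
        {
            // Dot product of one row of A with one column of B.
            float acc = 0.0f;
            for (std::size_t i = 0; i < k; ++i)
            {
                acc += a[row * k + i] * b[i * n + col];
            }

            // alpha weights the matrix product; beta weights matrix C.
            float out = alpha * acc;
            if (c != nullptr)
            {
                out += beta * (*c)[row * n + col];
            }
            d[row * n + col] = out;
        }
    }
    return d;
}
```

A reference like this can be used to validate the output tensor `d` on small inputs; it makes no attempt to match the performance or memory layout of the NEON kernels.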

void run ( )

override virtual

Run the kernels contained in the function.

For NEON kernels:

Note: CPPScheduler::force_number_of_threads() can be used to manually set the number of threads.

For OpenCL kernels:

Note: The function will not block until the kernels are executed. It is the user's responsibility to wait.

Implements IFunction.

The documentation for this class was generated from the following file: