#ifndef __ARM_COMPUTE_CLGEMM_H__
#define __ARM_COMPUTE_CLGEMM_H__

bool _run_vector_matrix_multiplication;
OpenCL kernel which interleaves the elements of a matrix A in chunks of 4x4.
void configure(const ICLTensor *a, const ICLTensor *b, const ICLTensor *c, ICLTensor *output, float alpha, float beta)
Initialise the kernel's inputs and output.
Base class for all functions.
void run() override
Run the kernels contained in the function.
OpenCL kernel which transposes the elements of a matrix in chunks of 1x4 if the input data type is F3...
CLGEMM()
Default constructor.
OpenCL kernel to multiply two input matrices "A" and "B" or to multiply a vector "A" by a matrix "B"...
Basic function to execute GEMM on OpenCL.
Interface for OpenCL tensor.
OpenCL kernel to perform the in-place matrix addition between 2 matrices, taking into account that th...
Basic implementation of the OpenCL tensor interface.
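The `configure` signature above takes three inputs (`a`, `b`, `c`), an `output`, and the scalars `alpha` and `beta`, which matches the usual GEMM definition `output = alpha * A * B + beta * C`. The sketch below is a plain CPU reference for that arithmetic, useful for checking results by hand. It is not the library's implementation: `reference_gemm` is a hypothetical helper, the row-major `std::vector<float>` layout is an assumption, and treating `c` as optional (a null pointer skips the `beta * C` term) is likewise assumed here rather than stated in the listing.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical CPU reference for the GEMM arithmetic; not part of the library.
// A is m x k, B is k x n, C (optional) and the result are m x n, all row-major.
std::vector<float> reference_gemm(const std::vector<float> &a,
                                  const std::vector<float> &b,
                                  const std::vector<float> *c, // assumption: may be nullptr
                                  std::size_t m, std::size_t n, std::size_t k,
                                  float alpha, float beta)
{
    assert(a.size() == m * k);
    assert(b.size() == k * n);
    assert(c == nullptr || c->size() == m * n);

    std::vector<float> out(m * n, 0.0f);
    for(std::size_t i = 0; i < m; ++i)
    {
        for(std::size_t j = 0; j < n; ++j)
        {
            // Dot product of row i of A with column j of B.
            float acc = 0.0f;
            for(std::size_t p = 0; p < k; ++p)
            {
                acc += a[i * k + p] * b[p * n + j];
            }
            out[i * n + j] = alpha * acc;
            // Add the scaled C term only when C was supplied.
            if(c != nullptr)
            {
                out[i * n + j] += beta * (*c)[i * n + j];
            }
        }
    }
    return out;
}
```

Calling the helper with `m == 1` gives the vector-by-matrix case that the `_run_vector_matrix_multiplication` flag appears to distinguish. The interleave and transpose kernels listed above only rearrange data for the OpenCL multiply kernel, so they should not change the values this reference produces.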