#ifndef __ARM_COMPUTE_CLGEMMINTERLEAVE4X4KERNEL_H__
#define __ARM_COMPUTE_CLGEMMINTERLEAVE4X4KERNEL_H__
OpenCL kernel which interleaves the elements of a matrix A in chunks of 4x4.
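To make the 4x4 interleave concrete, here is a minimal CPU reference sketch in plain C++. It is not the OpenCL kernel itself. It assumes that rows are taken in groups of four and that, for each column, the four values from those rows are written next to each other; the function name `interleave4x4`, the row-major `std::vector<float>` layout, and the zero padding of missing rows are all assumptions made for illustration. The effect of `mult_interleave4x4_height` is not modelled.

```cpp
#include <cstddef>
#include <vector>

// Reference (CPU) sketch of a 4x4 interleave on a row-major matrix.
// Assumption: rows are processed in groups of four; for each column, the four
// values of that column are stored contiguously in the output.
// Assumption: rows beyond the end of the matrix are treated as zeros.
std::vector<float> interleave4x4(const std::vector<float> &a, std::size_t rows, std::size_t cols)
{
    const std::size_t blocks = (rows + 3) / 4;       // number of 4-row groups
    std::vector<float> out(blocks * cols * 4, 0.0f); // zero-initialised, which provides the padding

    for (std::size_t b = 0; b < blocks; ++b)
    {
        for (std::size_t c = 0; c < cols; ++c)
        {
            for (std::size_t r = 0; r < 4; ++r)
            {
                const std::size_t src_row = b * 4 + r;
                if (src_row < rows)
                {
                    // Element (src_row, c) lands in slot r of the c-th group of four.
                    out[(b * cols + c) * 4 + r] = a[src_row * cols + c];
                }
            }
        }
    }
    return out;
}
```

Under these assumptions, a 4x4 input whose rows are `a00 a01 a02 a03`, `a10 a11 a12 a13`, and so on becomes the single row `a00 a10 a20 a30 a01 a11 a21 a31 ...`, so each group of four consecutive output values holds one column of the original block.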
void configure(const ICLTensor *input, ICLTensor *output, int mult_interleave4x4_height=1)
Initialise the kernel's input and output.
void run(const Window &window, cl::CommandQueue &queue) override
Enqueue the OpenCL kernel to process the given window on the passed OpenCL command queue...
Store the tensor's metadata.
CLGEMMInterleave4x4Kernel()
Default constructor.
Common interface for all the OpenCL kernels.
This file contains all available output stages for GEMMLowp on OpenCL.
static Status validate(const ITensorInfo *input, const ITensorInfo *output, int mult_interleave4x4_height)
Static function to check if given info will lead to a valid configuration of CLGEMMInterleave4x4Kerne...
CLGEMMInterleave4x4Kernel & operator=(const CLGEMMInterleave4x4Kernel &)=delete
Prevent instances of this class from being copied (As this class contains pointers) ...
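The deleted copy assignment follows a common pattern for classes that hold raw, non-owning pointers. The sketch below illustrates that pattern with a hypothetical class, `PointerHoldingKernel`; it is not the library's class, the `int` pointers merely stand in for tensor pointers, and keeping the move operations enabled is an assumption, since the listing above only shows the deleted copy assignment.

```cpp
#include <type_traits>

// Hypothetical stand-in for a kernel that keeps non-owning pointers to its tensors.
class PointerHoldingKernel
{
public:
    PointerHoldingKernel() = default;
    // A copy would duplicate the raw pointers, so both copy operations are deleted.
    PointerHoldingKernel(const PointerHoldingKernel &) = delete;
    PointerHoldingKernel &operator=(const PointerHoldingKernel &) = delete;
    // Assumption: moves remain allowed.
    PointerHoldingKernel(PointerHoldingKernel &&) = default;
    PointerHoldingKernel &operator=(PointerHoldingKernel &&) = default;

    const int *input() const { return _input; }
    int       *output() const { return _output; }

private:
    const int *_input{ nullptr };  // stand-in for a const tensor pointer
    int       *_output{ nullptr }; // stand-in for a mutable tensor pointer
};

// Compile-time checks that the copy operations are really unavailable.
static_assert(!std::is_copy_constructible<PointerHoldingKernel>::value, "copy construction is disabled");
static_assert(!std::is_copy_assignable<PointerHoldingKernel>::value, "copy assignment is disabled");
static_assert(std::is_move_constructible<PointerHoldingKernel>::value, "move construction is allowed");
```

With this shape, an accidental `kernel_a = kernel_b;` fails at compile time instead of silently producing two objects that point at the same tensors.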
Interface for OpenCL tensor.
const Window & window() const
The maximum window the kernel can be executed on.
Describe a multidimensional execution window.