Interface for the kernel to run an arbitrary size convolution on a tensor. More...

#include <CLConvolutionKernel.h>

Collaboration diagram for CLConvolutionKernel< matrix_size >:

Public Member Functions
void	configure (const ICLTensor input, ICLTensor output, const int16_t *conv, uint32_t scale, bool border_undefined)
	Initialise the kernel's input, output and border mode. More...

BorderSize	border_size () const override
	The size of the border for that kernel. More...

Public Member Functions inherited from ICLSimple2DKernel
void	run (const Window &window, cl::CommandQueue &queue) override
	Enqueue the OpenCL kernel to process the given window on the passed OpenCL command queue. More...

Public Member Functions inherited from ICLSimpleKernel
	ICLSimpleKernel ()
	Constructor. More...

	ICLSimpleKernel (const ICLSimpleKernel &)=delete
	Prevent instances of this class from being copied (As this class contains pointers). More...

ICLSimpleKernel &	operator= (const ICLSimpleKernel &)=delete
	Prevent instances of this class from being copied (As this class contains pointers). More...

	ICLSimpleKernel (ICLSimpleKernel &&)=default
	Allow instances of this class to be moved. More...

ICLSimpleKernel &	operator= (ICLSimpleKernel &&)=default
	Allow instances of this class to be moved. More...

	~ICLSimpleKernel ()=default
	Default destructor. More...

void	configure (const ICLTensor input, ICLTensor output, unsigned int num_elems_processed_per_iteration, bool border_undefined=false, const BorderSize &border_size=BorderSize())
	Configure the kernel. More...

Public Member Functions inherited from ICLKernel
	ICLKernel ()
	Constructor. More...

cl::Kernel &	kernel ()
	Returns a reference to the OpenCL kernel of this object. More...

void	add_1D_tensor_argument (unsigned int &idx, const ICLTensor *tensor, const Window &window)
	Add the passed 1D tensor's parameters to the object's kernel's arguments starting from the index idx. More...

void	add_2D_tensor_argument (unsigned int &idx, const ICLTensor *tensor, const Window &window)
	Add the passed 2D tensor's parameters to the object's kernel's arguments starting from the index idx. More...

void	add_3D_tensor_argument (unsigned int &idx, const ICLTensor *tensor, const Window &window)
	Add the passed 3D tensor's parameters to the object's kernel's arguments starting from the index idx. More...

unsigned int	num_arguments_per_1D_tensor () const
	Returns the number of arguments enqueued per 1D tensor object. More...

unsigned int	num_arguments_per_2D_tensor () const
	Returns the number of arguments enqueued per 2D tensor object. More...

unsigned int	num_arguments_per_3D_tensor () const
	Returns the number of arguments enqueued per 3D tensor object. More...

template<typename T >
void	add_argument (unsigned int &idx, T value)
	Add the passed parameters to the object's kernel's arguments starting from the index idx. More...

Public Member Functions inherited from IKernel
	IKernel ()
	Constructor. More...

virtual	~IKernel ()=default
	Destructor. More...

virtual bool	is_parallelisable () const
	Indicates whether or not the kernel is parallelisable. More...

const Window &	window () const
	The maximum window the kernel can be executed on. More...

Detailed Description

template<unsigned int matrix_size>
class arm_compute::CLConvolutionKernel< matrix_size >

Interface for the kernel to run an arbitrary size convolution on a tensor.

(Currently supports 3x3, 5x5, 7x7 and 9x9). The client can supply a convolution matrix \( C_{m,n} \).

\begin{eqnarray} k_0 &=& \frac{m}{2} \\ l_0 &=& \frac{n}{2} \\ sum &=& \sum_{k=0,l=0}^{k=m-1,l=n-1} input(x+k-k_0, y+l-l_0) C_{k,l} \end{eqnarray}

Note: The above equation for this function is similar to the default OpenCV Filter2D function, which actually computes a correlation and not a convolution. In case of a real convolution the convolution matrix should be flipped both horizontally and vertically.

Definition at line 52 of file CLConvolutionKernel.h.

Member Function Documentation

BorderSize border_size ( ) const

overridevirtual

The size of the border for that kernel.

Returns: The width in number of elements of the border.

Reimplemented from IKernel.

void configure	(	const ICLTensor *	input,
		ICLTensor *	output,
		const int16_t *	conv,
		uint32_t	scale,
		bool	border_undefined
	)

Initialise the kernel's input, output and border mode.

Parameters

[in]	input	Source tensor. Data types supported: U8.
[out]	output	Destination tensor, Data types supported: U8, S16.
[in]	conv	Convolution matrix to apply to the input tensor.
[in]	scale	Scale of the convolution matrix. If 0 is passed, it will be set to the sum of the coefficients of the convolution or 1 if they add up to 0.
[in]	border_undefined	True if the border mode is undefined. False if it's replicate or constant.

The documentation for this class was generated from the following file:

arm_compute/core/CL/kernels/CLConvolutionKernel.h

Public Member Functions

Detailed Description

template<unsigned int matrix_size> class arm_compute::CLConvolutionKernel< matrix_size >

Member Function Documentation

template<unsigned int matrix_size>
class arm_compute::CLConvolutionKernel< matrix_size >