Basic function to simulate a convolution layer. More...

Collaboration diagram for NEWinogradConvolutionLayer:

Public Member Functions
	NEWinogradConvolutionLayer (std::shared_ptr< IMemoryManager > memory_manager=nullptr)
	Constructor. More...

void	configure (const ITensor input, const ITensor weights, const ITensor biases, ITensor output, const PadStrideInfo &conv_info, const ActivationLayerInfo &act_info=ActivationLayerInfo(), bool enable_fast_math=false)
	Set the input and output tensors. More...

void	run () override
	Run the kernels contained in the function. More...

	NEWinogradConvolutionLayer (const NEWinogradConvolutionLayer &)=delete
	Prevent instances of this class from being copied (As this class contains pointers) More...

NEWinogradConvolutionLayer &	operator= (const NEWinogradConvolutionLayer &)=delete
	Prevent instances of this class from being copied (As this class contains pointers) More...

Public Member Functions inherited from IFunction
virtual	~IFunction ()=default
	Destructor. More...

virtual void	prepare ()
	Prepare the function for executing. More...

Static Public Member Functions
static Status	validate (const ITensorInfo input, const ITensorInfo weights, const ITensorInfo biases, const ITensorInfo output, const PadStrideInfo &conv_info, const ActivationLayerInfo &act_info=ActivationLayerInfo(), bool enable_fast_math=false)
	Static function to check if given info will lead to a valid configuration of NEGEMMConvolutionLayer. More...

Detailed Description

Basic function to simulate a convolution layer.

This function calls the following NEON kernels:

Note: Some Winograd configurations (i.e. F(2x2, 5x5), F(4x4, 5x5)) are supported only with enable_fast_math = true

Constructor & Destructor Documentation

NEWinogradConvolutionLayer ( std::shared_ptr< IMemoryManager > memory_manager = nullptr )

Constructor.

NEWinogradConvolutionLayer ( const NEWinogradConvolutionLayer & )

delete

Prevent instances of this class from being copied (As this class contains pointers)

void configure	(	const ITensor *	input,
		const ITensor *	weights,
		const ITensor *	biases,
		ITensor *	output,
		const PadStrideInfo &	conv_info,
		const ActivationLayerInfo &	act_info = `ActivationLayerInfo()`,
		bool	enable_fast_math = `false`
	)

Set the input and output tensors.

Parameters

[in]	input	Source tensor. 3 lower dimensions represent a single input [width, height, IFM], while every optional dimension from 4 and above represent a batch of inputs. Data types supported: F32.
[in]	weights	Weights tensor. Weights are 4D tensor with dimensions [kernel_x, kernel_y, IFM, OFM]. Data type supported: Same as `input`. Currently only 3x3 and 5x5 kernels are supported.
[in]	biases	Biases tensor. Shared biases supported. Biases are 1D tensor with dimensions [OFM]. Data type supported: Same as `weights`.
[out]	output	Destination tensor. 3 lower dimensions represent a single output [width, height, OFM], while the rest represent batch of outputs. Data types supported: Same as `input`.
[in]	conv_info	Contains padding and stride information described in PadStrideInfo. Currently only unit strides are supported.
[in]	act_info	(Optional) Activation layer information in case of a fused activation.
[in]	enable_fast_math	(Optional) Enable fast math computation. In case this flag were set, the function could dispatch the fastest implementation available which may introduce a drop of accuracy as well. Default is false

NEWinogradConvolutionLayer& operator= ( const NEWinogradConvolutionLayer & )

delete

Prevent instances of this class from being copied (As this class contains pointers)

void run ( )

overridevirtual

Run the kernels contained in the function.

For NEON kernels:

Note: CPPScheduler::set_num_threads() can be used to manually set the number of threads

For OpenCL kernels:

Note: The function will not block until the kernels are executed. It is the user's responsibility to wait.; Will call prepare() on first run if hasn't been done

Implements IFunction.

static Status validate	(	const ITensorInfo *	input,
		const ITensorInfo *	weights,
		const ITensorInfo *	biases,
		const ITensorInfo *	output,
		const PadStrideInfo &	conv_info,
		const ActivationLayerInfo &	act_info = `ActivationLayerInfo()`,
		bool	enable_fast_math = `false`
	)

static

Static function to check if given info will lead to a valid configuration of NEGEMMConvolutionLayer.

Parameters

[in]	input	Source tensor. 3 lower dimensions represent a single input [width, height, IFM], while every optional dimension from 4 and above represent a batch of inputs. Data types supported: F32.
[in]	weights	Weights tensor. Weights are 4D tensor with dimensions [kernel_x, kernel_y, IFM, OFM]. Data type supported:Same as `input`. Currently only 3x3 and 5x5 kernels are supported.
[in]	biases	Biases tensor. Shared biases supported. Biases are 1D tensor with dimensions [OFM]. Data type supported: Same as `weights`.
[in]	output	Destination tensor. 3 lower dimensions represent a single output [width, height, OFM], while the rest represent batch of outputs. Data types supported: Same as `input`.
[in]	conv_info	Contains padding and stride information described in PadStrideInfo. Currently only unit strides are supported.
[in]	act_info	(Optional) Activation layer information in case of a fused activation.
[in]	enable_fast_math	(Optional) Enable fast math computation. In case this flag were set, the function could dispatch the fastest implementation available which may introduce a drop of accuracy as well. Default is false

The documentation for this class was generated from the following file: