#ifndef __ARM_COMPUTE_CLFULLYCONNECTEDLAYER_H__
#define __ARM_COMPUTE_CLFULLYCONNECTEDLAYER_H__
bool _transpose_weights;
bool _batched_fc_layer;
bool _accumulate_biases;
OpenCL kernel which interleaves the elements of a matrix A in chunks of 4x4.
void configure(const ICLTensor *input, const ICLTensor *weights, const ICLTensor *biases, ICLTensor *output, bool transpose_weights=true)
Set the input and output tensors.
Basic function to compute a Fully Connected layer on OpenCL.
Base class for all functions.
OpenCL kernel which transposes the elements of a matrix in chunks of 1x4 if the input data type is F3...
Interface for the im2col reshape kernel.
OpenCL kernel to multiply two input matrices "A" and "B" or to multiply a vector "A" by a matrix "B"...
Interface to add a bias to each row of the input tensor.
void run() override
Run the kernels contained in the function.
Interface for OpenCL tensor.
OpenCL kernel which transposes the elements of a matrix.
CLFullyConnectedLayer()
Constructor.
Basic implementation of the OpenCL tensor interface.
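
The descriptions above mention a fully connected layer built from a matrix multiplication, an optional weight transposition, and a bias added to each row. As a rough illustration of that arithmetic only, here is a minimal CPU sketch for a single input vector. It is not the library's OpenCL implementation: the function name `fully_connected_reference`, its parameters, the row-major layout, and the meaning given to `weights_are_transposed` are all assumptions made for this example.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical CPU reference for one input vector; not part of the library.
// Weights are stored row-major:
//   weights_are_transposed == false: shape [num_outputs x num_inputs]
//   weights_are_transposed == true : shape [num_inputs x num_outputs]
std::vector<float> fully_connected_reference(const std::vector<float> &input,
                                             const std::vector<float> &weights,
                                             const std::vector<float> &biases,
                                             std::size_t num_inputs,
                                             std::size_t num_outputs,
                                             bool weights_are_transposed = false)
{
    assert(input.size() == num_inputs);
    assert(weights.size() == num_inputs * num_outputs);
    assert(biases.size() == num_outputs);

    std::vector<float> output(num_outputs, 0.0f);
    for (std::size_t o = 0; o < num_outputs; ++o)
    {
        // Start from the bias, then accumulate the dot product for output o.
        float acc = biases[o];
        for (std::size_t i = 0; i < num_inputs; ++i)
        {
            const float w = weights_are_transposed ? weights[i * num_outputs + o]
                                                   : weights[o * num_inputs + i];
            acc += w * input[i];
        }
        output[o] = acc;
    }
    return output;
}
```

For example, calling it with input `{1, 2, 3}`, a 2x3 weight matrix `{1, 0, 0, 0, 1, 1}` and biases `{0.5, -1}` yields two outputs, one per weight row. Passing the same weights in transposed storage with `weights_are_transposed = true` gives the same result, which is the equivalence a transpose step is meant to preserve.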