COMPMID-3851: Fix regression on NEDepthwiseConvolutionLayerNativeKernel
authorSang-Hoon Park <sang-hoon.park@arm.com>
Thu, 12 Nov 2020 17:41:32 +0000 (17:41 +0000)
committerSang-Hoon Park <sang-hoon.park@arm.com>
Fri, 13 Nov 2020 11:02:13 +0000 (11:02 +0000)
commit1a0a4bc78a12e85e1bd6b3207f244c91566ebdce
treeb2807e01171f47b4394476a111007167f54b4e38
parent412b789582c5992431028e9b91c4d8b99d5c4900
COMPMID-3851: Fix regression on NEDepthwiseConvolutionLayerNativeKernel

The exit condition of some for loops in quantized version
of the kernel with depth_multiplier=1 is decided
during compilation to fix performance issue.

Change-Id: I849b3d63b2a2cf5eb374ae681898ae1c296fb4fe
Signed-off-by: Sang-Hoon Park <sang-hoon.park@arm.com>
Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/4392
Tested-by: Arm Jenkins <bsgcomp@arm.com>
Reviewed-by: Michele Di Giorgio <michele.digiorgio@arm.com>
Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
src/core/NEON/kernels/NEDepthwiseConvolutionLayerNativeKernel.cpp