COMPMID-3741: Remove OpenCL padding: CLWinogradOutputTransformKernel
- Refactor the OpenCL kernels for the Winograd output transform (NHWC)
to remove the padding requirement
- The kernels adopt the reverse store approach to avoid out-of-bounds
writes
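Illustration of the reverse store idea, as a minimal host-side C sketch
rather than the actual OpenCL kernel code. It assumes "reverse store"
means clamping each row index to the last valid row and writing the rows
from last to first, so the in-bounds value is written last and survives.
The names store_tile_reverse and TILE_ROWS are hypothetical.

```c
#include <assert.h>

#define TILE_ROWS 4 /* hypothetical number of rows produced per work-item */

/* Write a TILE_ROWS-row output tile into dst, starting at row y0.
 * Assumption: y0 itself is in bounds; only the trailing rows may overflow.
 * Rows past the end are clamped to the last valid row instead of being
 * written out of bounds. Iterating from the last row to the first means
 * that any clamped (duplicate) writes land first and the row that really
 * belongs at that index is written last, so no padding is needed. */
static void store_tile_reverse(float *dst, int height, int y0,
                               const float tile[TILE_ROWS])
{
    assert(height > 0 && y0 >= 0 && y0 < height);

    for (int i = TILE_ROWS - 1; i >= 0; --i) {
        int y = y0 + i;
        if (y > height - 1) {
            y = height - 1; /* clamp to the last valid row */
        }
        dst[y] = tile[i];
    }
}
```

With height 6 and y0 4, rows 6 and 7 of the tile are clamped onto row 5
and then overwritten by the value that belongs to row 5. Storing in
forward order would instead leave the last clamped value in row 5.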
Change-Id: If9aad20354ff2146f57ead07ba0aaadb3df919f9
Signed-off-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/4222
Tested-by: Arm Jenkins <bsgcomp@arm.com>
Reviewed-by: Giorgio Arena <giorgio.arena@arm.com>
Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>