fix comment

author Forrest Iandola <fiandola@gmail.com>

Mon, 16 Oct 2017 07:44:24 +0000 (00:44 -0700)

committer GitHub <noreply@github.com>

Mon, 16 Oct 2017 07:44:24 +0000 (00:44 -0700)
author Forrest Iandola <fiandola@gmail.com>
Mon, 16 Oct 2017 07:44:24 +0000 (00:44 -0700)
committer GitHub <noreply@github.com>
Mon, 16 Oct 2017 07:44:24 +0000 (00:44 -0700)
diff --git a/src/core/NEON/kernels/NEDirectConvolutionLayerKernel.cpp b/src/core/NEON/kernels/NEDirectConvolutionLayerKernel.cpp

index 2766d698d91a9a13e36cf2fa4bb0c2e86cff82d2..66d6d1fd40df03bac046749922cfe8fbf23737a5 100644 (file)
--- a/src/core/NEON/kernels/NEDirectConvolutionLayerKernel.cpp
+++ b/src/core/NEON/kernels/NEDirectConvolutionLayerKernel.cpp
@@ -1082,7 +1082,7 @@ public:
                      the third thread [16,24] and the fourth thread [25,31].
  
                      The algorithm outer loop iterates over Z, P, Y, X where P is the depth/3rd dimension of each kernel. This order is not arbitrary, the main benefit of this
-                    is that we setup the neon registers containing the kernerl's values only once and then compute each XY using the preloaded registers as opposed as doing this for every XY value.
+                    is that we setup the neon registers containing the kernel's values only once and then compute each XY using the preloaded registers as opposed as doing this for every XY value.
  
                      The algorithm does not require allocating any additional memory amd computes the results directly in-place in two stages:
                          1) Convolve plane 0 with kernel 0 and initialize the corresponding output plane with these values.
author	Forrest Iandola <fiandola@gmail.com>
	Mon, 16 Oct 2017 07:44:24 +0000 (00:44 -0700)
committer	GitHub <noreply@github.com>
	Mon, 16 Oct 2017 07:44:24 +0000 (00:44 -0700)