Update Neon™ pooling kernel

- Reduce duplication and simplify overall structure.
- Improve multi-threaded performance by sharing more data
  in lower-level caches.

Partially Resolves: COMPMID-5054
Signed-off-by: Ramy Elgammal <ramy.elgammal@arm.com>
Change-Id: I5f4dc50913401d5c1cbfc10b866fae9490cbc4d7
Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/7404
Tested-by: Arm Jenkins <bsgcomp@arm.com>
Reviewed-by: Andrew Mundy
Reviewed-by: Sheri Zhang <sheri.zhang@arm.com>
Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
diff --git a/Android.bp b/Android.bp
index c19d0d4..17c56f3 100644
--- a/Android.bp
+++ b/Android.bp
@@ -297,6 +297,7 @@
         "src/core/NEON/kernels/NEStackLayerKernel.cpp",
         "src/core/NEON/kernels/NEStridedSliceKernel.cpp",
         "src/core/NEON/kernels/NETileKernel.cpp",
+        "src/core/NEON/kernels/arm_conv/addressing.cpp",
         "src/core/NEON/kernels/arm_conv/depthwise/depthwise_fp16.cpp",
         "src/core/NEON/kernels/arm_conv/depthwise/depthwise_fp32.cpp",
         "src/core/NEON/kernels/arm_conv/depthwise/depthwise_s8q.cpp",