commit | fe7ae817755577be29f4c07aa27d8ef9e821da45 | [log] [tgz] |
---|---|---|
author | Pablo Marquez Tello <pablo.tello@arm.com> | Wed Mar 03 12:12:35 2021 +0000 |
committer | Pablo Marquez Tello <pablo.tello@arm.com> | Mon Apr 19 15:02:29 2021 +0000 |
tree | 459b1b22f59cf5144cd72b839fbfdf21fa341479 | |
parent | 60c3b0e6821a80d78ffca5be30e05d062d071cd2 [diff] |
CLInstanceNormalizationLayer NHWC optimisation * Make changes to split the workload into two kernels. One kernel precomputes mean and variance and the second kernel just loads these precomputed values. * The new approach runs %30 faster than the original code for NHWC workloads like 32x192x256. * Resolves MLCE-337 Change-Id: I8356fcefa2d131ab4dcb32268ce7142421d073e4 Signed-off-by: Pablo Marquez Tello <pablo.tello@arm.com> Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/5355 Tested-by: Arm Jenkins <bsgcomp@arm.com> Comments-Addressed: Arm Jenkins <bsgcomp@arm.com> Reviewed-by: Manuel Bottini <manuel.bottini@arm.com> Reviewed-by: Michele Di Giorgio <michele.digiorgio@arm.com>
Release repository: https://github.com/arm-software/ComputeLibrary
Development repository: https://review.mlplatform.org/#/admin/projects/ml/ComputeLibrary
Please report issues here: https://github.com/ARM-software/ComputeLibrary/issues
Make sure you are using the latest version of the library before opening an issue. Thanks
News:
Related projects:
Tutorials:
Documentation (API, changelogs, build guide, contribution guide, errata, etc.) available at https://github.com/ARM-software/ComputeLibrary/wiki/Documentation.
Binaries available at https://github.com/ARM-software/ComputeLibrary/releases.
Arm® CPUs:
Arm® Mali™ GPUs:
x86
The software is provided under MIT license. Contributions to this project are accepted under the same license.
For technical discussion, the ComputeLibrary project has a public mailing list: acl-dev@lists.linaro.org The list is open to anyone inside or outside of Arm to self subscribe. In order to subscribe, please visit the following website: https://lists.linaro.org/mailman/listinfo/acl-dev
Before the ComputeLibrary project accepts your contribution, you need to certify its origin and give us your permission. To manage this process we use the Developer Certificate of Origin (DCO) V1.1 (https://developercertificate.org/)
To indicate that you agree to the the terms of the DCO, you "sign off" your contribution by adding a line with your name and e-mail address to every git commit message:
Signed-off-by: John Doe <john.doe@example.org>
You must use your real name, no pseudonyms or anonymous contributions are accepted.
Android is a trademark of Google LLC.
Arm, Cortex, Mali and Neon are registered trademarks or trademarks of Arm Limited (or its subsidiaries) in the US and/or elsewhere.
Linux® is the registered trademark of Linus Torvalds in the U.S. and other countries.
Mac and macOS are trademarks of Apple Inc., registered in the U.S. and other countries.
Tizen is a registered trademark of The Linux Foundation.