- ab538a2 Use lookup table for Fp16 Tanh activation in hardware with SVE by Gunes Bayir · 6 weeks ago
- f5053f7 Update logic in the OpenMP scheduler to exclude LITTLE cores by Omar Al Khatib · 8 weeks ago
- b4b61a6 Rework CpuQuantizeKernel to enable FP16 in multi_isa builds by Ramy Elgammal · 7 weeks ago
- 21fb2ad Fix ReductionLayer FP16 for armv8a multi_isa builds by Ramy Elgammal · 8 weeks ago
- c1575b2 Add SME2 implementation of Softmax for QASYMM8 and QASYMM8_SIGNED. by Omar Al Khatib · 2 months ago
- 301e33f Add fp16 and integer data type support for ScatterNd in Gpu by Gunes Bayir · 9 weeks ago
- ada3200 Add update/index/output (m+1)/2d/(m+n) support for CLScatter by Gunes Bayir · 2 months ago
- 5057ce9 Update documentation for 24.04 release by Michael Kozlov · 3 months ago
- a668f9f Add s8f32 kernels and dynamic QuantizationInfo by Jonathan Deakin · 5 months ago
- cfca87b Add SME2 implementation of softmax for FP16 by Gunes Bayir · 3 months ago
- f1f1f87 Add in place summation to CPU GEMM kernels by Radu Salavat · 4 months ago
- 77bbe2e Add SME2 implementation of softmax for FP32 by Viet-Hoa Do · 7 months ago
- 36a75da [ONCPUML-1451] Add matmul kernel to enable bf16 to bf16 operations via PyTorch® autocast() function by Renato Arantes · 5 months ago
- 93e743f Optimize CpuSoftmaxKernel for axis != 0 and neon kernels by Omar Al Khatib · 6 months ago
- d0611c1 Update documentation for 24.02.1 release by Felix Thomasmathibalan · 4 months ago
- 9167c9c Prefer indirect Gemm vs. Direct convolution if supported by Gunes Bayir · 4 months ago
- ef63739 Integrate new pretranspose_b_array with extra fused transpose of B by Gunes Bayir · 5 months ago
- d98e27e Update documentation for 24.02 release by Felix Thomasmathibalan · 5 months ago
- 2db938c Parallelize CPU depthwise over batch if only 1 row by Jonathan Deakin · 5 months ago
- 9b72a6c Add scripts to generate Doxygen documentation by Jakub Sujak · 7 months ago
- c7f550d Improved documentation by Pablo Marquez Tello · 6 months ago
- 7467ba8 Use look up table for fp16 activation by Mohammed Suhail Munshi · 7 months ago
- 85cafff Add Mali™-G720 and Mali™-G620 as GpuTargets by Gunes Bayir · 7 months ago
- 4737094 Optimize CPU depth-to-space by Viet-Hoa Do · 8 months ago
- fadc9b1 Optimize CpuSoftmaxKernel for axis=0 by Gunes Bayir · 8 months ago
- 9f7aca9 Changes to enable FP16 in armv8a multi_isa by Pablo Marquez Tello · 11 months ago
- e30c874 Remove the legacy core library by Jakub Sujak · 8 months ago
- 38eb5fb Update Release notes for 23.11 by Anitha Raj · 8 months ago
- 667e82f Update list of supported operators in documentation by Jakub Sujak · 8 months ago
- a63ece7 Document how to build ACL with LLVM+Clang toolchain by Gunes Bayir · 8 months ago
- c5ab4df Optimize CpuGemmConv2d start-up time by SiCong Li · 9 months ago
- a7ddd60 Add support for Arm® Cortex®-A520 and Arm® Cortex®-R82 by Viet-Hoa Do · 8 months ago
- 704c22f [GPU] Update Reverse layer to allow negative axis and reversed axis order by Adnan AlSinan · 8 months ago
- fde45d8 Extend CKW MatMul with nt_t by Adnan AlSinan · 8 months ago
- 29254ae Optimize CL softmax by Viet-Hoa Do · 9 months ago
- c210c85 Optimize CL reduction operation by Viet-Hoa Do · 9 months ago
- 0b72aa4 Optimize NEStackLayer by Gunes Bayir · 9 months ago
- a23b468 Optimize CLTranspose operator by Jakub Sujak · 9 months ago
- a04ae3e Port DepthwiseConv2d operator to Ckw by ramy.elgammal@arm.com · 11 months ago
- bdcb4c1 Implement tflite compliant reverse for CPU by Adnan AlSinan · 10 months ago
- 500e10b Add CL command buffer class by Viet-Hoa Do · 10 months ago
- 40a9d3e Remove deprecated support for BF16 in CpuCast by Adnan AlSinan · 10 months ago
- e87fa66 Add skeleton of ClMatMulLowpNativeMMULKernel by Gunes Bayir · 10 months ago
- 0d27b2e Remove legacy PostOps code by Jakub Sujak · 11 months ago
- 2e6d659 Port ClTemplatePool2d to ckw by Adnan AlSinan · 11 months ago
- 91cb733 Port Resize operator to CKW by Gunes Bayir · 12 months ago
- 1b2ee3e CPU: Depthwise: Generate correct size for input indirection array. by David Mansell · 11 months ago
- eb5696d Optimize CpuReshapeKernel by Anitha Raj · 12 months ago
- f62eacd Update errata by ramy.elgammal@arm.com · 11 months ago
- 633ebd1 Update OpenCL headers to v2023.04.17 by Viet-Hoa Do · 11 months ago
- ebb91a2 Update Readme for 23.08 by ramy.elgammal@arm.com · 11 months ago
- 11b23f7 Update Release Notes by ramy.elgammal@arm.com · 11 months ago
- 29e27b0 Add support for S64 output in NEArgMinMaxLayer by Pablo Marquez Tello · 11 months ago
- 66b4a6a Setup pre-commit and include code formatting scripts by Gunes Bayir · 1 year ago
- e1c96e7 Port DirectConv2d to CKW backend by Jakub Sujak · 11 months ago
- 78ce273 Document the Conv2D heuristic by Gian Marco Iodice · 11 months ago
- 0c19f59 Fix CL Tile operator by Viet-Hoa Do · 11 months ago
- 9129549 Retain back-compatibility for arm_compute/core/Types.h by SiCong Li · 12 months ago
- 205ba24 Added S64/U64 support for the input in CLCast by Pablo Marquez Tello · 12 months ago
- 4c0a38a Disable kernel size 3 in argminmax for axis 0 by Pablo Marquez Tello · 12 months ago
- 019a7d9 Enable transpose convolution with non-square kernels by Viet-Hoa Do · 1 year ago
- 2f0ef00 Update README for patch release 23.05.1 by ramy.elgammal@arm.com · 1 year ago
- c952596 Implement FP32/FP16 MatMul NT/T kernel using the MMUL extension by Ramy Elgammal · 1 year, 2 months ago
- a2561f0 Fix doxygen warnings by ramy.elgammal@arm.com · 1 year, 1 month ago
- 8eb82d2 Fix CPU depthwise convolution in case of large padding by Viet-Hoa Do · 1 year ago
- 59b9ff0 Deprecate legacy libarm_compute_core by Jakub Sujak · 1 year, 1 month ago
- bea74ca Raise an erratum about sporadic CLArgMinMax nightly failure by SiCong Li · 1 year, 2 months ago
- 32a6250 Update release notes for the 23.05 release. by Omar Al Khatib · 1 year, 2 months ago
- 911d572 [scons multi_isa] extend multi_isa build to support armv8-a march by Sunita Nadampalli · 1 year, 2 months ago
- cd8b40d Guards to make NEReorder aarch64 only by David Svantesson · 1 year, 2 months ago
- b5d6c28 Bazel and CMake updates by David Svantesson · 1 year, 2 months ago
- d7113e4 Removes `experimental` from `experimental_fixed_format_kernels` flag by Nathan John Sircombe · 1 year, 2 months ago
- f0ff76d Add Reorder to changelog by David Svantesson · 1 year, 2 months ago
- 99145f7 Update contibutor guidelines with rules for header guards - Includes an example by Mohammed Suhail Munshi · 1 year, 3 months ago
- fff9a4c Add Cropping to CLBatchToSpace by Omar Al Khatib · 1 year, 3 months ago
- 1fad9f2 Update standard to include a section about when to use const by SiCong Li · 1 year, 3 months ago
- 8893e45 Add cropping support to NEBatchToSpace by SiCong Li · 1 year, 3 months ago
- 22e7613 Update release log for 23.02.1 patch release by Jakub Sujak · 1 year, 4 months ago
- 4537089 Fixes for CMake and Bazel builds, tests failing in scons by David Svantesson · 1 year, 4 months ago
- 117e17e Add Microsoft Windows® trademarks by Jakub Sujak · 1 year, 4 months ago
- 9eefd4b Update release version and change log documentation by Jakub Sujak · 1 year, 5 months ago
- 06db85e Update release version and change log documentation by Jakub Sujak · 1 year, 5 months ago
- 8ee9425 Remove legacy dynamic fusion example from documentation by Jakub Sujak · 1 year, 5 months ago
- 90e5720 Update recommended NDK to r20b in the documentation by SiCong Li · 1 year, 5 months ago
- e0c42ef Bazel and CMake builds Resolves: ONCPUML-1110, ONCPUML-1109 by David Svantesson · 1 year, 7 months ago
- 03b2971 Integrate SME2 kernels by Viet-Hoa Do · 2 years, 1 month ago
- 31df05a Remove dynamic fusion prototype with tests and examples by SiCong Li · 1 year, 8 months ago
- 38ac410 Fix documentation about BF16 acceleration by Viet-Hoa Do · 1 year, 8 months ago
- 293ab60 Fix release notes for 22.11 by Viet-Hoa Do · 1 year, 8 months ago
- b1f8288 Add release notes for 22.11 by Viet-Hoa Do · 1 year, 8 months ago
- c8cc024 Adding documentation section explaining how BF16 is used by Ramy Elgammal · 1 year, 9 months ago
- a9acd5f Use https to embed MathJax to documentation by Viet-Hoa Do · 1 year, 9 months ago
- c4f2743 Optimize Quantized/Integer Bilinear Scale for Neon™ by Gunes Bayir · 1 year, 10 months ago
- fa8ff8e Fix note in guidelines doc by Ramy Elgammal · 1 year, 11 months ago
- 966218d Update release notes about armv8.6 build flag change by Ramy Elgammal · 1 year, 11 months ago
- 0d274b7 Update README by Ramy Elgammal · 1 year, 11 months ago
- 992a741 Update Errata by Ramy Elgammal · 1 year, 11 months ago
- ab659ad Updated documentation by Pablo Marquez Tello · 2 years ago
- 3964f17 Remove data extraction scripts by Pablo Marquez Tello · 2 years ago
- a3f7cd2 Move build option explanations in how to build guide to scons help message by Michalis Spyrou · 2 years ago