1. ada3200 Add update/index/output (m+1)/2d/(m+n) support for CLScatter by Gunes Bayir · 6 weeks ago
  2. 5057ce9 Update documentation for 24.04 release by Michael Kozlov · 7 weeks ago
  3. a668f9f Add s8f32 kernels and dynamic QuantizationInfo by Jonathan Deakin · 4 months ago
  4. cfca87b Add SME2 implementation of softmax for FP16 by Gunes Bayir · 8 weeks ago
  5. f1f1f87 Add in place summation to CPU GEMM kernels by Radu Salavat · 3 months ago
  6. 77bbe2e Add SME2 implementation of softmax for FP32 by Viet-Hoa Do · 6 months ago
  7. 36a75da [ONCPUML-1451] Add matmul kernel to enable bf16 to bf16 operations via PyTorch® autocast() function by Renato Arantes · 4 months ago
  8. 93e743f Optimize CpuSoftmaxKernel for axis != 0 and neon kernels by Omar Al Khatib · 5 months ago
  9. d0611c1 Update documentation for 24.02.1 release by Felix Thomasmathibalan · 3 months ago
  10. 9167c9c Prefer indirect Gemm vs. Direct convolution if supported by Gunes Bayir · 3 months ago
  11. ef63739 Integrate new pretranspose_b_array with extra fused transpose of B by Gunes Bayir · 4 months ago
  12. d98e27e Update documentation for 24.02 release by Felix Thomasmathibalan · 4 months ago
  13. 2db938c Parallelize CPU depthwise over batch if only 1 row by Jonathan Deakin · 4 months ago
  14. 7467ba8 Use look up table for fp16 activation by Mohammed Suhail Munshi · 6 months ago
  15. 85cafff Add Mali™-G720 and Mali™-G620 as GpuTargets by Gunes Bayir · 6 months ago
  16. 4737094 Optimize CPU depth-to-space by Viet-Hoa Do · 7 months ago
  17. fadc9b1 Optimize CpuSoftmaxKernel for axis=0 by Gunes Bayir · 7 months ago
  18. 9f7aca9 Changes to enable FP16 in armv8a multi_isa by Pablo Marquez Tello · 10 months ago
  19. e30c874 Remove the legacy core library by Jakub Sujak · 7 months ago
  20. 38eb5fb Update Release notes for 23.11 by Anitha Raj · 7 months ago
  21. c5ab4df Optimize CpuGemmConv2d start-up time by SiCong Li · 8 months ago
  22. a7ddd60 Add support for Arm® Cortex®-A520 and Arm® Cortex®-R82 by Viet-Hoa Do · 7 months ago
  23. 704c22f [GPU] Update Reverse layer to allow negative axis and reversed axis order by Adnan AlSinan · 7 months ago
  24. fde45d8 Extend CKW MatMul with nt_t by Adnan AlSinan · 7 months ago
  25. 29254ae Optimize CL softmax by Viet-Hoa Do · 8 months ago
  26. c210c85 Optimize CL reduction operation by Viet-Hoa Do · 8 months ago
  27. 0b72aa4 Optimize NEStackLayer by Gunes Bayir · 8 months ago
  28. a23b468 Optimize CLTranspose operator by Jakub Sujak · 8 months ago
  29. a04ae3e Port DepthwiseConv2d operator to Ckw by ramy.elgammal@arm.com · 10 months ago
  30. bdcb4c1 Implement tflite compliant reverse for CPU by Adnan AlSinan · 9 months ago
  31. 500e10b Add CL command buffer class by Viet-Hoa Do · 9 months ago
  32. 40a9d3e Remove deprecated support for BF16 in CpuCast by Adnan AlSinan · 9 months ago
  33. e87fa66 Add skeleton of ClMatMulLowpNativeMMULKernel by Gunes Bayir · 9 months ago
  34. 0d27b2e Remove legacy PostOps code by Jakub Sujak · 9 months ago
  35. 2e6d659 Port ClTemplatePool2d to ckw by Adnan AlSinan · 10 months ago
  36. 91cb733 Port Resize operator to CKW by Gunes Bayir · 10 months ago
  37. 1b2ee3e CPU: Depthwise: Generate correct size for input indirection array. by David Mansell · 9 months ago
  38. eb5696d Optimize CpuReshapeKernel by Anitha Raj · 11 months ago
  39. 633ebd1 Update OpenCL headers to v2023.04.17 by Viet-Hoa Do · 10 months ago
  40. 11b23f7 Update Release Notes by ramy.elgammal@arm.com · 10 months ago
  41. 29e27b0 Add support for S64 output in NEArgMinMaxLayer by Pablo Marquez Tello · 10 months ago
  42. e1c96e7 Port DirectConv2d to CKW backend by Jakub Sujak · 10 months ago
  43. 0c19f59 Fix CL Tile operator by Viet-Hoa Do · 10 months ago
  44. 205ba24 Added S64/U64 support for the input in CLCast by Pablo Marquez Tello · 11 months ago
  45. 019a7d9 Enable transpose convolution with non-square kernels by Viet-Hoa Do · 11 months ago
  46. 2f0ef00 Update README for patch release 23.05.1 by ramy.elgammal@arm.com · 11 months ago
  47. c952596 Implement FP32/FP16 MatMul NT/T kernel using the MMUL extension by Ramy Elgammal · 1 year ago
  48. 8eb82d2 Fix CPU depthwise convolution in case of large padding by Viet-Hoa Do · 12 months ago
  49. 59b9ff0 Deprecate legacy libarm_compute_core by Jakub Sujak · 12 months ago
  50. 32a6250 Update release notes for the 23.05 release. by Omar Al Khatib · 1 year, 1 month ago
  51. cd8b40d Guards to make NEReorder aarch64 only by David Svantesson · 1 year, 1 month ago
  52. f0ff76d Add Reorder to changelog by David Svantesson · 1 year, 1 month ago
  53. fff9a4c Add Cropping to CLBatchToSpace by Omar Al Khatib · 1 year, 2 months ago
  54. 8893e45 Add cropping support to NEBatchToSpace by SiCong Li · 1 year, 2 months ago
  55. 22e7613 Update release log for 23.02.1 patch release by Jakub Sujak · 1 year, 3 months ago
  56. 117e17e Add Microsoft Windows® trademarks by Jakub Sujak · 1 year, 3 months ago
  57. 9eefd4b Update release version and change log documentation by Jakub Sujak · 1 year, 4 months ago
  58. 06db85e Update release version and change log documentation by Jakub Sujak · 1 year, 4 months ago
  59. 90e5720 Update recommended NDK to r20b in the documentation by SiCong Li · 1 year, 4 months ago
  60. 293ab60 Fix release notes for 22.11 by Viet-Hoa Do · 1 year, 7 months ago
  61. b1f8288 Add release notes for 22.11 by Viet-Hoa Do · 1 year, 7 months ago
  62. fa8ff8e Fix note in guidelines doc by Ramy Elgammal · 1 year, 10 months ago
  63. 966218d Update release notes about armv8.6 build flag change by Ramy Elgammal · 1 year, 10 months ago
  64. 0d274b7 Update README by Ramy Elgammal · 1 year, 10 months ago
  65. 13f96d0 Fix armv7a on Android "end-of-support" build and documentation by SiCong Li · 2 years ago
  66. 14ce094 Drop support on armv7a on Android by Michalis Spyrou · 2 years ago
  67. fe1b1f6 Allow clearer suppression of toolchain and compiler prefixes by SiCong Li · 2 years ago
  68. 2921e5b Update README by Adnan AlSinan · 2 years, 1 month ago
  69. bb8b235 Fix Doxygen build by Adnan AlSinan · 2 years, 4 months ago
  70. 149203b Port MaxUnpoolingLayer kernel and add KernelSelect vaidation test by Dana Zlotnik · 2 years, 4 months ago
  71. 69854ba Update Release Notes by Adnan AlSinan · 2 years, 4 months ago
  72. abc093b Inclusive language update by Adnan AlSinan · 2 years, 4 months ago
  73. 6863fa0 Remove deprecated remap functions. by Adnan AlSinan · 2 years, 4 months ago
  74. b2eba7f Replacing non-inclusive terms with proper terms by ramelg01 · 2 years, 5 months ago
  75. 0877370 Update changelog and toolchain information/versions by Gunes Bayir · 2 years, 7 months ago
  76. 5dda217 DirectConv3d support refine by Sheri Zhang · 2 years, 8 months ago
  77. b6af482 Per-operator build dependencies by Georgios Pinitas · 2 years, 9 months ago
  78. 77014ff Update Release Notes by Freddie Liardet · 2 years, 10 months ago
  79. d87aded Port CLGEMMConvolutionLayer by Manuel Bottini · 2 years, 11 months ago
  80. d844c08 Port CLIm2ColKernel to ClIm2ColKernel by Manuel Bottini · 2 years, 11 months ago
  81. 29599d0 Port NEGEMMConvolutionLayer by Manuel Bottini · 2 years, 11 months ago
  82. 7b23732 Port CLCol2ImKernel to ClCol2ImKernel by Manuel Bottini · 2 years, 11 months ago
  83. 96f977e Port NEWinogradConvolutionLayer by Michalis Spyrou · 3 years ago
  84. 24b8920 Port NECol2ImKernel by Manuel Bottini · 3 years ago
  85. cfac51c Port NEGEMMLowp Part 2 by Manuel Bottini · 3 years ago
  86. 9002899 Port NEIm2ColKernel by Manuel Bottini · 3 years ago
  87. 8155c02 Rework OpenCL Depthwise Convolution by Gian Marco Iodice · 3 years, 2 months ago
  88. c63b722 Revert "Rework OpenCL Depthwise Convolution" by Gian Marco Iodice · 3 years ago
  89. 4a578b9 Port the ClGemmLowp kernels to the new API by Georgios Pinitas · 3 years ago
  90. 53832b2 Port NEGEMM to memory injecting interface (Part 2) by Michele Di Giorgio · 3 years ago
  91. 93b75e0 Port NEGEMM to memory injecting interface (Part 1) by Michele Di Giorgio · 3 years ago
  92. 561c176 Rework OpenCL Depthwise Convolution by Gian Marco Iodice · 3 years, 2 months ago
  93. ae58bdf Port NEGEMMLowp Part 1 by Manuel Bottini · 3 years ago
  94. c6f4ec3 Port CLWinogradConvolutionLayer with ClWinogradConv2d by Manuel Bottini · 3 years, 1 month ago
  95. ee301b3 Fix errata in documentation by Jakub Sujak · 3 years ago
  96. 11d8415 Port DepthConvert to new Api by Georgios Pinitas · 3 years, 1 month ago
  97. 856f66e Port CLGEMM to memory injecting interface by Georgios Pinitas · 3 years, 1 month ago
  98. c2bed95 Update release note by Sheri Zhang · 3 years, 1 month ago
  99. d813bab Restructure documentation by Sheri Zhang · 3 years, 1 month ago[Renamed (61%) from docs/00_introduction.dox]
  100. 562bee5 Port CpuConvertFullyConnectedWeights to new API by Teresa Charlin · 3 years, 2 months ago