1. cfca87b Add SME2 implementation of softmax for FP16 by Gunes Bayir · 3 months ago
  2. f1f1f87 Add in place summation to CPU GEMM kernels by Radu Salavat · 4 months ago
  3. 1322065 Specify absolute tolerance by Sangwon Ha · 3 months ago
  4. 473b829 Adds Tests and reference implementation for scatter operator with 1D tensors. by Mohammed Suhail Munshi · 3 months ago
  5. 4908981 [ONCPUML-1451] Guard bf16 to bf16 tests with ARM_COMPUTE_ENABLE_FIXED_FORMAT_KERNELS by Renato Arantes · 3 months ago
  6. 7b3adf2 Fix for nightly build failures for android by Mohammed Suhail Munshi · 3 months ago
  7. 8609ca0 Add skeleton for CLScatter op, reference and tests by Mohammed Suhail Munshi · 4 months ago
  8. 36a75da [ONCPUML-1451] Add matmul kernel to enable bf16 to bf16 operations via PyTorch® autocast() function by Renato Arantes · 5 months ago
  9. d219115 Make Cpu/Gpu/Ref scalar/vectoral S32 division consistent by Gunes Bayir · 3 months ago
  10. 1618e95 Increase tolerance_num of Cpu RNNLayer tests by Gunes Bayir · 3 months ago
  11. 43ba0dd Increase MatMul and DilatedConv test Q8 thresholds to 1 by Gunes Bayir · 3 months ago
  12. 3ac0b87 Fix validation in pool2d assembly wrapper by Pablo Marquez Tello · 4 months ago
  13. 9167c9c Prefer indirect Gemm vs. Direct convolution if supported by Gunes Bayir · 4 months ago
  14. e77736f Set int8 test tolerance in FullyConnected to int8 by Gunes Bayir · 4 months ago
  15. 0a48c4c Requantization cases for offset changes only by Mohammed Suhail Munshi · 5 months ago
  16. 9469058 Fix linker errors in validation suite for WoA by Pablo Marquez Tello · 4 months ago
  17. 8528134 Fix validation suite on WoA by Pablo Marquez Tello · 4 months ago
  18. 0cba93f [QTest] Use dynamic output quantization in Depthwise Conv tests by Omar Al Khatib · 5 months ago
  19. 8614077 Disable some DirectConv2d tests in Dynamic Fusion by Gunes Bayir · 5 months ago
  20. 0e73498 Add support for QSYMM8 in ClCastKernel by Pablo Marquez Tello · 5 months ago
  21. 0ee13af Remove CKW prototype and Template Writer by Gunes Bayir · 5 months ago
  22. a3e1b50 Fix the bug in GpuTanh operator in dynamic fusion by Gunes Bayir · 5 months ago
  23. 8050d22 Disable FP16 tests compilation on Multi-Isa v8a by Mohammed Suhail Munshi · 5 months ago
  24. 2b9fa59 Use the stable CKW API in the GPU dynamic fusion backend by Gunes Bayir · 5 months ago
  25. 2aec5f1 Fix tolerance issue in BF16 MatMul tests by Gunes Bayir · 5 months ago
  26. fdf56fb Make GpuWorkloadContext own all tensor info objects by Viet-Hoa Do · 5 months ago
  27. c5df0c6 Fix test compilation error on GCC 13.2 by Jakub Sujak · 6 months ago
  28. 11ab451 Implement dynamic quantization for GEMMLowp tests by SiCong Li · 8 months ago
  29. 85cafff Add Mali™-G720 and Mali™-G620 as GpuTargets by Gunes Bayir · 6 months ago
  30. e37c318 Fix Run Example in Validate Tests by Mohammed Suhail Munshi · 7 months ago
  31. 6b5a361 Adjust NEReduceMean test tolerance by SiCong Li · 7 months ago
  32. fadc9b1 Optimize CpuSoftmaxKernel for axis=0 by Gunes Bayir · 8 months ago
  33. e30c874 Remove the legacy core library by Jakub Sujak · 8 months ago
  34. c63f8b0 Update comments to suppress doxygen warnings. by Anitha Raj · 8 months ago
  35. c5ab4df Optimize CpuGemmConv2d start-up time by SiCong Li · 8 months ago
  36. 449af40 Fix Elementwise Division Dynamic Shape tests by Anitha Raj · 8 months ago
  37. 02c452f Add Dynamic Quantization tests to Fully Connected Layer by Mohammed Suhail Munshi · 8 months ago
  38. c259aa5 Increase tolerance for MatMul in FP16 by Sangwon Ha · 8 months ago
  39. 704c22f [GPU] Update Reverse layer to allow negative axis and reversed axis order by Adnan AlSinan · 8 months ago
  40. 8f4b3df Fix clang-tidy errors by Jakub Sujak · 8 months ago
  41. 93a77cd Use dynamic quantization in Convolution and Dilated Convolution tests by Gunes Bayir · 9 months ago
  42. fde45d8 Extend CKW MatMul with nt_t by Adnan AlSinan · 8 months ago
  43. dfcd41a Use dynamic quantization in OpenCL™ Direct Convolution tests by Gunes Bayir · 9 months ago
  44. 4ea9bac Fix memory Error in Reverse Fixture. by Adnan AlSinan · 9 months ago
  45. 95d477e Remove padding from CL comparison operator by Viet-Hoa Do · 9 months ago
  46. 0b72aa4 Optimize NEStackLayer by Gunes Bayir · 9 months ago
  47. c6137d2 Optimize CLDeconvolutionLayer tests by Gunes Bayir · 9 months ago
  48. 3af4c9b Optimize CL and Neon depthwise convolution tests by Gunes Bayir · 9 months ago
  49. a23b468 Optimize CLTranspose operator by Jakub Sujak · 9 months ago
  50. 3831111 Change MatMul Native MMUL Kernel tests tolerance value by Adnan AlSinan · 9 months ago
  51. a04ae3e Port DepthwiseConv2d operator to Ckw by ramy.elgammal@arm.com · 11 months ago
  52. 745153b NEDeconvolutionLayer validation fix by Pablo Marquez Tello · 9 months ago
  53. 0a99c79 Fix nightly NEON Reverse reference failure by Adnan AlSinan · 9 months ago
  54. ebce280 Fix MacOS compilation error by Jakub Sujak · 9 months ago
  55. d82f797 Fix Nightly failing validation tests in NEON Reverse by Adnan AlSinan · 9 months ago
  56. c2a51bd Optimize CL and Neon Winograd tests by Gunes Bayir · 9 months ago
  57. a396da1 Implement Quantized Matmul T/T and T/Nt kernels using MMUL extension by Gunes Bayir · 9 months ago
  58. 2ad0a6b Implement Quantized Matmul Nt/T kernel using MMUL extension by Gunes Bayir · 9 months ago
  59. bdcb4c1 Implement tflite compliant reverse for CPU by Adnan AlSinan · 9 months ago
  60. e071b5e Fix the validation issue in AddMulAdd fused kernel by Gunes Bayir · 9 months ago
  61. 532ce2c Separate the output quantization calculation logic from matmul by Gunes Bayir · 10 months ago
  62. a116cd3 Implement Quantized MatMul kernel using MMUL extension by Gunes Bayir · 10 months ago
  63. 40a9d3e Remove deprecated support for BF16 in CpuCast by Adnan AlSinan · 10 months ago
  64. e87fa66 Add skeleton of ClMatMulLowpNativeMMULKernel by Gunes Bayir · 10 months ago
  65. e57eea3 Disable CKW ElementwiseBinary tests in Dynamic Fusion by Jakub Sujak · 10 months ago
  66. c85edf1 Make zip and combine variadic by Viet-Hoa Do · 10 months ago
  67. b566b6e Extend Neon ReshapeLayer validation tests by Anitha Raj · 10 months ago
  68. 0d27b2e Remove legacy PostOps code by Jakub Sujak · 10 months ago
  69. 2e6d659 Port ClTemplatePool2d to ckw by Adnan AlSinan · 10 months ago
  70. 91cb733 Port Resize operator to CKW by Gunes Bayir · 11 months ago
  71. b1fcb41 Disable NEArgMinMaxLayer RunSmall_F32_S64 for armv7a by Pablo Marquez Tello · 10 months ago
  72. eb5696d Optimize CpuReshapeKernel by Anitha Raj · 12 months ago
  73. 6075097 Remove functionality to add padding in Y dimension in validation tests by Anitha Raj · 11 months ago
  74. 29e27b0 Add support for S64 output in NEArgMinMaxLayer by Pablo Marquez Tello · 11 months ago
  75. 78da34c Fix failure in MeanReduce layer by Viet-Hoa Do · 11 months ago
  76. e1c96e7 Port DirectConv2d to CKW backend by Jakub Sujak · 11 months ago
  77. 0c19f59 Fix CL Tile operator by Viet-Hoa Do · 11 months ago
  78. 4cb0bd4 Improved testing for ArgMinMax by Pablo Marquez Tello · 11 months ago
  79. 16b3752 Port ElementwiseBinary to CKW part 2 by SiCong Li · 11 months ago
  80. 9129549 Retain back-compatibility for arm_compute/core/Types.h by SiCong Li · 11 months ago
  81. 9662ac0 Add missing tests for CLCast by Pablo Marquez Tello · 11 months ago
  82. 23882a9 Add GpuKernelArgumentBinding for runtime argument setting by SiCong Li · 12 months ago
  83. 0a59e69 Fix problem with exception handling in CPPScheduler by Matthew Bentham · 11 months ago
  84. 4a1c917 Add support for input S64/U64 in CpuCastKernel by Pablo Marquez Tello · 11 months ago
  85. 314d3e2 Break up core/Utils.h to reduce unused code being included everywhere by Matthew Bentham · 1 year ago
  86. 4184e86 Port ClTemplateActivation into Ckw by Adnan AlSinan · 12 months ago
  87. a5577db Fix dynamic fusion compilation error by Viet-Hoa Do · 12 months ago
  88. 205ba24 Added S64/U64 support for the input in CLCast by Pablo Marquez Tello · 12 months ago
  89. 945b8da Make test fixture setup methods not be templated by Matthew Bentham · 12 months ago
  90. 653b96c Improved Argminmax testing by Pablo Marquez Tello · 12 months ago
  91. 8e2dede Add Bias to MatMul Kernels and add support for use in Fully Connected Layer by Mohammed Suhail Munshi · 1 year ago
  92. 019a7d9 Enable transpose convolution with non-square kernels by Viet-Hoa Do · 1 year ago
  93. c9eeee5 Fix nightly failures in MatMulLowpNativeKernel when using bounded activation functions by Mohammed Suhail Munshi · 12 months ago
  94. 00474e9 Implement FP32/16 MatMul Lhs T Rhs T/NT kernel using MMUL extension by Gunes Bayir · 1 year ago
  95. a2bb80e Use MatMul in fully connected layer with dynamic weights when supported by Mohammed Suhail Munshi · 1 year ago
  96. c952596 Implement FP32/FP16 MatMul NT/T kernel using the MMUL extension by Ramy Elgammal · 1 year, 1 month ago
  97. a2561f0 Fix doxygen warnings by ramy.elgammal@arm.com · 1 year ago
  98. 90d15b9 Bazel and CMake optional fp16 support by David Svantesson · 1 year, 1 month ago
  99. 8eb82d2 Fix CPU depthwise convolution in case of large padding by Viet-Hoa Do · 1 year ago
  100. a8d8058 Implement FP32/FP16 MatMul NT/NT kernel using the MMUL extension by SiCong Li · 1 year, 1 month ago