  1. 8609ca0 Add skeleton for CLScatter op, reference and tests by Mohammed Suhail Munshi · 4 months ago
  2. 36a75da [ONCPUML-1451] Add matmul kernel to enable bf16 to bf16 operations via PyTorch® autocast() function by Renato Arantes · 5 months ago
  3. d219115 Make Cpu/Gpu/Ref scalar/vectoral S32 division consistent by Gunes Bayir · 4 months ago
  4. 1618e95 Increase tolerance_num of Cpu RNNLayer tests by Gunes Bayir · 4 months ago
  5. 43ba0dd Increase MatMul and DilatedConv test Q8 thresholds to 1 by Gunes Bayir · 4 months ago
  6. 3ac0b87 Fix validation in pool2d assembly wrapper by Pablo Marquez Tello · 4 months ago
  7. 9167c9c Prefer indirect Gemm vs. Direct convolution if supported by Gunes Bayir · 4 months ago
  8. e77736f Set int8 test tolerance in FullyConnected to int8 by Gunes Bayir · 4 months ago
  9. 0a48c4c Requantization cases for offset changes only by Mohammed Suhail Munshi · 5 months ago
  10. 9469058 Fix linker errors in validation suite for WoA by Pablo Marquez Tello · 5 months ago
  11. 8528134 Fix validation suite on WoA by Pablo Marquez Tello · 5 months ago
  12. 0cba93f [QTest] Use dynamic output quantization in Depthwise Conv tests by Omar Al Khatib · 5 months ago
  13. 8614077 Disable some DirectConv2d tests in Dynamic Fusion by Gunes Bayir · 5 months ago
  14. 0e73498 Add support for QSYMM8 in ClCastKernel by Pablo Marquez Tello · 5 months ago
  15. 0ee13af Remove CKW prototype and Template Writer by Gunes Bayir · 5 months ago
  16. a3e1b50 Fix the bug in GpuTanh operator in dynamic fusion by Gunes Bayir · 5 months ago
  17. 8050d22 Disable FP16 tests compilation on Multi-Isa v8a by Mohammed Suhail Munshi · 5 months ago
  18. 2b9fa59 Use the stable CKW API in the GPU dynamic fusion backend by Gunes Bayir · 6 months ago
  19. 2aec5f1 Fix tolerance issue in BF16 MatMul tests by Gunes Bayir · 6 months ago
  20. fdf56fb Make GpuWorkloadContext own all tensor info objects by Viet-Hoa Do · 6 months ago
  21. c5df0c6 Fix test compilation error on GCC 13.2 by Jakub Sujak · 6 months ago
  22. 11ab451 Implement dynamic quantization for GEMMLowp tests by SiCong Li · 8 months ago
  23. 85cafff Add Mali™-G720 and Mali™-G620 as GpuTargets by Gunes Bayir · 7 months ago
  24. e37c318 Fix Run Example in Validate Tests by Mohammed Suhail Munshi · 7 months ago
  25. 6b5a361 Adjust NEReduceMean test tolerance by SiCong Li · 7 months ago
  26. fadc9b1 Optimize CpuSoftmaxKernel for axis=0 by Gunes Bayir · 8 months ago
  27. e30c874 Remove the legacy core library by Jakub Sujak · 8 months ago
  28. c63f8b0 Update comments to suppress doxygen warnings. by Anitha Raj · 8 months ago
  29. c5ab4df Optimize CpuGemmConv2d start-up time by SiCong Li · 9 months ago
  30. 449af40 Fix Elementwise Division Dynamic Shape tests by Anitha Raj · 8 months ago
  31. 02c452f Add Dynamic Quantization tests to Fully Connected Layer by Mohammed Suhail Munshi · 8 months ago
  32. c259aa5 Increase tolerance for MatMul in FP16 by Sangwon Ha · 8 months ago
  33. 704c22f [GPU] Update Reverse layer to allow negative axis and reversed axis order by Adnan AlSinan · 9 months ago
  34. 8f4b3df Fix clang-tidy errors by Jakub Sujak · 8 months ago
  35. 93a77cd Use dynamic quantization in Convolution and Dilated Convolution tests by Gunes Bayir · 9 months ago
  36. fde45d8 Extend CKW MatMul with nt_t by Adnan AlSinan · 9 months ago
  37. dfcd41a Use dynamic quantization in OpenCL™ Direct Convolution tests by Gunes Bayir · 9 months ago
  38. 4ea9bac Fix memory Error in Reverse Fixture. by Adnan AlSinan · 9 months ago
  39. 95d477e Remove padding from CL comparison operator by Viet-Hoa Do · 9 months ago
  40. 0b72aa4 Optimize NEStackLayer by Gunes Bayir · 9 months ago
  41. c6137d2 Optimize CLDeconvolutionLayer tests by Gunes Bayir · 9 months ago
  42. 3af4c9b Optimize CL and Neon depthwise convolution tests by Gunes Bayir · 9 months ago
  43. a23b468 Optimize CLTranspose operator by Jakub Sujak · 9 months ago
  44. 3831111 Change MatMul Native MMUL Kernel tests tolerance value by Adnan AlSinan · 9 months ago
  45. a04ae3e Port DepthwiseConv2d operator to Ckw by ramy.elgammal@arm.com · 12 months ago
  46. 745153b NEDeconvolutionLayer validation fix by Pablo Marquez Tello · 9 months ago
  47. 0a99c79 Fix nightly NEON Reverse reference failure by Adnan AlSinan · 9 months ago
  48. ebce280 Fix MacOS compilation error by Jakub Sujak · 9 months ago
  49. d82f797 Fix Nightly failing validation tests in NEON Reverse by Adnan AlSinan · 9 months ago
  50. c2a51bd Optimize CL and Neon Winograd tests by Gunes Bayir · 9 months ago
  51. a396da1 Implement Quantized Matmul T/T and T/Nt kernels using MMUL extension by Gunes Bayir · 10 months ago
  52. 2ad0a6b Implement Quantized Matmul Nt/T kernel using MMUL extension by Gunes Bayir · 10 months ago
  53. bdcb4c1 Implement tflite compliant reverse for CPU by Adnan AlSinan · 10 months ago
  54. e071b5e Fix the validation issue in AddMulAdd fused kernel by Gunes Bayir · 10 months ago
  55. 532ce2c Separate the output quantization calculation logic from matmul by Gunes Bayir · 10 months ago
  56. a116cd3 Implement Quantized MatMul kernel using MMUL extension by Gunes Bayir · 10 months ago
  57. 40a9d3e Remove deprecated support for BF16 in CpuCast by Adnan AlSinan · 10 months ago
  58. e87fa66 Add skeleton of ClMatMulLowpNativeMMULKernel by Gunes Bayir · 10 months ago
  59. e57eea3 Disable CKW ElementwiseBinary tests in Dynamic Fusion by Jakub Sujak · 10 months ago
  60. c85edf1 Make zip and combine variadic by Viet-Hoa Do · 10 months ago
  61. b566b6e Extend Neon ReshapeLayer validation tests by Anitha Raj · 11 months ago
  62. 0d27b2e Remove legacy PostOps code by Jakub Sujak · 11 months ago
  63. 2e6d659 Port ClTemplatePool2d to ckw by Adnan AlSinan · 11 months ago
  64. 91cb733 Port Resize operator to CKW by Gunes Bayir · 12 months ago
  65. b1fcb41 Disable NEArgMinMaxLayer RunSmall_F32_S64 for armv7a by Pablo Marquez Tello · 10 months ago
  66. eb5696d Optimize CpuReshapeKernel by Anitha Raj · 12 months ago
  67. 6075097 Remove functionality to add padding in Y dimension in validation tests by Anitha Raj · 11 months ago
  68. 29e27b0 Add support for S64 output in NEArgMinMaxLayer by Pablo Marquez Tello · 11 months ago
  69. 78da34c Fix failure in MeanReduce layer by Viet-Hoa Do · 11 months ago
  70. e1c96e7 Port DirectConv2d to CKW backend by Jakub Sujak · 11 months ago
  71. 0c19f59 Fix CL Tile operator by Viet-Hoa Do · 11 months ago
  72. 4cb0bd4 Improved testing for ArgMinMax by Pablo Marquez Tello · 12 months ago
  73. 16b3752 Port ElementwiseBinary to CKW part 2 by SiCong Li · 12 months ago
  74. 9129549 Retain back-compatibility for arm_compute/core/Types.h by SiCong Li · 12 months ago
  75. 9662ac0 Add missing tests for CLCast by Pablo Marquez Tello · 12 months ago
  76. 23882a9 Add GpuKernelArgumentBinding for runtime argument setting by SiCong Li · 1 year ago
  77. 0a59e69 Fix problem with exception handling in CPPScheduler by Matthew Bentham · 12 months ago
  78. 4a1c917 Add support for input S64/U64 in CpuCastKernel by Pablo Marquez Tello · 12 months ago
  79. 314d3e2 Break up core/Utils.h to reduce unused code being included everywhere by Matthew Bentham · 1 year ago
  80. 4184e86 Port ClTemplateActivation into Ckw by Adnan AlSinan · 12 months ago
  81. a5577db Fix dynamic fusion compilation error by Viet-Hoa Do · 12 months ago
  82. 205ba24 Added S64/U64 support for the input in CLCast by Pablo Marquez Tello · 12 months ago
  83. 945b8da Make test fixture setup methods not be templated by Matthew Bentham · 12 months ago
  84. 653b96c Improved Argminmax testing by Pablo Marquez Tello · 12 months ago
  85. 8e2dede Add Bias to MatMul Kernels and add support for use in Fully Connected Layer by Mohammed Suhail Munshi · 1 year ago
  86. 019a7d9 Enable transpose convolution with non-square kernels by Viet-Hoa Do · 1 year ago
  87. c9eeee5 Fix nightly failures in MatMulLowpNativeKernel when using bounded activation functions by Mohammed Suhail Munshi · 1 year ago
  88. 00474e9 Implement FP32/16 MatMul Lhs T Rhs T/NT kernel using MMUL extension by Gunes Bayir · 1 year, 1 month ago
  89. a2bb80e Use MatMul in fully connected layer with dynamic weights when supported by Mohammed Suhail Munshi · 1 year, 1 month ago
  90. c952596 Implement FP32/FP16 MatMul NT/T kernel using the MMUL extension by Ramy Elgammal · 1 year, 2 months ago
  91. a2561f0 Fix doxygen warnings by ramy.elgammal@arm.com · 1 year, 1 month ago
  92. 90d15b9 Bazel and CMake optional fp16 support by David Svantesson · 1 year, 1 month ago
  93. 8eb82d2 Fix CPU depthwise convolution in case of large padding by Viet-Hoa Do · 1 year, 1 month ago
  94. a8d8058 Implement FP32/FP16 MatMul NT/NT kernel using the MMUL extension by SiCong Li · 1 year, 2 months ago
  95. 94abde4 Add Fused Activation to OpenCL MatMul by Mohammed Suhail Munshi · 1 year, 1 month ago
  96. f1aeab9 Break up arm_compute/core/Types.h a bit by Matthew Bentham · 1 year, 1 month ago
  97. 3fcf3dc Add multi-sketch support for dynamic fusion by Viet-Hoa Do · 1 year, 2 months ago
  98. 48cfd5f Refactor activation LUT computation by Pablo Marquez Tello · 1 year, 1 month ago
  99. 1355ec4 Printing out the rerun command of each failed testcase by Ramy Elgammal · 1 year, 2 months ago
  100. 95f1e4a Raise abs_tolerance number for CL/DirectConvolution3D fp16 tests by Ramy Elgammal · 1 year, 2 months ago