  1. 0e21236 Multi-Dimensional and Batched Scatter Reference and Dataset Implementation. by Mohammed Suhail Munshi · 3 months ago
  2. 7377107 Scatter GPU Kernel Implementation for 1D tensors. by Mohammed Suhail Munshi · 3 months ago
  3. f1f1f87 Add in place summation to CPU GEMM kernels by Radu Salavat · 4 months ago
  4. 473b829 Adds Tests and reference implementation for scatter operator with 1D tensors. by Mohammed Suhail Munshi · 3 months ago
  5. 8609ca0 Add skeleton for CLScatter op, reference and tests by Mohammed Suhail Munshi · 4 months ago
  6. 9167c9c Prefer indirect Gemm vs. Direct convolution if supported by Gunes Bayir · 4 months ago
  7. 11ab451 Implement dynamic quantization for GEMMLowp tests by SiCong Li · 8 months ago
  8. 3af4c9b Optimize CL and Neon depthwise convolution tests by Gunes Bayir · 9 months ago
  9. a04ae3e Port DepthwiseConv2d operator to Ckw by ramy.elgammal@arm.com · 11 months ago
  10. c2a51bd Optimize CL and Neon Winograd tests by Gunes Bayir · 9 months ago
  11. a116cd3 Implement Quantized MatMul kernel using MMUL extension by Gunes Bayir · 10 months ago
  12. b566b6e Extend Neon ReshapeLayer validation tests by Anitha Raj · 10 months ago
  13. 8eb82d2 Fix CPU depthwise convolution in case of large padding by Viet-Hoa Do · 1 year ago
  14. a8d8058 Implement FP32/FP16 MatMul NT/NT kernel using the MMUL extension by SiCong Li · 1 year, 1 month ago
  15. cd2502c Fix invalid vector length in CL by Viet-Hoa Do · 1 year, 2 months ago
  16. cdd1e03 Support multi-dimensional indices in the CL Gather Layer up to four-dimensional output tensors by Omar Al Khatib · 1 year, 2 months ago
  17. 3b162e5 Reorder added by David Svantesson · 1 year, 3 months ago
  18. 9c7c2d2 Add quantized support for CPU MatMul by Viet-Hoa Do · 1 year, 3 months ago
  19. 5a7d157 Fix BatchToSpaceFixture by SiCong Li · 1 year, 3 months ago
  20. b531b75 Add Texture Pipe Support for Matmul Lhs T/NT Rhs T kernels by Ramy Elgammal · 1 year, 3 months ago
  21. bbeef72 Add Texture Pipe Support for Matmul Lhs T/NT Rhs NT kernels by Gunes Bayir · 1 year, 3 months ago
  22. 8918b23 Implement OpenCL MatMul for Lhs T Rhs T/NT FP32/16 by Gunes Bayir · 1 year, 3 months ago
  23. 2b6ebfe Implement OpenCL MatMul for Lhs NT Rhs T/NT FP32/16 by Ramy Elgammal · 1 year, 4 months ago
  24. 37c989a Add support for arbitrary parameters for CPU Gather by Viet-Hoa Do · 1 year, 4 months ago
  25. a18d85c Dynamic Fusion Pooling Layer 2d by Mohammed Suhail Munshi · 1 year, 6 months ago
  26. 1b6377b Add broadcast batched matmul validation cases by SiCong Li · 1 year, 6 months ago
  27. b3077fb LHS broadcasting addition for dynamic fusion by Viet-Hoa Do · 1 year, 6 months ago
  28. 8ae5714 Add Resize/Scale operator to Dynamic Fusion interface by Jakub Sujak · 1 year, 7 months ago
  29. 404462a Adding GpuAdd to dynamic fusion operators by Ramy Elgammal · 1 year, 8 months ago
  30. 97a609b Fix GemmLowp BatchMatMul Tests to use quantized Outputs by Mohammed Suhail Munshi · 1 year, 8 months ago
  31. ec0113d Optimize Transposed Convolution for CL backend (FP32/16) by Gunes Bayir · 1 year, 8 months ago
  32. ead4d11 Fix unresolved symbol for target armv7a + Android by Pablo Marquez Tello · 1 year, 9 months ago
  33. a77c6d7 Add test for ClGemmLowpMatrixMultiplyCore to test a batched matrix multiplication with variable input tensors by Ramy Elgammal · 1 year, 10 months ago
  34. 622b8ad Fix bug in QASYMM8_SIGNED to F32 cast layer by Viet-Hoa Do · 1 year, 10 months ago
  35. c4f2743 Optimize Quantized/Integer Bilinear Scale for Neon™ by Gunes Bayir · 1 year, 10 months ago
  36. 926f502 Adding GELU activation by Murray Kornelsen · 2 years ago
  37. 26c9d1a Add test for NEGEMM to test a batched matrix multiplication with variable input tensors by Adnan AlSinan · 1 year, 10 months ago
  38. 13a2d00 Add test for CLGEMM to test a batched matrix multiplication with variable input tensors by Mohammed Suhail Munshi · 1 year, 10 months ago
  39. 894659a Add support for 2d and 3d indices for axis 1 by Pablo Marquez Tello · 2 years, 2 months ago
  40. dc4f276 Revert "Add support for 2d and 3d indices for axis 0" by Mohammed Suhail Munshi · 2 years, 2 months ago
  41. 920f2b6 Add support for 2d and 3d indices for axis 0 by Pablo Marquez Tello · 2 years, 2 months ago
  42. 3751569 Implementation of ClPooling3d by ramelg01 · 2 years, 4 months ago
  43. 3611852 Fix precision issue in ChannelShuffleKernel by Pablo Marquez Tello · 2 years, 8 months ago
  44. 6d9c982 Conv3d support by Sheri Zhang · 2 years, 9 months ago
  45. 8b8405a Optimize CpuScale NHWC F32/F16 by Gian Marco Iodice · 2 years, 9 months ago
  46. cc171f9 Provide tests for fusing pad layer in graph API by Gunes Bayir · 2 years, 10 months ago
  47. 9dc558f Review all shapes in datasets to account for padding removal Part 1 by Gian Marco Iodice · 3 years, 8 months ago
  48. 37a4611 Add test for validating batched-GEMM on CPU by Gian Marco Iodice · 2 years, 11 months ago
  49. a387e27 Add in-place calculation support for CL elementwise arithmetic kernels by Sheri Zhang · 3 years ago
  50. 5e28181 Fix manual LOOP_UNROLLING by Gian Marco Iodice · 3 years ago
  51. 8155c02 Rework OpenCL Depthwise Convolution by Gian Marco Iodice · 3 years, 2 months ago
  52. bc4e311 Implement FP GPU depthwise convolution 1x1 kernel for in-place computation by SiCongLi · 3 years ago
  53. 5f6fdc1 Rework gemmlowp reshaped_only_rhs using the new macros by Giorgio Arena · 3 years, 1 month ago
  54. d02d5ed Integrate improved CPU depthwise convolution kernels by Michele Di Giorgio · 3 years, 5 months ago
  55. ed4b8a0 Fix MeanStdDevNormalizationLayer reference outputting nan for FP16 by Giorgio Arena · 3 years, 2 months ago
  56. afcbb8f Fix Pooling Layer Bug when input is 1xN size by Freddie Liardet · 3 years, 2 months ago
  57. fd83bc8 Fix GEMMLowp output stage validation crash when input's first dimension == 1 by Giorgio Arena · 3 years, 2 months ago
  58. 223aba9 Remove stale/solved TODOs by Michele Di Giorgio · 3 years, 2 months ago
  59. bff2f9f Remove unused yolo_layer OpenCL kernel by Michele Di Giorgio · 3 years, 2 months ago
  60. e5e61bd Fix NeDepthwiseConvolution bad_alloc issue by Sheri Zhang · 3 years, 2 months ago
  61. 37d65e4 Review all shapes in datasets to account for padding removal Part 2 by SiCong Li · 3 years, 8 months ago
  62. ca62c6f Mixed data-layout testing on high priority operators by Manuel Bottini · 3 years, 3 months ago
  63. 33f41fa Fix trademarks throughout the codebase by Michele Di Giorgio · 3 years, 4 months ago
  64. ac6499a Comply with Trademark rules for use of Neon, Arm and Mali by Sheri Zhang · 3 years, 5 months ago
  65. 0841ca0 Fix OpenCL direct convolution by Gian Marco Iodice · 3 years, 5 months ago
  66. d556d7b Integrate improved pooling layer on NEON by Michele Di Giorgio · 3 years, 8 months ago
  67. 7e5b7bf Review all shapes in datasets to account for padding removal Part 3 by SiCong Li · 3 years, 7 months ago
  68. 00e86cd Add new shapes in WinogradOutputTransform dataset by Giorgio Arena · 3 years, 7 months ago
  69. 2567adf Add new shapes to WinogradInputTransform dataset and fix border size for NCHW data layout by Giorgio Arena · 3 years, 7 months ago
  70. 15bc848 [Review Shape] CLDepthwiseConvolutionLayer mismatches by Giorgio Arena · 3 years, 7 months ago
  71. 1e2af2a COMPMID-3712 Remove OpenCL padding: CLDepthwiseConvolutionLayer3x3NHWCKernel FP16/32 by Giorgio Arena · 3 years, 8 months ago
  72. 5304884 COMPMID-3704 Remove OpenCL padding: CLBatchConcatenateLayerKernel by Giorgio Arena · 3 years, 9 months ago
  73. d304adb COMPMID-3703 Remove OpenCL padding: CLActivationLayerKernel + create utility macro by Giorgio Arena · 3 years, 9 months ago
  74. fc2f6d0 COMPMID-3145: Remove padding from NEScaleKernel by Manuel Bottini · 3 years, 10 months ago
  75. 2d22139 COMPMID-3154: Deprecate NE/CL LocallyConnectedLayer by Georgios Pinitas · 3 years, 10 months ago
  76. d9eaf61 COMPMID-3324: Adjusting capitalization of Arm copyright claim to reflect Arm preferred presentation by Michele Di Giorgio · 4 years ago
  77. 1902383 Revert "COMPMID-3480: Perform in-place computations in NEArithmeticAdditionKernel" by Michele Di Giorgio · 4 years ago
  78. 27a9e4f COMPMID-3364: improve the test suite for Scale on OpenCL by Sang-Hoon Park · 4 years, 1 month ago
  79. 4a61653 COMPMID-3480: Perform in-place computations in NEArithmeticAdditionKernel by Michele Di Giorgio · 4 years, 1 month ago
  80. e37662a COMPMID-3128: Test improvement for GEMMConvolutionLayer on CL and NEON(Function-level) by Michele Di Giorgio · 4 years, 2 months ago
  81. 8b2a7d3 COMPMID-3101 Fuse activation with floating point elementwise operation layers in CL by Giorgio Arena · 4 years, 5 months ago
  82. 2e5fd63 COMPMID-3098 Fuse Relu and Bounded Relu with FullyConnected NEON by SiCongLi · 4 years, 4 months ago
  83. 80838f1 COMPMID-2819: Avoid overflows in pooling shape calculation by Georgios Pinitas · 4 years, 7 months ago
  84. 11fedda COMPMID-2985 add data_layout to PoolingLayerInfo by Sang-Hoon Park · 4 years, 5 months ago
  85. cbbed28 COMPMID-2991: Add support for QASYMM8_SIGNED in CL kernels/functions - part 2 by Michele Di Giorgio · 4 years, 6 months ago
  86. 959c26d COMPMID-2790: Add support for QASYMM8_SIGNED in CLGEMMLowpMatrixMultiplyCore by Manuel Bottini · 4 years, 7 months ago
  87. ec00da1 COMPMID-2728: Add support for split sizes in CLSplit by Kurtis Charnock · 4 years, 7 months ago
  88. aadf846 COMPMID-2819 Fix depthwise reference when using ceil by Giorgio Arena · 4 years, 6 months ago
  89. 1d89777 COMPMID-2819 Disable GEMM MatrixMultiply nightly tests by Michalis Spyrou · 4 years, 7 months ago
  90. f464337 COMPMID-2826 Comply with DCL51-CPP by Michalis Spyrou · 4 years, 7 months ago
  91. 951b8a4 COMPMID-2309 : CLConvolutionLayer: support QUANT8_SYMM_PER_CHANNEL filters by Vidhya Sudhan Loganathan · 4 years, 8 months ago
  92. 8217c8e COMPMID-2895: Remove QASYMM8_PER_CHANNEL data type by Georgios Pinitas · 4 years, 8 months ago
  93. a046e16 COMPMID-2600: Implement a new and generic depthwise convolution for CL QASYMM8 NHWC by Michele Di Giorgio · 4 years, 9 months ago
  94. cecb0a7 COMPMID-2637 [CL] fix broadcast pixel-wise multiplication with 5D tensors by Sang-Hoon Park · 4 years, 9 months ago
  95. f1cf394 COMPMID-2575 Implement Winograd 7x1/1x7 with FP16 by Pablo Tello · 4 years, 9 months ago
  96. 29a01c9 COMPMID-2417: NEDequantizationLayer support for QASYMM8_PER_CHANNEL by Michalis Spyrou · 4 years, 10 months ago
  97. fb0fdcd COMPMID-2460: Add ELU activation on NEON,CL,GLES by Georgios Pinitas · 4 years, 10 months ago
  98. 70e890e COMPMID-2541 Remove DepthwiseSeparableConvolutionLayer for NEON/CL by Giorgio Arena · 5 years ago
  99. f44fe3d COMPMID-2568: NEON Convolution layer failure by giuros01 · 4 years, 11 months ago
  100. f3622be COMPMID-1979: Fuse Activation Function in CLGEMM - part 4 by Gian Marco Iodice · 5 years ago