1. ada3200 Add update/index/output (m+1)/2d/(m+n) support for CLScatter by Gunes Bayir · 6 weeks ago
  2. 7377107 Scatter GPU Kernel Implementation for 1D tensors. by Mohammed Suhail Munshi · 10 weeks ago
  3. 29254ae Optimize CL softmax by Viet-Hoa Do · 8 months ago
  4. afd38f0 Apply clang-format on repository by Felix Thomasmathibalan · 8 months ago
  5. e87fa66 Add skeleton of ClMatMulLowpNativeMMULKernel by Gunes Bayir · 9 months ago
  6. 0d27b2e Remove legacy PostOps code by Jakub Sujak · 9 months ago
  7. 00474e9 Implement FP32/16 MatMul Lhs T Rhs T/NT kernel using MMUL extension by Gunes Bayir · 12 months ago
  8. c952596 Implement FP32/FP16 MatMul NT/T kernel using the MMUL extension by Ramy Elgammal · 1 year ago
  9. a8d8058 Implement FP32/FP16 MatMul NT/NT kernel using the MMUL extension by SiCong Li · 1 year ago
  10. 5e99a3e Add quantized CL MatMul kernel for LHS NT, RHS T by Jakub Sujak · 1 year, 2 months ago
  11. 467daef Implement CL kernel for a native batched matmul Quantized - LHS transposed, RHS transposed by Omar Al Khatib · 1 year, 2 months ago
  12. 9d0c4de Add quantized CL MatMul kernels for Lhs NT/T, Rhs NT by Gunes Bayir · 1 year, 2 months ago
  13. 8918b23 Implement OpenCL MatMul for Lhs T Rhs T/NT FP32/16 by Gunes Bayir · 1 year, 3 months ago
  14. 14d7b53 Implementation of RSQRT for quantized int8 by Ramy Elgammal · 1 year, 4 months ago
  15. 2b6ebfe Implement OpenCL MatMul for Lhs NT Rhs T/NT FP32/16 by Ramy Elgammal · 1 year, 3 months ago
  16. 3274172 Add CLAMP operator to Dynamic Fusion interface by Jakub Sujak · 1 year, 6 months ago
  17. 76335eb Implement the OpenCL kernel to compute the indirect convolution by Gian Marco Iodice · 1 year, 7 months ago
  18. 5d01681 Implement address precalculation for indirect conv2d - OpenCL by Gian Marco Iodice · 1 year, 7 months ago
  19. ec0113d Optimize Transposed Convolution for CL backend (FP32/16) by Gunes Bayir · 1 year, 7 months ago
  20. e572dff Add GemmLowp MMUL Reshaped Only Rhs Support for QASYMM8/QASYMM8_SIGNED by Freddie Liardet · 2 years, 1 month ago
  21. 4bfc70e Add Gemm MMUL Reshaped Only Rhs Support for FP32/FP16 by Gunes Bayir · 2 years, 6 months ago
  22. 5e549fa Add CLPool3d Int8 Support by Mohammed Suhail Munshi · 2 years, 3 months ago
  23. ca364df Include missing embedded headers by SiCong Li · 2 years, 2 months ago
  24. 892b70a Fix embedded kernel header inclusion for dynamic fusion by Giorgio Arena · 2 years, 2 months ago
  25. 3751569 Implementation of ClPooling3d by ramelg01 · 2 years, 3 months ago
  26. 6863fa0 Remove deprecated remap functions. by Adnan AlSinan · 2 years, 4 months ago
  27. 451c309 Revert "Rework gemm_mm_reshaped_only_rhs_ kernels with new macros" by Ramy Elgammal · 2 years, 4 months ago
  28. 10e88a7 Rework gemm_mm_reshaped_only_rhs_ kernels with new macros by Gian Marco Iodice · 2 years, 6 months ago
  29. 3012435 Remove padding from ClDirectConv2dKernel by Adnan AlSinan · 2 years, 6 months ago
  30. 9cca592 Improve start-up timer for GeMM (floating-point): by ramelg01 · 2 years, 7 months ago
  31. afa1972 Add post ops to ClGemmMatrixMultiplyReshapedOnlyRHSKernel and ClGemmMatrixMultiplyNativeKernel Part 3 by SiCongLi · 2 years, 7 months ago
  32. 1af5416 Add experimental PostOp interface to ClGemmMatrixMultiplyReshapedKernel Part 1 by SiCongLi · 2 years, 8 months ago
  33. c9cecc0 Remove legacy GeMM kernels on OpenCL by Gian Marco Iodice · 2 years, 8 months ago
  34. 945ae9e Implement CLDirectConv3D f32/f16 by Giorgio Arena · 2 years, 8 months ago
  35. 8fce496 Remove padding from ClPool2dKernel NCHW by Giorgio Arena · 2 years, 9 months ago
  36. 7891a73 Move CPU/GPU files from Core/Runtime to the respective backend folders by Georgios Pinitas · 2 years, 10 months ago[Renamed (99%) from src/core/gpu/cl/ClKernelLibrary.cpp]
  37. 511771f Remove padding from ClScaleKernel by Giorgio Arena · 2 years, 10 months ago
  38. f81f51c Reorganize the kernels into nhwc, nchw and common folders by Adnan AlSinan · 2 years, 10 months ago
  39. 7075fe2 Reorganize the kernels into nhwc, nchw and common folders by Adnan AlSinan · 2 years, 11 months ago
  40. 8155c02 Rework OpenCL Depthwise Convolution by Gian Marco Iodice · 3 years, 2 months ago
  41. c63b722 Revert "Rework OpenCL Depthwise Convolution" by Gian Marco Iodice · 3 years ago
  42. 561c176 Rework OpenCL Depthwise Convolution by Gian Marco Iodice · 3 years, 2 months ago
  43. 36dff9f Add NHWC support to CLRemap by Frederick Liardet · 3 years, 1 month ago
  44. 11d8415 Port DepthConvert to new Api by Georgios Pinitas · 3 years, 1 month ago
  45. 908f616 Creates ClKerneLibrary to serve just as a kernel container without build logic by Georgios Pinitas · 3 years, 1 month ago