  1. 4bfc70e Add Gemm MMUL Reshaped Only Rhs Support for FP32/FP16 by Gunes Bayir · 2 years, 7 months ago
  2. b1fcefd Implement new Elementwise Dynamic Fusion Operators: Div, Floor by Michalis Spyrou · 2 years, 1 month ago
  3. 82169b3 Add cl_khr_integer_dot_product extension support by Viet-Hoa Do · 2 years, 2 months ago
  4. 06adbc5 Mismatches in dynamically fused direct conv2d + add kernel by Michalis Spyrou · 2 years, 2 months ago
  5. ca364df Include missing embedded headers by SiCong Li · 2 years, 3 months ago
  6. 451c309 Revert "Rework gemm_mm_reshaped_only_rhs_ kernels with new macros" by Ramy Elgammal · 2 years, 6 months ago
  7. 10e88a7 Rework gemm_mm_reshaped_only_rhs_ kernels with new macros by Gian Marco Iodice · 2 years, 8 months ago
  8. 3e155a5 Rework gemm_reshape_lhs_ with new macros by Adnan AlSinan · 2 years, 7 months ago
  9. 4fb5670 Rework gemm_reshape_rhs_(nt,t) with new macros by Gian Marco Iodice · 2 years, 8 months ago
  10. 17975a6 Improve start-up time for ClScale by Adnan AlSinan · 2 years, 8 months ago
  11. 945ae9e Implement CLDirectConv3D f32/f16 by Giorgio Arena · 2 years, 9 months ago
  12. 767dbf9 Fix oclgrind int overflow warning by Freddie Liardet · 3 years ago
  13. c38ca38 Fix CL kernel compilation failure by Michalis Spyrou · 3 years ago
  14. 8155c02 Rework OpenCL Depthwise Convolution by Gian Marco Iodice · 3 years, 3 months ago
  15. 6683165 Add quantization helper functions for OpenCL by Georgios Pinitas · 3 years, 1 month ago
  16. c63b722 Revert "Rework OpenCL Depthwise Convolution" by Gian Marco Iodice · 3 years, 1 month ago
  17. 561c176 Rework OpenCL Depthwise Convolution by Gian Marco Iodice · 3 years, 3 months ago
  18. ea8d266 Enable unroll through pragma based on DDK version by Giorgio Arena · 3 years, 2 months ago
  19. bdd16d1 Add macro to manually unroll loops in OpenCL by Giorgio Arena · 3 years, 2 months ago
  20. 2ba39b6 Fix missing DATA_TYPE in DOT_PRODUCT4_INTEGER8 OpenCL macro by Gian Marco Iodice · 3 years, 2 months ago
  21. ada6cbc Remove OpenCL padding: CLPixelWiseMultiplicationKernel by Giorgio Arena · 3 years, 3 months ago
  22. 0b76f7d Add support for cl_image in CLDirectConvolutionLayer by Gian Marco Iodice · 3 years, 3 months ago
  23. 534b889 Rework the OpenCL Winograd Input Transformations NHWC by Gian Marco Iodice · 3 years, 4 months ago
  24. a8903c8 Improve performance of Winograd Output Transform 3x3 by Gian Marco Iodice · 3 years, 4 months ago
  25. 5c9eed8 Extend direct convolution (F32/F16/QASYMM8) by Gian Marco Iodice · 3 years, 4 months ago