1. f1f4906 COMPMID-655 : Check FP16 is supported by the GPU by Vidhya Sudhan Loganathan · 6 years ago
  2. 750641d COMPMID-1052 - Rework validate method in CLGEMM by Gian Marco Iodice · 6 years ago
  3. bb36a8e COMPMID-922 - CLGEMM FP16 optimizations - part2 by Gian Marco Iodice · 6 years ago
  4. 535fedd COMPMID-1117: TransposeAccessWindow leads to high padding by Georgios Pinitas · 6 years ago
  5. e52a300 COMPMID-1026 - Add support for 4x4 output tile in CLWinogradConvolutionLayer by Gian Marco Iodice · 6 years ago
  6. fd68311 COMPMID-922 - CLGEMM FP16 optimizations - part1 by Gian Marco Iodice · 6 years ago
  7. 56e8e86 COMPMID-1031: Use LWS hints for G51, G51BIG, G51LIT, and TNOX by Sam Laynton · 6 years ago
  8. 81b28c4 COMPMID-1032 - Fixing bug in CLGEMM when is_interleaved_transposed=true by Gian Marco Iodice · 6 years ago
  9. d2fab73 COMPMID-935 - Implementing Convolution with Winograd on OpenCL (part 4) by Gian Marco Iodice · 6 years ago
  10. a967611 COMPMID-886 Don't use LWS hints by default for GPU post Mali-G72 by Michalis Spyrou · 6 years ago
  11. ae2af74 COMPMID-935 - Implementing Convolution with Winograd on OpenCL (Part 1) by Gian Marco · 6 years ago
  12. d56e770 COMPMID-979: Add NHWC data layout to the tensor's metadata (Part 2) by Isabella Gottardi · 6 years ago
  13. 78c0090 COMPMID-754: Add validation to kernels. by Georgios Pinitas · 6 years ago
  14. 36a0a46 COMPMID-748 - Integrating optimized SGEMM for bifrost by Gian Marco · 6 years ago
  15. 1d25ed5 COMPMID-759 - CLGEMM optimization for McVail benchmarks by Gian Marco · 7 years ago
  16. 358ca20 COMPMID-617: Adds CLFullyConnectionLayer validation support by Georgios Pinitas · 7 years ago
  17. fcd52fb COMPMID-661: Vectorize im2col and add lws heuristics for convolution kernels #46 by Anthony Barbier · 7 years ago
  18. 3e80c7f COMPMID-661: Optimize FC layer with 2 new Bifrost kernels and LWS tuning (#33) by Anton Lokhmotov · 7 years ago
  19. de691f0 COMPMID-524 - Implemented CLTuner object by Gian Marco · 7 years ago
  20. edfa9f4 COMPMID-477 - Optimized batched case in CLConvolutionLayer by Gian Marco Iodice · 7 years ago
  21. 768e9f1 COMPMID-417: Cleanup CL FullyConnectedLayer by Moritz Pflanzer · 7 years ago
  22. 21efeb4 COMPMID-417: DepthConvert NEON for QS8/QS16. by Georgios Pinitas · 7 years ago
  23. 8a38369 COMPMID-434 - Port CLGEMM to support 16 bit fixed point by Gian Marco Iodice · 7 years ago
  24. 3a3066b COMPMID-411 - Port CLGEMM to support 8 bit fixed point by Gian Marco Iodice · 7 years ago
  25. 6ff3b19 COMPMID-344 Updated doxygen by Anthony Barbier · 7 years ago