1. d28b751 COMPMID-1340 - Implementing Winograd Convolution Layer 1x5/5x1 on OpenCL NHWC by Gian Marco Iodice · 6 years ago
  2. 149fdf3 COMPMID-1337 Implementing Winograd Convolution Layer 1x3 and 3x1 kernels on OpenCL NHWC by Giorgio Arena · 6 years ago
  3. 876be2a COMPMID-1339 - Implementing Winograd Convolution Layer 1x5 and 5x1 kernels on OpenCL NCHW by Gian Marco Iodice · 6 years ago
  4. 7485d5a COMPMID-970 : Remove QS8 / QS16 support by Vidhya Sudhan Loganathan · 6 years ago
  5. a50e5e0 COMPMID-1338 Split winograd.cl by Giorgio Arena · 6 years ago
  6. d051e97 COMPMID-811 Add NHWC data format support for CL depthwise convolution by Giorgio Arena · 6 years ago
  7. f1c2bf0 COMPMID-1201 - Implementing Winograd Convolution Layer 1x3 and 3x1 kernels on OpenCL by Gian Marco Iodice · 6 years ago
  8. 4622ac1 COMPMID-1336: Add CLArithmeticAddition support for QASYMM8 by Michele Di Giorgio · 6 years ago
  9. 19ea419 COMPMID-809: Add NHWC data format on CLGEMMConvolutionLayer. by Georgios Pinitas · 6 years ago
  10. be39f12 COMPMID-1204 Add NHWC data format support to Winograd input transform 4x4_5x5 by Giorgio Arena · 6 years ago
  11. 80d65d8 COMPMID-1204 Add NHWC data format support to Winograd filter transform 4x4_5x5 by Giorgio Arena · 6 years ago
  12. 7210fe8 COMPMID-1204 Add NHWC data format support to Winograd output transform 4x4_5x5 by Giorgio Arena · 6 years ago
  13. c42f28d COMPMID-1048 Add NHWC data format support to Winograd input transform 4x4_3x3 by Giorgio Arena · 6 years ago
  14. 0a88792 COMPMID-1222 Implementing CLArithmeticDivision - FP32 / FP16 by Michalis Spyrou · 6 years ago
  15. 4a626a7 COMPMID-801: NHWC support in CLIm2Col. by Pablo Tello · 6 years ago
  16. dcb5b28 COMPMID-1048 Add NHWC data format support to Winograd filter transform 4x4_3x3 by Giorgio Arena · 6 years ago
  17. 3695f9a COMPMID-1048 Add NHWC data format support to Winograd output transform 4x4_3x3 by Giorgio Arena · 6 years ago
  18. e03342e COMPMID-799 - Use new OpenCL 8-bit dot product instruction by Michalis Spyrou · 6 years ago
  19. 46da23f COMPMID-813 Add NHWC data format support for CL scale by Michalis Spyrou · 6 years ago
  20. df473ea COMPMID-1182: printf doesn't work by Georgios Pinitas · 6 years ago
  21. 76c8564 COMPMID-1083 : Compute library should be made usable on non-ARM platforms by Vidhya Sudhan Loganathan · 6 years ago
  22. f1f4906 COMPMID-655 : Check FP16 is supported by the GPU by Vidhya Sudhan Loganathan · 6 years ago
  23. 55b3d12 COMPMID-1137 OpenCL concatenate width by Michalis Spyrou · 6 years ago
  24. 657bdb3 COMPMID-1050 CL/NEON: Create a function to convert the 2D weights of FC layer from NHWC to NCHW and viceversa by Giorgio Arena · 6 years ago
  25. bb36a8e COMPMID-922 - CLGEMM FP16 optimizations - part2 by Gian Marco Iodice · 6 years ago
  26. 7217563 COMPMID-1107: Add support for ChannelShuffle in CL by Michele Di Giorgio · 6 years ago
  27. e74b201 COMPMID-805 Add NHWC data format support for CL pooling by Michalis Spyrou · 6 years ago
  28. bf3c662 COMPMID-803: Add NHWC data format support for CL batch normalisation by Michele Di Giorgio · 6 years ago
  29. d727e85 COMPMID-855: Get the library to work on non Mali GPUs by Anthony Barbier · 6 years ago
  30. e52a300 COMPMID-1026 - Add support for 4x4 output tile in CLWinogradConvolutionLayer by Gian Marco Iodice · 6 years ago
  31. dd03870 COMPMID-1037 Add support for F(4x4, 5x5) in CLWinogradOutputTransformKernel by Giorgio Arena · 6 years ago
  32. 7da55aa COMPMID-959: Add accessors for the OpenCL program cache by Anthony Barbier · 6 years ago
  33. fd68311 COMPMID-922 - CLGEMM FP16 optimizations - part1 by Gian Marco Iodice · 6 years ago
  34. dfca60b COMPMID-811 Add NHWC data format support for CL depthwise convolution QASYMM8 by Giorgio Arena · 6 years ago
  35. fe5ef38 COMPMID-1037 Add support for F(4x4, 5x5) in CLWinogradInputTransformKernel by Giorgio Arena · 6 years ago
  36. ecb1c62 COMPMID-959: Fixed order of init/destruction of CLSymbols / CLKernelLibrary by Anthony Barbier · 6 years ago
  37. 9373c8b COMPMID-1037 Add support for F(4x4, 5x5) in CLWinogradFilterTransformKernel by Giorgio Arena · 6 years ago
  38. 3ebef32 COMPMID-949: Optimizing CLDepthwiseConvolution3x3Kernel for FP16 by Michele Di Giorgio · 6 years ago
  39. e86a09f COMPMID-337: Adding OpenCL SVM support. by Pablo Tello · 6 years ago
  40. 5c8e05c COMPMID-1019 Implement copy function CL by Michalis Spyrou · 6 years ago
  41. 2d9de0a COMPMID-1009 Support 4x4 output tile for Winograd Filter Transform on OpenCL. by Giorgio Arena · 6 years ago
  42. d2fab73 COMPMID-935 - Implementing Convolution with Winograd on OpenCL (part 4) by Gian Marco Iodice · 6 years ago
  43. 7e4b239 COMPMID-935 - Implementing Convolution with Winograd on OpenCL (part 2) by Gian Marco Iodice · 6 years ago
  44. 1f9ca1d COMPMID-935 Implementing Convolution with Winograd on OpenCL (part 3) by Giorgio Arena · 6 years ago
  45. a967611 COMPMID-886 Don't use LWS hints by default for GPU post Mali-G72 by Michalis Spyrou · 6 years ago
  46. 847864d COMPMID-995 Add CL_DEVICE_VERSION to the test framework output by Anthony Barbier · 6 years ago
  47. 933fe86 COMPMID-927: Adding support for FP16 in CLDepthwiseConvolutionLayer3x3 by Michele Di Giorgio · 6 years ago
  48. 19835e5 COMPMID-882 - Optimizing GEMMLowp on OpenCL reshaping matrices by Gian Marco · 6 years ago
  49. 4402cb9 COMPMID-905 Optimize CLSoftmaxLayer for QASYMM8 by Giorgio Arena · 6 years ago
  50. a086a0a COMPMID-765 Move direct convolution output stage to the right file by Giorgio Arena · 6 years ago
  51. de5a1cc COMPMID-856: CL Depthwise Convolution QASYMM8 support by Georgios Pinitas · 6 years ago
  52. a527e8c COMPMID-828 - Add support for pool widths 4, 5 & 6 and for non square data sizes - Part 2 (CL) by Isabella Gottardi · 6 years ago
  53. c799ed8 COMPMID-895 - Optimizing CLDepthwiseConvolution3x3Kernel by Gian Marco · 6 years ago
  54. 76faef8 COMPMID-855 - Optimizing im2col on OpenCL (DCHW) by Gian Marco · 6 years ago
  55. 36a0a46 COMPMID-748 - Integrating optimized SGEMM for bifrost by Gian Marco · 6 years ago
  56. 7b4d547 COMPMID-816 - Optimizing CLGEMMLowpMatrixMultiplyCore - Part1 by Gian Marco · 6 years ago
  57. 5237e01 COMPMID-838 Implement CLPermute by Michalis Spyrou · 6 years ago
  58. 780db4e COMPMID-471 Implement Deconvolution on OpenCL by Michalis Spyrou · 7 years ago
  59. fcd52fb COMPMID-661: Vectorize im2col and add lws heuristics for convolution kernels #46 by Anthony Barbier · 7 years ago
  60. 58c5794 COMPMID-706 - Add GEMMLowp output stage for scaling by a fixed point number by Gian Marco · 7 years ago
  61. 0162436 COMPMID-684: 2D In-Map normalization support for CL by Georgios Pinitas · 7 years ago
  62. 05288a2 COMPMID-697 - Rework GEMMLowp interface on OpenCL by Gian Marco · 7 years ago
  63. 3e80c7f COMPMID-661: Optimize FC layer with 2 new Bifrost kernels and LWS tuning (#33) by Anton Lokhmotov · 7 years ago
  64. d7295b7 COMPMID-661: Add QASYMM8 support (and basic tests) to CLDepthwiseConvolution3x3 kernel (#28) by Dmitry Savenko · 7 years ago
  65. f450caa COMPMID-661: softmax-uint8 implementation (#16) by Chunosov · 7 years ago
  66. af6204c COMPMID-661: Add avgpool-uint8 support. Optimize avgpool-fp32 for Bifrost. (#13) by Anton Lokhmotov · 7 years ago
  67. d6afedc COMPMID-661: softmax-fp32 optimisation (#14) by Chunosov · 7 years ago
  68. d621bca COMPMID-661: directconv-uint8 (#20) by Chunosov · 7 years ago
  69. 388d3ec COMPMID-556: Support beta for all softmax data types. by Georgios Pinitas · 7 years ago
  70. 6f31f8c Allow running without cl_khr_fp16 by Matthew Bentham · 7 years ago
  71. 0063380 IVGCVSW-619: Support for Cl u8 bounded Relu by Michel Iwaniec · 7 years ago
  72. 5a6e053 COMPUTE-8024 Fixed the maximum OpenCL workgroup size by Abel Bernabeu · 7 years ago
  73. 9fe4144 COMPMID-452 CL Generic Depthwise Convolution implementation. by Giorgio Arena · 7 years ago
  74. bf17955 COMPMID-522 - Added support for GlobalPooling in CLPoolingLayer and CLFlattening for 3D tensor by Gian Marco Iodice · 7 years ago
  75. 5ee66ea COMPMID-462: Implement TensorReshape for NEON and CL. by Georgios Pinitas · 7 years ago
  76. 56dd726 COMPMID-448: Implement CL Quantization/Dequantization Layer. by Michele Di Giorgio · 7 years ago
  77. 1c8409d COMPMID-477 - Optimized CLDirectConvolution1x1 for Bifrost by Gian Marco Iodice · 7 years ago
  78. 04f089c COMPMID-476 L2 Normalization for CL by Michalis Spyrou · 7 years ago
  79. 3e36369 COMPMID-358 Implement OpenCL ROI Pooling by SiCong Li · 7 years ago
  80. edfa9f4 COMPMID-477 - Optimized batched case in CLConvolutionLayer by Gian Marco Iodice · 7 years ago
  81. 5f91072 COMPMID-513 Choose maximum local workgroup size at run time by steniu01 · 7 years ago
  82. 93a690e COMPMID-452 CL Depthwise Separable Convolution Layer kernel implementation, validation and benchmarking for 3x3xC depthwise filter and DataType::F32. by Giorgio Arena · 7 years ago
  83. cb29283 COMPMID-477 - Optimizing Pooling 3x3 with stride_x <= 3 on OpenCL by Gian Marco Iodice · 7 years ago
  84. 1246b63 COMPMID-477 - Optimized Direct Convolution 3x3 and 5x5 (f32) for Bifrost. by Gian Marco Iodice · 7 years ago
  85. db00668 COMPMID-478 Implemnt CL direct convolution 5x5 by steniu01 · 7 years ago
  86. d8e765b COMPMID-472 : Implement Floor for CL and NEON. by Georgios Pinitas · 7 years ago
  87. c51b72f COMPMID-355 Implement CL DirectConvolution1x1 by SiCong Li · 7 years ago
  88. 3a62324 COMPMID-455 - Optimizing CLIm2ColKernel by Gian Marco Iodice · 7 years ago
  89. 27b386c COMPMID-355 Implement 3x3 CL direct convolution by steniu01 · 7 years ago
  90. 3470247 COMPMID-417 Checking CL non uniform support at runtime. by steniu01 · 7 years ago
  91. 8a38369 COMPMID-434 - Port CLGEMM to support 16 bit fixed point by Gian Marco Iodice · 7 years ago
  92. ac69aa1 COMPMID-418 Add check and fix comments after preprocessor conditions by Anthony Barbier · 7 years ago
  93. d7e8281 COMPMID-408 Create OpenCL complex math functions for 8 bit fixed point arithmetic. by Michalis Spyrou · 7 years ago
  94. 3a3066b COMPMID-411 - Port CLGEMM to support 8 bit fixed point by Gian Marco Iodice · 7 years ago
  95. e5f8fd6 COMPMID-423: Port CLSoftmaxLayer to QS8 by Georgios Pinitas · 7 years ago
  96. 578ab61 COMPMID-414 - Port CLConvolutionLayer to support 8 bit fixed point - CLGEMMMatrixAccumulateBiasesKernel by Gian Marco Iodice · 7 years ago
  97. 9f89bae COMPMID-411 - Ported CLGEMMInterleave4x4Kernel and CLGEMMTranspose1xWKernel to support 8 bit fixed point by Gian Marco Iodice · 7 years ago
  98. ce09314 COMPMID-403:Add support for 7x7 pooling on CL. by Georgios Pinitas · 7 years ago
  99. 6ff3b19 COMPMID-344 Updated doxygen by Anthony Barbier · 7 years ago