  1. 2d9de0a COMPMID-1009 Support 4x4 output tile for Winograd Filter Transform on OpenCL. by Giorgio Arena · 6 years ago
  2. d2fab73 COMPMID-935 - Implementing Convolution with Winograd on OpenCL (part 4) by Gian Marco Iodice · 6 years ago
  3. 7e4b239 COMPMID-935 - Implementing Convolution with Winograd on OpenCL (part 2) by Gian Marco Iodice · 6 years ago
  4. 1f9ca1d COMPMID-935 Implementing Convolution with Winograd on OpenCL (part 3) by Giorgio Arena · 6 years ago
  5. a967611 COMPMID-886 Don't use LWS hints by default for GPU post Mali-G72 by Michalis Spyrou · 6 years ago
  6. 847864d COMPMID-995 Add CL_DEVICE_VERSION to the test framework output by Anthony Barbier · 6 years ago
  7. 933fe86 COMPMID-927: Adding support for FP16 in CLDepthwiseConvolutionLayer3x3 by Michele Di Giorgio · 6 years ago
  8. 19835e5 COMPMID-882 - Optimizing GEMMLowp on OpenCL reshaping matrices by Gian Marco · 6 years ago
  9. 4402cb9 COMPMID-905 Optimize CLSoftmaxLayer for QASYMM8 by Giorgio Arena · 6 years ago
  10. a086a0a COMPMID-765 Move direct convolution output stage to the right file by Giorgio Arena · 6 years ago
  11. de5a1cc COMPMID-856: CL Depthwise Convolution QASYMM8 support by Georgios Pinitas · 6 years ago
  12. a527e8c COMPMID-828 - Add support for pool widths 4, 5 & 6 and for non square data sizes - Part 2 (CL) by Isabella Gottardi · 6 years ago
  13. c799ed8 COMPMID-895 - Optimizing CLDepthwiseConvolution3x3Kernel by Gian Marco · 6 years ago
  14. 76faef8 COMPMID-855 - Optimizing im2col on OpenCL (DCHW) by Gian Marco · 6 years ago
  15. 36a0a46 COMPMID-748 - Integrating optimized SGEMM for bifrost by Gian Marco · 6 years ago
  16. 7b4d547 COMPMID-816 - Optimizing CLGEMMLowpMatrixMultiplyCore - Part1 by Gian Marco · 6 years ago
  17. 5237e01 COMPMID-838 Implement CLPermute by Michalis Spyrou · 6 years ago
  18. 780db4e COMPMID-471 Implement Deconvolution on OpenCL by Michalis Spyrou · 7 years ago
  19. fcd52fb COMPMID-661: Vectorize im2col and add lws heuristics for convolution kernels #46 by Anthony Barbier · 7 years ago
  20. 58c5794 COMPMID-706 - Add GEMMLowp output stage for scaling by a fixed point number by Gian Marco · 7 years ago
  21. 0162436 COMPMID-684: 2D In-Map normalization support for CL by Georgios Pinitas · 7 years ago
  22. 05288a2 COMPMID-697 - Rework GEMMLowp interface on OpenCL by Gian Marco · 7 years ago
  23. 3e80c7f COMPMID-661: Optimize FC layer with 2 new Bifrost kernels and LWS tuning (#33) by Anton Lokhmotov · 7 years ago
  24. d7295b7 COMPMID-661: Add QASYMM8 support (and basic tests) to CLDepthwiseConvolution3x3 kernel (#28) by Dmitry Savenko · 7 years ago
  25. f450caa COMPMID-661: softmax-uint8 implementation (#16) by Chunosov · 7 years ago
  26. af6204c COMPMID-661: Add avgpool-uint8 support. Optimize avgpool-fp32 for Bifrost. (#13) by Anton Lokhmotov · 7 years ago
  27. d6afedc COMPMID-661: softmax-fp32 optimisation (#14) by Chunosov · 7 years ago
  28. d621bca COMPMID-661: directconv-uint8 (#20) by Chunosov · 7 years ago
  29. 388d3ec COMPMID-556: Support beta for all softmax data types. by Georgios Pinitas · 7 years ago
  30. 6f31f8c Allow running without cl_khr_fp16 by Matthew Bentham · 7 years ago
  31. 0063380 IVGCVSW-619: Support for Cl u8 bounded Relu by Michel Iwaniec · 7 years ago
  32. 5a6e053 COMPUTE-8024 Fixed the maximum OpenCL workgroup size by Abel Bernabeu · 7 years ago
  33. 9fe4144 COMPMID-452 CL Generic Depthwise Convolution implementation. by Giorgio Arena · 7 years ago
  34. bf17955 COMPMID-522 - Added support for GlobalPooling in CLPoolingLayer and CLFlattening for 3D tensor by Gian Marco Iodice · 7 years ago
  35. 5ee66ea COMPMID-462: Implement TensorReshape for NEON and CL. by Georgios Pinitas · 7 years ago
  36. 56dd726 COMPMID-448: Implement CL Quantization/Dequantization Layer. by Michele Di Giorgio · 7 years ago
  37. 1c8409d COMPMID-477 - Optimized CLDirectConvolution1x1 for Bifrost by Gian Marco Iodice · 7 years ago
  38. 04f089c COMPMID-476 L2 Normalization for CL by Michalis Spyrou · 7 years ago
  39. 3e36369 COMPMID-358 Implement OpenCL ROI Pooling by SiCong Li · 7 years ago
  40. edfa9f4 COMPMID-477 - Optimized batched case in CLConvolutionLayer by Gian Marco Iodice · 7 years ago
  41. 5f91072 COMPMID-513 Choose maximum local workgroup size at run time by steniu01 · 7 years ago
  42. 93a690e COMPMID-452 CL Depthwise Separable Convolution Layer kernel implementation, validation and benchmarking for 3x3xC depthwise filter and DataType::F32. by Giorgio Arena · 7 years ago
  43. cb29283 COMPMID-477 - Optimizing Pooling 3x3 with stride_x <= 3 on OpenCL by Gian Marco Iodice · 7 years ago
  44. 1246b63 COMPMID-477 - Optimized Direct Convolution 3x3 and 5x5 (f32) for Bifrost. by Gian Marco Iodice · 7 years ago
  45. db00668 COMPMID-478 Implement CL direct convolution 5x5 by steniu01 · 7 years ago
  46. d8e765b COMPMID-472 : Implement Floor for CL and NEON. by Georgios Pinitas · 7 years ago
  47. c51b72f COMPMID-355 Implement CL DirectConvolution1x1 by SiCong Li · 7 years ago
  48. 3a62324 COMPMID-455 - Optimizing CLIm2ColKernel by Gian Marco Iodice · 7 years ago
  49. 27b386c COMPMID-355 Implement 3x3 CL direct convolution by steniu01 · 7 years ago
  50. 3470247 COMPMID-417 Checking CL non uniform support at runtime. by steniu01 · 7 years ago
  51. 8a38369 COMPMID-434 - Port CLGEMM to support 16 bit fixed point by Gian Marco Iodice · 7 years ago
  52. ac69aa1 COMPMID-418 Add check and fix comments after preprocessor conditions by Anthony Barbier · 7 years ago
  53. d7e8281 COMPMID-408 Create OpenCL complex math functions for 8 bit fixed point arithmetic. by Michalis Spyrou · 7 years ago
  54. 3a3066b COMPMID-411 - Port CLGEMM to support 8 bit fixed point by Gian Marco Iodice · 7 years ago
  55. e5f8fd6 COMPMID-423: Port CLSoftmaxLayer to QS8 by Georgios Pinitas · 7 years ago
  56. 578ab61 COMPMID-414 - Port CLConvolutionLayer to support 8 bit fixed point - CLGEMMMatrixAccumulateBiasesKernel by Gian Marco Iodice · 7 years ago
  57. 9f89bae COMPMID-411 - Ported CLGEMMInterleave4x4Kernel and CLGEMMTranspose1xWKernel to support 8 bit fixed point by Gian Marco Iodice · 7 years ago
  58. ce09314 COMPMID-403:Add support for 7x7 pooling on CL. by Georgios Pinitas · 7 years ago
  59. 6ff3b19 COMPMID-344 Updated doxygen by Anthony Barbier · 7 years ago