Gitiles
Code Review
Sign In
review.mlplatform.org
/
ml
/
ComputeLibrary
/
d28b751cf2ba9fcf4ccf294b31bf9d2ec5dfd8bb
/
src
/
core
/
CL
/
CLKernelLibrary.cpp
d28b751
COMPMID-1340 - Implementing Winograd Convolution Layer 1x5/5x1 on OpenCL NHWC
by Gian Marco Iodice
· 6 years ago
149fdf3
COMPMID-1337 Implementing Winograd Convolution Layer 1x3 and 3x1 kernels on OpenCL NHWC
by Giorgio Arena
· 6 years ago
876be2a
COMPMID-1339 - Implementing Winograd Convolution Layer 1x5 and 5x1 kernels on OpenCL NCHW
by Gian Marco Iodice
· 6 years ago
7485d5a
COMPMID-970 : Remove QS8 / QS16 support
by Vidhya Sudhan Loganathan
· 6 years ago
a50e5e0
COMPMID-1338 Split winograd.cl
by Giorgio Arena
· 6 years ago
d051e97
COMPMID-811 Add NHWC data format support for CL depthwise convolution
by Giorgio Arena
· 6 years ago
f1c2bf0
COMPMID-1201 - Implementing Winograd Convolution Layer 1x3 and 3x1 kernels on OpenCL
by Gian Marco Iodice
· 6 years ago
4622ac1
COMPMID-1336: Add CLArithmeticAddition support for QASYMM8
by Michele Di Giorgio
· 6 years ago
19ea419
COMPMID-809: Add NHWC data format on CLGEMMConvolutionLayer.
by Georgios Pinitas
· 6 years ago
be39f12
COMPMID-1204 Add NHWC data format support to Winograd input transform 4x4_5x5
by Giorgio Arena
· 6 years ago
80d65d8
COMPMID-1204 Add NHWC data format support to Winograd filter transform 4x4_5x5
by Giorgio Arena
· 6 years ago
7210fe8
COMPMID-1204 Add NHWC data format support to Winograd output transform 4x4_5x5
by Giorgio Arena
· 6 years ago
c42f28d
COMPMID-1048 Add NHWC data format support to Winograd input transform 4x4_3x3
by Giorgio Arena
· 6 years ago
0a88792
COMPMID-1222 Implementing CLArithmeticDivision - FP32 / FP16
by Michalis Spyrou
· 6 years ago
4a626a7
COMPMID-801: NHWC support in CLIm2Col.
by Pablo Tello
· 6 years ago
dcb5b28
COMPMID-1048 Add NHWC data format support to Winograd filter transform 4x4_3x3
by Giorgio Arena
· 6 years ago
3695f9a
COMPMID-1048 Add NHWC data format support to Winograd output transform 4x4_3x3
by Giorgio Arena
· 6 years ago
e03342e
COMPMID-799 - Use new OpenCL 8-bit dot product instruction
by Michalis Spyrou
· 6 years ago
46da23f
COMPMID-813 Add NHWC data format support for CL scale
by Michalis Spyrou
· 6 years ago
df473ea
COMPMID-1182: printf doesn't work
by Georgios Pinitas
· 6 years ago
76c8564
COMPMID-1083 : Compute library should be made usable on non-ARM platforms
by Vidhya Sudhan Loganathan
· 6 years ago
f1f4906
COMPMID-655 : Check FP16 is supported by the GPU
by Vidhya Sudhan Loganathan
· 6 years ago
55b3d12
COMPMID-1137 OpenCL concatenate width
by Michalis Spyrou
· 6 years ago
657bdb3
COMPMID-1050 CL/NEON: Create a function to convert the 2D weights of FC layer from NHWC to NCHW and viceversa
by Giorgio Arena
· 6 years ago
bb36a8e
COMPMID-922 - CLGEMM FP16 optimizations - part2
by Gian Marco Iodice
· 6 years ago
7217563
COMPMID-1107: Add support for ChannelShuffle in CL
by Michele Di Giorgio
· 6 years ago
e74b201
COMPMID-805 Add NHWC data format support for CL pooling
by Michalis Spyrou
· 6 years ago
bf3c662
COMPMID-803: Add NHWC data format support for CL batch normalisation
by Michele Di Giorgio
· 6 years ago
d727e85
COMPMID-855: Get the library to work on non Mali GPUs
by Anthony Barbier
· 6 years ago
e52a300
COMPMID-1026 - Add support for 4x4 output tile in CLWinogradConvolutionLayer
by Gian Marco Iodice
· 6 years ago
dd03870
COMPMID-1037 Add support for F(4x4, 5x5) in CLWinogradOutputTransformKernel
by Giorgio Arena
· 6 years ago
7da55aa
COMPMID-959: Add accessors for the OpenCL program cache
by Anthony Barbier
· 6 years ago
fd68311
COMPMID-922 - CLGEMM FP16 optimizations - part1
by Gian Marco Iodice
· 6 years ago
dfca60b
COMPMID-811 Add NHWC data format support for CL depthwise convolution QASYMM8
by Giorgio Arena
· 6 years ago
fe5ef38
COMPMID-1037 Add support for F(4x4, 5x5) in CLWinogradInputTransformKernel
by Giorgio Arena
· 6 years ago
ecb1c62
COMPMID-959: Fixed order of init/destruction of CLSymbols / CLKernelLibrary
by Anthony Barbier
· 6 years ago
9373c8b
COMPMID-1037 Add support for F(4x4, 5x5) in CLWinogradFilterTransformKernel
by Giorgio Arena
· 6 years ago
3ebef32
COMPMID-949: Optimizing CLDepthwiseConvolution3x3Kernel for FP16
by Michele Di Giorgio
· 6 years ago
e86a09f
COMPMID-337: Adding OpenCL SVM support.
by Pablo Tello
· 6 years ago
5c8e05c
COMPMID-1019 Implement copy function CL
by Michalis Spyrou
· 6 years ago
2d9de0a
COMPMID-1009 Support 4x4 output tile for Winograd Filter Transform on OpenCL.
by Giorgio Arena
· 6 years ago
d2fab73
COMPMID-935 - Implementing Convolution with Winograd on OpenCL (part 4)
by Gian Marco Iodice
· 6 years ago
7e4b239
COMPMID-935 - Implementing Convolution with Winograd on OpenCL (part 2)
by Gian Marco Iodice
· 6 years ago
1f9ca1d
COMPMID-935 Implementing Convolution with Winograd on OpenCL (part 3)
by Giorgio Arena
· 6 years ago
a967611
COMPMID-886 Don't use LWS hints by default for GPU post Mali-G72
by Michalis Spyrou
· 6 years ago
847864d
COMPMID-995 Add CL_DEVICE_VERSION to the test framework output
by Anthony Barbier
· 6 years ago
933fe86
COMPMID-927: Adding support for FP16 in CLDepthwiseConvolutionLayer3x3
by Michele Di Giorgio
· 6 years ago
19835e5
COMPMID-882 - Optimizing GEMMLowp on OpenCL reshaping matrices
by Gian Marco
· 6 years ago
4402cb9
COMPMID-905 Optimize CLSoftmaxLayer for QASYMM8
by Giorgio Arena
· 6 years ago
a086a0a
COMPMID-765 Move direct convolution output stage to the right file
by Giorgio Arena
· 6 years ago
de5a1cc
COMPMID-856: CL Depthwise Convolution QASYMM8 support
by Georgios Pinitas
· 6 years ago
a527e8c
COMPMID-828 - Add support for pool widths 4, 5 & 6 and for non square data sizes - Part 2 (CL)
by Isabella Gottardi
· 6 years ago
c799ed8
COMPMID-895 - Optimizing CLDepthwiseConvolution3x3Kernel
by Gian Marco
· 6 years ago
76faef8
COMPMID-855 - Optimizing im2col on OpenCL (DCHW)
by Gian Marco
· 6 years ago
36a0a46
COMPMID-748 - Integrating optimized SGEMM for bifrost
by Gian Marco
· 6 years ago
7b4d547
COMPMID-816 - Optimizing CLGEMMLowpMatrixMultiplyCore - Part1
by Gian Marco
· 6 years ago
5237e01
COMPMID-838 Implement CLPermute
by Michalis Spyrou
· 6 years ago
780db4e
COMPMID-471 Implement Deconvolution on OpenCL
by Michalis Spyrou
· 7 years ago
fcd52fb
COMPMID-661: Vectorize im2col and add lws heuristics for convolution kernels #46
by Anthony Barbier
· 7 years ago
58c5794
COMPMID-706 - Add GEMMLowp output stage for scaling by a fixed point number
by Gian Marco
· 7 years ago
0162436
COMPMID-684: 2D In-Map normalization support for CL
by Georgios Pinitas
· 7 years ago
05288a2
COMPMID-697 - Rework GEMMLowp interface on OpenCL
by Gian Marco
· 7 years ago
3e80c7f
COMPMID-661: Optimize FC layer with 2 new Bifrost kernels and LWS tuning (#33)
by Anton Lokhmotov
· 7 years ago
d7295b7
COMPMID-661: Add QASYMM8 support (and basic tests) to CLDepthwiseConvolution3x3 kernel (#28)
by Dmitry Savenko
· 7 years ago
f450caa
COMPMID-661: softmax-uint8 implementation (#16)
by Chunosov
· 7 years ago
af6204c
COMPMID-661: Add avgpool-uint8 support. Optimize avgpool-fp32 for Bifrost. (#13)
by Anton Lokhmotov
· 7 years ago
d6afedc
COMPMID-661: softmax-fp32 optimisation (#14)
by Chunosov
· 7 years ago
d621bca
COMPMID-661: directconv-uint8 (#20)
by Chunosov
· 7 years ago
388d3ec
COMPMID-556: Support beta for all softmax data types.
by Georgios Pinitas
· 7 years ago
6f31f8c
Allow running without cl_khr_fp16
by Matthew Bentham
· 7 years ago
0063380
IVGCVSW-619: Support for Cl u8 bounded Relu
by Michel Iwaniec
· 7 years ago
5a6e053
COMPUTE-8024 Fixed the maximum OpenCL workgroup size
by Abel Bernabeu
· 7 years ago
9fe4144
COMPMID-452 CL Generic Depthwise Convolution implementation.
by Giorgio Arena
· 7 years ago
bf17955
COMPMID-522 - Added support for GlobalPooling in CLPoolingLayer and CLFlattening for 3D tensor
by Gian Marco Iodice
· 7 years ago
5ee66ea
COMPMID-462: Implement TensorReshape for NEON and CL.
by Georgios Pinitas
· 7 years ago
56dd726
COMPMID-448: Implement CL Quantization/Dequantization Layer.
by Michele Di Giorgio
· 7 years ago
1c8409d
COMPMID-477 - Optimized CLDirectConvolution1x1 for Bifrost
by Gian Marco Iodice
· 7 years ago
04f089c
COMPMID-476 L2 Normalization for CL
by Michalis Spyrou
· 7 years ago
3e36369
COMPMID-358 Implement OpenCL ROI Pooling
by SiCong Li
· 7 years ago
edfa9f4
COMPMID-477 - Optimized batched case in CLConvolutionLayer
by Gian Marco Iodice
· 7 years ago
5f91072
COMPMID-513 Choose maximum local workgroup size at run time
by steniu01
· 7 years ago
93a690e
COMPMID-452 CL Depthwise Separable Convolution Layer kernel implementation, validation and benchmarking for 3x3xC depthwise filter and DataType::F32.
by Giorgio Arena
· 7 years ago
cb29283
COMPMID-477 - Optimizing Pooling 3x3 with stride_x <= 3 on OpenCL
by Gian Marco Iodice
· 7 years ago
1246b63
COMPMID-477 - Optimized Direct Convolution 3x3 and 5x5 (f32) for Bifrost.
by Gian Marco Iodice
· 7 years ago
db00668
COMPMID-478 Implemnt CL direct convolution 5x5
by steniu01
· 7 years ago
d8e765b
COMPMID-472 : Implement Floor for CL and NEON.
by Georgios Pinitas
· 7 years ago
c51b72f
COMPMID-355 Implement CL DirectConvolution1x1
by SiCong Li
· 7 years ago
3a62324
COMPMID-455 - Optimizing CLIm2ColKernel
by Gian Marco Iodice
· 7 years ago
27b386c
COMPMID-355 Implement 3x3 CL direct convolution
by steniu01
· 7 years ago
3470247
COMPMID-417 Checking CL non uniform support at runtime.
by steniu01
· 7 years ago
8a38369
COMPMID-434 - Port CLGEMM to support 16 bit fixed point
by Gian Marco Iodice
· 7 years ago
ac69aa1
COMPMID-418 Add check and fix comments after preprocessor conditions
by Anthony Barbier
· 7 years ago
d7e8281
COMPMID-408 Create OpenCL complex math functions for 8 bit fixed point arithmetic.
by Michalis Spyrou
· 7 years ago
3a3066b
COMPMID-411 - Port CLGEMM to support 8 bit fixed point
by Gian Marco Iodice
· 7 years ago
e5f8fd6
COMPMID-423: Port CLSoftmaxLayer to QS8
by Georgios Pinitas
· 7 years ago
578ab61
COMPMID-414 - Port CLConvolutionLayer to support 8 bit fixed point - CLGEMMMatrixAccumulateBiasesKernel
by Gian Marco Iodice
· 7 years ago
9f89bae
COMPMID-411 - Ported CLGEMMInterleave4x4Kernel and CLGEMMTranspose1xWKernel to support 8 bit fixed point
by Gian Marco Iodice
· 7 years ago
ce09314
COMPMID-403:Add support for 7x7 pooling on CL.
by Georgios Pinitas
· 7 years ago
6ff3b19
COMPMID-344 Updated doxygen
by Anthony Barbier
· 7 years ago