Gitiles
Code Review
Sign In
review.mlplatform.org
/
ml
/
ComputeLibrary
/
d39e2b1e0be12420b1e00279ebee0c34bae3dd8c
/
src
/
core
/
CL
/
kernels
/
CLGEMMMatrixMultiplyKernel.cpp
d39e2b1
COMPMID-1188 - Fixed performance degradation with GEMM3D
by Gian Marco Iodice
· 6 years ago
68a3f56
COMPMID-1276 - Allow GEMM to work with 3D input tensor
by Gian Marco Iodice
· 6 years ago
e8bd2c7
COMPMID-1384: graph_mobilenet fails for NHWC on OpenCL
by Georgios Pinitas
· 6 years ago
7485d5a
COMPMID-970 : Remove QS8 / QS16 support
by Vidhya Sudhan Loganathan
· 6 years ago
8e74f44
COMPMID-911: Allow GEMM to work with 3D tensors
by Isabella Gottardi
· 6 years ago
17812ba
COMPMID-817: Tuner: Port kernels to new design.
by Georgios Pinitas
· 6 years ago
f1f4906
COMPMID-655 : Check FP16 is supported by the GPU
by Vidhya Sudhan Loganathan
· 6 years ago
750641d
COMPMID-1052 - Rework validate method in CLGEMM
by Gian Marco Iodice
· 6 years ago
bb36a8e
COMPMID-922 - CLGEMM FP16 optimizations - part2
by Gian Marco Iodice
· 6 years ago
535fedd
COMPMID-1117: TransposeAccessWindow leads to high padding
by Georgios Pinitas
· 6 years ago
e52a300
COMPMID-1026 - Add support for 4x4 output tile in CLWinogradConvolutionLayer
by Gian Marco Iodice
· 6 years ago
fd68311
COMPMID-922 - CLGEMM FP16 optimizations - part1
by Gian Marco Iodice
· 6 years ago
56e8e86
COMPMID-1031: Use LWS hints for G51, G51BIG, G51LIT, and TNOX
by Sam Laynton
· 6 years ago
81b28c4
COMPMID-1032 - Fixing bug in CLGEMM when is_interleaved_transposed=true
by Gian Marco Iodice
· 6 years ago
d2fab73
COMPMID-935 - Implementing Convolution with Winograd on OpenCL (part 4)
by Gian Marco Iodice
· 6 years ago
a967611
COMPMID-886 Don't use LWS hints by default for GPU post Mali-G72
by Michalis Spyrou
· 6 years ago
ae2af74
COMPMID-935 - Implementing Convolution with Winograd on OpenCL (Part 1)
by Gian Marco
· 6 years ago
d56e770
COMPMID-979: Add NHWC data layout to the tensor's metadata (Part 2)
by Isabella Gottardi
· 6 years ago
78c0090
COMPMID-754: Add validation to kernels.
by Georgios Pinitas
· 6 years ago
36a0a46
COMPMID-748 - Integrating optimized SGEMM for bifrost
by Gian Marco
· 6 years ago
1d25ed5
COMPMID-759 - CLGEMM optimization for McVail benchmarks
by Gian Marco
· 7 years ago
358ca20
COMPMID-617: Adds CLFullyConnectionLayer validation support
by Georgios Pinitas
· 7 years ago
fcd52fb
COMPMID-661: Vectorize im2col and add lws heuristics for convolution kernels #46
by Anthony Barbier
· 7 years ago
3e80c7f
COMPMID-661: Optimize FC layer with 2 new Bifrost kernels and LWS tuning (#33)
by Anton Lokhmotov
· 7 years ago
de691f0
COMPMID-524 - Implemented CLTuner object
by Gian Marco
· 7 years ago
edfa9f4
COMPMID-477 - Optimized batched case in CLConvolutionLayer
by Gian Marco Iodice
· 7 years ago
768e9f1
COMPMID-417: Cleanup CL FullyConnectedLayer
by Moritz Pflanzer
· 7 years ago
21efeb4
COMPMID-417: DepthConvert NEON for QS8/QS16.
by Georgios Pinitas
· 7 years ago
8a38369
COMPMID-434 - Port CLGEMM to support 16 bit fixed point
by Gian Marco Iodice
· 7 years ago
3a3066b
COMPMID-411 - Port CLGEMM to support 8 bit fixed point
by Gian Marco Iodice
· 7 years ago
6ff3b19
COMPMID-344 Updated doxygen
by Anthony Barbier
· 7 years ago