review.mlplatform.org / ml / ComputeLibrary / cbbed288a71f2f048123db3cf396361e5d66ce93 / src / core / CL / kernels / CLGEMMMatrixMultiplyKernel.cpp
b238f5f
COMPMID-2539: Add bias addition check in CLGEMM validation
by Gian Marco Iodice
· 5 years ago
d1f5476
COMPMID-1979: Fuse Activation Function in CLGEMM - part 3
by Gian Marco Iodice
· 5 years ago
82d9dd1
COMPMID-2380: Create utility functions for is_one and is_zero with float
by Gian Marco Iodice
· 5 years ago
ebc3a90
COMPMID-1706: Fuse the bias addition within CLGEMM
by Michele Di Giorgio
· 6 years ago
17a01a3
COMPMID-1866: Revisit padding and window on CLDepthwiseConvolutionNHWC
by Michele Di Giorgio
· 6 years ago
1c9efeb
Issue COMPMID-1835: Remove CLGEMMInterleave4x4Kernel and replace with CLGEMMReshapeLHSMatrixKernel
by giuros01
· 6 years ago
8b6b4a9
COMPMID-1836: Remove CLGEMMTranspose1xWKernel and replace with CLGEMMReshapeRHSMatrixKernel
by giuros01
· 6 years ago
38d93bd
COMPMID-1801 : (Nightly) CLWinogradConvolutionLayer FP16 mismatches
by Vidhya Sudhan Loganathan
· 6 years ago
a25d16c
COMPMID-1266 : Add support for FP16 in CLWinogradConvolutionLayer: 5x5 kernels
by Vidhya Sudhan Loganathan
· 6 years ago
3139f03
COMPMID-1736: Fixed out-of-bound write in CLIm2Col
by Gian Marco Iodice
· 6 years ago
c4f582e
COMPMID-1451: Reverting changes for CLGEMM and CLGEMMLowp previously done (384496)
by Isabella Gottardi
· 6 years ago
f02e527
COMPMID-1607 - (Nightly) CLGEMMLowpMatrixMultiplyCore errors and mismatches
by Isabella Gottardi
· 6 years ago
b92805b
COMPMID-1607 - (Nightly) CLGEMMLowpMatrixMultiplyCore errors and mismatches
by Isabella Gottardi
· 6 years ago
e3d24ce
COMPMID-708 Fix AccessWindowTranspose
by Giorgio Arena
· 6 years ago
b6eb353
COMPMID-1478: Stop relying on static default OpenCL objects in cl2.hpp
by Anthony Barbier
· 6 years ago
d39e2b1
COMPMID-1188 - Fixed performance degradation with GEMM3D
by Gian Marco Iodice
· 6 years ago
68a3f56
COMPMID-1276 - Allow GEMM to work with 3D input tensor
by Gian Marco Iodice
· 6 years ago
e8bd2c7
COMPMID-1384: graph_mobilenet fails for NHWC on OpenCL
by Georgios Pinitas
· 6 years ago
7485d5a
COMPMID-970 : Remove QS8 / QS16 support
by Vidhya Sudhan Loganathan
· 6 years ago
8e74f44
COMPMID-911: Allow GEMM to work with 3D tensors
by Isabella Gottardi
· 6 years ago
17812ba
COMPMID-817: Tuner: Port kernels to new design.
by Georgios Pinitas
· 6 years ago
f1f4906
COMPMID-655 : Check FP16 is supported by the GPU
by Vidhya Sudhan Loganathan
· 6 years ago
750641d
COMPMID-1052 - Rework validate method in CLGEMM
by Gian Marco Iodice
· 6 years ago
bb36a8e
COMPMID-922 - CLGEMM FP16 optimizations - part2
by Gian Marco Iodice
· 6 years ago
535fedd
COMPMID-1117: TransposeAccessWindow leads to high padding
by Georgios Pinitas
· 6 years ago
e52a300
COMPMID-1026 - Add support for 4x4 output tile in CLWinogradConvolutionLayer
by Gian Marco Iodice
· 6 years ago
fd68311
COMPMID-922 - CLGEMM FP16 optimizations - part1
by Gian Marco Iodice
· 6 years ago
56e8e86
COMPMID-1031: Use LWS hints for G51, G51BIG, G51LIT, and TNOX
by Sam Laynton
· 6 years ago
81b28c4
COMPMID-1032 - Fixing bug in CLGEMM when is_interleaved_transposed=true
by Gian Marco Iodice
· 6 years ago
d2fab73
COMPMID-935 - Implementing Convolution with Winograd on OpenCL (part 4)
by Gian Marco Iodice
· 6 years ago
a967611
COMPMID-886 Don't use LWS hints by default for GPU post Mali-G72
by Michalis Spyrou
· 6 years ago
ae2af74
COMPMID-935 - Implementing Convolution with Winograd on OpenCL (Part 1)
by Gian Marco
· 6 years ago
d56e770
COMPMID-979: Add NHWC data layout to the tensor's metadata (Part 2)
by Isabella Gottardi
· 6 years ago
78c0090
COMPMID-754: Add validation to kernels.
by Georgios Pinitas
· 7 years ago
36a0a46
COMPMID-748 - Integrating optimized SGEMM for bifrost
by Gian Marco
· 7 years ago
1d25ed5
COMPMID-759 - CLGEMM optimization for McVail benchmarks
by Gian Marco
· 7 years ago
358ca20
COMPMID-617: Adds CLFullyConnectionLayer validation support
by Georgios Pinitas
· 7 years ago
fcd52fb
COMPMID-661: Vectorize im2col and add lws heuristics for convolution kernels #46
by Anthony Barbier
· 7 years ago
3e80c7f
COMPMID-661: Optimize FC layer with 2 new Bifrost kernels and LWS tuning (#33)
by Anton Lokhmotov
· 7 years ago
de691f0
COMPMID-524 - Implemented CLTuner object
by Gian Marco
· 7 years ago
edfa9f4
COMPMID-477 - Optimized batched case in CLConvolutionLayer
by Gian Marco Iodice
· 7 years ago
768e9f1
COMPMID-417: Cleanup CL FullyConnectedLayer
by Moritz Pflanzer
· 7 years ago
21efeb4
COMPMID-417: DepthConvert NEON for QS8/QS16.
by Georgios Pinitas
· 7 years ago
8a38369
COMPMID-434 - Port CLGEMM to support 16 bit fixed point
by Gian Marco Iodice
· 7 years ago
3a3066b
COMPMID-411 - Port CLGEMM to support 8 bit fixed point
by Gian Marco Iodice
· 7 years ago
6ff3b19
COMPMID-344 Updated doxygen
by Anthony Barbier
· 7 years ago