Gitiles
Code Review
Sign In
review.mlplatform.org
/
ml
/
ComputeLibrary
/
d2d5f759601bd91ffa0f9258dcf501c9a6955869
/
src
/
core
/
CL
/
cl_kernels
d2d5f75
COMPMID-1199: Fixed mismatches in CLColorConvert.
by Pablo Tello
· 6 years ago
e03342e
COMPMID-799 - Use new OpenCL 8-bit dot product instruction
by Michalis Spyrou
· 7 years ago
46da23f
COMPMID-813 Add NHWC data format support for CL scale
by Michalis Spyrou
· 6 years ago
1b99338
COMPMID-568: Implement Canny edge function for CL/NEON
by Abe Mbise
· 7 years ago
76c8564
COMPMID-1083 : Compute library should be made usable on non-ARM platforms
by Vidhya Sudhan Loganathan
· 6 years ago
bc06f99
COMPMID-959: Fix deconvolution window.
by Georgios Pinitas
· 6 years ago
bdff491
COMPMID-1083 : Compute library should be made usable on non-ARM platforms
by Vidhya Sudhan Loganathan
· 6 years ago
793f87d
COMPMID-1176: Add FP16 support in CLDeconvolutionLayer.
by Georgios Pinitas
· 6 years ago
efac7c6
COMPMID-959: Fix CLChannelShuffle
by Michele Di Giorgio
· 6 years ago
f6f08da
COMPMID-1044: Optimizing GCGEMM - Support for not reshaped GEMM on GLES
by Michele Di Giorgio
· 6 years ago
55b3d12
COMPMID-1137 OpenCL concatenate width
by Michalis Spyrou
· 6 years ago
8422558
COMPMID-1150 : (OCLGrind) Kernel compilation error and assertion
by Georgios Pinitas
· 6 years ago
657bdb3
COMPMID-1050 CL/NEON: Create a function to convert the 2D weights of FC layer from NHWC to NCHW and viceversa
by Giorgio Arena
· 6 years ago
bb36a8e
COMPMID-922 - CLGEMM FP16 optimizations - part2
by Gian Marco Iodice
· 6 years ago
d24af8a
COMPMID-1125: Add support for FP16 in CLDepthwiseConvolution
by Michele Di Giorgio
· 6 years ago
7217563
COMPMID-1107: Add support for ChannelShuffle in CL
by Michele Di Giorgio
· 6 years ago
7657224
COMPMID-926 Add depth multiplier support to NEON/CL/GLES depthwise convolution
by Giorgio Arena
· 6 years ago
e74b201
COMPMID-805 Add NHWC data format support for CL pooling
by Michalis Spyrou
· 6 years ago
2213d4b
COMPMID-1096 - Add fast_math flag to CLConvolutionLayer
by Gian Marco Iodice
· 6 years ago
bf3c662
COMPMID-803: Add NHWC data format support for CL batch normalisation
by Michele Di Giorgio
· 6 years ago
fda901f
COMPMID-1078 Fix CL DepthwiseConvolutionLayer QASYMM8 failing validation
by Giorgio Arena
· 6 years ago
e52a300
COMPMID-1026 - Add support for 4x4 output tile in CLWinogradConvolutionLayer
by Gian Marco Iodice
· 6 years ago
dd03870
COMPMID-1037 Add support for F(4x4, 5x5) in CLWinogradOutputTransformKernel
by Giorgio Arena
· 6 years ago
4d33630
COMPMID-987: Make beta and gamma optional in BatchNormalization
by Michele Di Giorgio
· 6 years ago
c9c62c2
COMPMID-1056 - Optimizing CLGEMMMatrixMultiplyKernel refactoring the inner loop
by Gian Marco Iodice
· 6 years ago
fd68311
COMPMID-922 - CLGEMM FP16 optimizations - part1
by Gian Marco Iodice
· 6 years ago
dfca60b
COMPMID-811 Add NHWC data format support for CL depthwise convolution QASYMM8
by Giorgio Arena
· 6 years ago
fe5ef38
COMPMID-1037 Add support for F(4x4, 5x5) in CLWinogradInputTransformKernel
by Giorgio Arena
· 6 years ago
9373c8b
COMPMID-1037 Add support for F(4x4, 5x5) in CLWinogradFilterTransformKernel
by Giorgio Arena
· 6 years ago
ae3c8ab
COMPMID-584: Add validation to channel_combine kernels
by Ioan-Cristian Szabo
· 7 years ago
8de9261
COMPMID-585: Port OpticalFlow to new validation
by John Richardson
· 6 years ago
eb8a399
COMPMID-994 : Check cl_arm_printf is supported in the CLScheduler
by Vidhya Sudhan Loganathan
· 6 years ago
3ebef32
COMPMID-949: Optimizing CLDepthwiseConvolution3x3Kernel for FP16
by Michele Di Giorgio
· 6 years ago
684cb0f
COMPMID-596: Port HOGDetector to new validation
by John Richardson
· 7 years ago
7da29b6
COMPMID-1017: Implement dilated convolution in NEON, OpenCL, and GC
by Alex Gilday
· 6 years ago
81b28c4
COMPMID-1032 - Fixing bug in CLGEMM when is_interleaved_transposed=true
by Gian Marco Iodice
· 6 years ago
5c8e05c
COMPMID-1019 Implement copy function CL
by Michalis Spyrou
· 6 years ago
c357c47
COMPMID-1008: Fix Doxygen issues
by Alex Gilday
· 6 years ago
2d9de0a
COMPMID-1009 Support 4x4 output tile for Winograd Filter Transform on OpenCL.
by Giorgio Arena
· 6 years ago
d2fab73
COMPMID-935 - Implementing Convolution with Winograd on OpenCL (part 4)
by Gian Marco Iodice
· 6 years ago
99ac60b
COMPMID-853 Fuse CL DepthwiseConvolution with Activation for QASYM8
by Giorgio Arena
· 6 years ago
7e4b239
COMPMID-935 - Implementing Convolution with Winograd on OpenCL (part 2)
by Gian Marco Iodice
· 6 years ago
1f9ca1d
COMPMID-935 Implementing Convolution with Winograd on OpenCL (part 3)
by Giorgio Arena
· 6 years ago
ae2af74
COMPMID-935 - Implementing Convolution with Winograd on OpenCL (Part 1)
by Gian Marco
· 6 years ago
5b2191e
COMPMID-765: Fix incorrect comma position in DepthwiseConv cl kernel.
by Georgios Pinitas
· 6 years ago
933fe86
COMPMID-927: Adding support for FP16 in CLDepthwiseConvolutionLayer3x3
by Michele Di Giorgio
· 6 years ago
9414f64
COMPMID-582: Add validation to channel_extract kernels.
by Ioan-Cristian Szabo
· 7 years ago
287b570
COMPMID-853 Use tile 2 for CL depthwise convolution QASYM8
by Giorgio Arena
· 6 years ago
72f39be
COMPMID-939 Fix mismatches and finalize CLSoftmaxLayer optimization
by Giorgio Arena
· 6 years ago
3cfd237
COMPMID-938: OCLgrind: Mismatches in depthwise convolution on Bifrost
by Georgios Pinitas
· 6 years ago
19835e5
COMPMID-882 - Optimizing GEMMLowp on OpenCL reshaping matrices
by Gian Marco
· 6 years ago
4402cb9
COMPMID-905 Optimize CLSoftmaxLayer for QASYMM8
by Giorgio Arena
· 6 years ago
a086a0a
COMPMID-765 Move direct convolution output stage to the right file
by Giorgio Arena
· 6 years ago
de5a1cc
COMPMID-856: CL Depthwise Convolution QASYMM8 support
by Georgios Pinitas
· 6 years ago
b99f00d
COMPMID-905 Asymm functions support for all vec sizes
by Giorgio Arena
· 6 years ago
a527e8c
COMPMID-828 - Add support for pool widths 4, 5 & 6 and for non square data sizes - Part 2 (CL)
by Isabella Gottardi
· 6 years ago
1167487
COMPMID-897 Merge batch normalization with bounded relu
by Giorgio Arena
· 6 years ago
4e1e7dc
COMPMID-892: OCLGrind failures on both validation and benchmark
by Georgios Pinitas
· 6 years ago
6232d04
COMPMID-907 Optimizing FixedPoint calculation in the output stage of GEMMLowp
by Giorgio Arena
· 6 years ago
25a340f
COMPMID-578: Implement FAST corners for CL/NEON
by Abe Mbise
· 7 years ago
54f18c4
COMPMID-901 - Optimizing CLCol2ImKernel
by Gian Marco
· 6 years ago
c799ed8
COMPMID-895 - Optimizing CLDepthwiseConvolution3x3Kernel
by Gian Marco
· 6 years ago
76faef8
COMPMID-855 - Optimizing im2col on OpenCL (DCHW)
by Gian Marco
· 6 years ago
f6402dd
COMPMID-834 Fix arm_compute_nightly_validation getting killed
by Michalis Spyrou
· 7 years ago
36a0a46
COMPMID-748 - Integrating optimized SGEMM for bifrost
by Gian Marco
· 7 years ago
7b4d547
COMPMID-816 - Optimizing CLGEMMLowpMatrixMultiplyCore - Part1
by Gian Marco
· 7 years ago
a1f7e33
COMPMID-841: Add CL QASYMM8 RELU Activation
by Michele Di Giorgio
· 7 years ago
5237e01
COMPMID-838 Implement CLPermute
by Michalis Spyrou
· 7 years ago
652bde5
COMPMID-674 - Create Google InceptionV3 example
by Georgios Pinitas
· 7 years ago
a0d1183
COMPMID-751 QASYMM8 ActivationLayer optimisation: don't requantize if not necessary
by Giorgio Arena
· 7 years ago
944d3f7
COMPMID-751 Processing 8 elements makes computation up to 80us faster on MobileNet QASYMM8 dwc layers
by Giorgio Arena
· 7 years ago
780db4e
COMPMID-471 Implement Deconvolution on OpenCL
by Michalis Spyrou
· 7 years ago
25f2368
COMPMID-589: Port HOGDescriptor to new validation
by John Richardson
· 7 years ago
1d08a31
COMPMID-765: Collapse execution window in CL kernels.
by Georgios Pinitas
· 7 years ago
5124be5
COMPMID-661: Convolution quantized (#32)
by Chunosov
· 7 years ago
fcd52fb
COMPMID-661: Vectorize im2col and add lws heuristics for convolution kernels #46
by Anthony Barbier
· 7 years ago
58c5794
COMPMID-706 - Add GEMMLowp output stage for scaling by a fixed point number
by Gian Marco
· 7 years ago
0162436
COMPMID-684: 2D In-Map normalization support for CL
by Georgios Pinitas
· 7 years ago
45bcc3a
COMPMID-661: QASYMM8 support for fully connected layer.
by Georgios Pinitas
· 7 years ago
6fdfaa8
COMPMID-713: Address failures in OCLGrind for CLDirectConvolution
by Georgios Pinitas
· 7 years ago
47b5603
COMPMID-712: OCLGrind CLSoftmaxLayer quantized failures
by Georgios Pinitas
· 7 years ago
c000fb8
COMPMID-714: Resolve failures in OCLGrind for CLPhase
by Georgios Pinitas
· 7 years ago
05288a2
COMPMID-697 - Rework GEMMLowp interface on OpenCL
by Gian Marco
· 7 years ago
f202e50
COMPMID-556 Improved indentation and error handling in format_doxygen.py
by Anthony Barbier
· 7 years ago
c809712
COMPMID-556 Add saturation to 8-bit activation. This prevents undefined overflow
by Rob Hughes
· 7 years ago
02bf80d
COMPMID-661: Fix scale border issue (#38)
by Daniil Efremov
· 7 years ago
3e80c7f
COMPMID-661: Optimize FC layer with 2 new Bifrost kernels and LWS tuning (#33)
by Anton Lokhmotov
· 7 years ago
d7295b7
COMPMID-661: Add QASYMM8 support (and basic tests) to CLDepthwiseConvolution3x3 kernel (#28)
by Dmitry Savenko
· 7 years ago
540d008
COMPMID-556: Fixes bias in CLDirectConvolutionLayer to be int32.
by Georgios Pinitas
· 7 years ago
7a49c79
COMPMID-661: issue# 23 Scale border fix (#26)
by Daniil Efremov
· 7 years ago
f1f3ebd
APPBROWSER-298, APPBROWSER-306: Reimplement the common code of compute shader
by Joel Liang
· 7 years ago
624b778
COMPMID-556: Fix CLNormalization issues.
by Georgios Pinitas
· 7 years ago
2f18579
COMPMID-661: Fix rounding in average pooling for uint8.
by Georgios Pinitas
· 7 years ago
f450caa
COMPMID-661: softmax-uint8 implementation (#16)
by Chunosov
· 7 years ago
7068f99
COMPMID-631: Merge branches/gles_compute branch
by Anthony Barbier
· 7 years ago
4df76c9
COMPMID-661: Fix beta in softmax_layer kernel
by Georgios Pinitas
· 7 years ago
af6204c
COMPMID-661: Add avgpool-uint8 support. Optimize avgpool-fp32 for Bifrost. (#13)
by Anton Lokhmotov
· 7 years ago
d6afedc
COMPMID-661: softmax-fp32 optimisation (#14)
by Chunosov
· 7 years ago
d621bca
COMPMID-661: directconv-uint8 (#20)
by Chunosov
· 7 years ago
6f31f8c
Allow running without cl_khr_fp16
by Matthew Bentham
· 7 years ago
Next »