review.mlplatform.org / ml / ComputeLibrary / 575c5f1ae6bf6703b694d51d0818926809c7d8be / src / core
7377107
Scatter GPU Kernel Implementation for 1D tensors.
by Mohammed Suhail Munshi
· 3 months ago
6ac82a4
fix compilation errors on linux with gcc12
by Sunita Nadampalli
· 3 months ago
a668f9f
Add s8f32 kernels and dynamic QuantizationInfo
by Jonathan Deakin
· 5 months ago
cfca87b
Add SME2 implementation of softmax for FP16
by Gunes Bayir
· 3 months ago
f1f1f87
Add in place summation to CPU GEMM kernels
by Radu Salavat
· 4 months ago
553e241
Fix compiler error
by Pablo Marquez Tello
· 3 months ago
905786e
Added new NEON fixed format fast math mode hybrid kernel with maximum height of 6 for accumulation and updated heuristics
by Milos Puzovic
· 3 months ago
36a75da
[ONCPUML-1451] Add matmul kernel to enable bf16 to bf16 operations via PyTorch® autocast() function
by Renato Arantes
· 5 months ago
d219115
Make Cpu/Gpu/Ref scalar/vectoral S32 division consistent
by Gunes Bayir
· 3 months ago
3e4b193
Fix quant. gemv kernel driver by adding set_quantized_bias()
by Gunes Bayir
· 3 months ago
5a67733
arm_gemm: Fix bias handling for sme2 FP16 GEMV.
by David Mansell
· 3 months ago
57a8852
Fix WoA nightly failure
by Pablo Marquez Tello
· 4 months ago
2676424
Fix segfault in DWC in WoA
by Pablo Marquez Tello
· 4 months ago
c1787f0
Fix OpenBSD® build failure caused by patch 11144
by Gunes Bayir
· 4 months ago
ef63739
Integrate new pretranspose_b_array with extra fused transpose of B
by Gunes Bayir
· 4 months ago
7976f08
Fix compiler errors in cl-clang
by Pablo Marquez Tello
· 4 months ago
2db938c
Parallelize CPU depthwise over batch if only 1 row
by Jonathan Deakin
· 5 months ago
e695579
arm_gemm: SME: Remove artificial single-thread constraint on quantized int8 kernels.
by David Mansell
· 6 months ago
fb92e22
arm_gemm: convolution: optimize convolver.hpp.
by David Mansell
· 7 months ago
e8e016e
Fix for unchecked return value detected in Coverity checks.
by Anitha Raj
· 5 months ago
6829e02
Fix divide-by-zero compilation error
by Viet-Hoa Do
· 5 months ago
8896cf7
Fix minor issue, clean lut code
by Mohammed Suhail Munshi
· 5 months ago
27dee1e
Fix potential threading issue in LUTManager
by Mohammed Suhail Munshi
· 6 months ago
0eb9cfb
[ONCPUML-1387] Add ACL based reorder for f32 to bf16 data type conversion.
by Renato Arantes
· 7 months ago
7467ba8
Use look up table for fp16 activation
by Mohammed Suhail Munshi
· 7 months ago
c310c11
Fix nightly issue caused by gemm_reshaped_only_rhs_mmul kernel
by Gunes Bayir
· 6 months ago
85cafff
Add Mali™-G720 and Mali™-G620 as GpuTargets
by Gunes Bayir
· 6 months ago
feef9b9
Fix validation error in CL generate proposals kernel
by Gunes Bayir
· 7 months ago
0660172
Fix validation error in graph_ssd_mobilenet
by Gunes Bayir
· 7 months ago
eb475ec
Fix unit tests failing in CL/UNIT/TensorAllocator
by Gunes Bayir
· 7 months ago
4737094
Optimize CPU depth-to-space
by Viet-Hoa Do
· 8 months ago
fadc9b1
Optimize CpuSoftmaxKernel for axis=0
by Gunes Bayir
· 8 months ago
9f7aca9
Changes to enable FP16 in armv8a multi_isa
by Pablo Marquez Tello
· 11 months ago
8d4cdd4
BatchNorm changes to enable fp16 in armv8a multi_isa builds
by Pablo Marquez Tello
· 7 months ago
ba93371
NormalizationLayer changes to enable fp16 in armv8a multi_isa builds
by Pablo Marquez Tello
· 8 months ago
d4650e9
Fix various coverity issues
by SiCong Li
· 8 months ago
ec2afd6
Fix device issue with CL softmax
by Viet-Hoa Do
· 8 months ago
92c3d71
Remove duplicate definitions of BF16 fixed format kernels.
by David Mansell
· 8 months ago
c5ab4df
Optimize CpuGemmConv2d start-up time
by SiCong Li
· 8 months ago
bcf9552
Fix compilation error with clang and multi-isa
by Viet-Hoa Do
· 8 months ago
704c22f
[GPU] Update Reverse layer to allow negative axis and reversed axis order
by Adnan AlSinan
· 8 months ago
5ef0bdd
Fix SVE kernel using SVE2 instruction
by Viet-Hoa Do
· 8 months ago
29254ae
Optimize CL softmax
by Viet-Hoa Do
· 9 months ago
0fa92b8
arm_gemm: Add SME2 FP16 GEMV using FP16->FP32 dot product.
by David Mansell
· 8 months ago
098efc4
Revert "arm_gemm: Add SME2 FP16 GEMV."
by David Mansell
· 8 months ago
aeced74
arm_gemm: Add SME2 FP16 GEMV.
by David Mansell
· 9 months ago
95d477e
Remove padding from CL comparison operator
by Viet-Hoa Do
· 9 months ago
c210c85
Optimize CL reduction operation
by Viet-Hoa Do
· 9 months ago
fb9c25d
arm_gemm: fix 2D threading mode for SME2
by David Mansell
· 9 months ago
dfd56a6
Fix NEReorderKernel validation
by David Svantesson
· 9 months ago
0b72aa4
Optimize NEStackLayer
by Gunes Bayir
· 9 months ago
b6718c8
Fix compilation error caused by ambiguous std::abs call
by Gunes Bayir
· 9 months ago
a23b468
Optimize CLTranspose operator
by Jakub Sujak
· 9 months ago
0a99c79
Fix nightly NEON Reverse reference failure
by Adnan AlSinan
· 9 months ago
a396da1
Implement Quantized Matmul T/T and T/Nt kernels using MMUL extension
by Gunes Bayir
· 9 months ago
6e56bf3
Revise clang-format configuration
by Jakub Sujak
· 10 months ago
2ad0a6b
Implement Quantized Matmul Nt/T kernel using MMUL extension
by Gunes Bayir
· 9 months ago
ef9da00
Reimplement erf function
by Viet-Hoa Do
· 9 months ago
afd38f0
Apply clang-format on repository
by Felix Thomasmathibalan
· 9 months ago
bdcb4c1
Implement tflite compliant reverse for CPU
by Adnan AlSinan
· 9 months ago
0392160
Re-arrange header inclusion order
by Felix Thomasmathibalan
· 9 months ago
1f841a5
Optimize the main loop in mat_mul_native_quantized_mmul_nt_nt
by Gunes Bayir
· 9 months ago
500e10b
Add CL command buffer class
by Viet-Hoa Do
· 10 months ago
a116cd3
Implement Quantized MatMul kernel using MMUL extension
by Gunes Bayir
· 10 months ago
c071328
Fix include dependencies for mass reformatting patch
by Gunes Bayir
· 10 months ago
e87fa66
Add skeleton of ClMatMulLowpNativeMMULKernel
by Gunes Bayir
· 10 months ago
b566b6e
Extend Neon ReshapeLayer validation tests
by Anitha Raj
· 10 months ago
0d27b2e
Remove legacy PostOps code
by Jakub Sujak
· 10 months ago
b7aefd7
GEMM: AArch32: Split assembler block in a32_merge_float_8x6.hpp
by David Mansell
· 10 months ago
082630b
Update CpuGemmConv2d and CpuFlatten to use CpuReshape operator
by Anitha Raj
· 10 months ago
1b2ee3e
CPU: Depthwise: Generate correct size for input indirection array.
by David Mansell
· 10 months ago
eb5696d
Optimize CpuReshapeKernel
by Anitha Raj
· 12 months ago
246fe08
Fix various static check issues
by Viet-Hoa Do
· 11 months ago
48b6d17
Check CL command buffer extension
by Viet-Hoa Do
· 11 months ago
338ef46
Optimize CLReduce for Min/Max Axis=0
by Gunes Bayir
· 11 months ago
0c19f59
Fix CL Tile operator
by Viet-Hoa Do
· 11 months ago
16b3752
Port ElementwiseBinary to CKW part 2
by SiCong Li
· 11 months ago
9129549
Retain back-compatibility for arm_compute/core/Types.h
by SiCong Li
· 11 months ago
8dfb882
Enable S64 output in CLArgMinMax
by Pablo Marquez Tello
· 12 months ago
4a1c917
Add support for input S64/U64 in CpuCastKernel
by Pablo Marquez Tello
· 11 months ago
314d3e2
Break up core/Utils.h to reduce unused code being included everywhere
by Matthew Bentham
· 1 year ago
4c30de0
Enable premultiplication for depthwise convolution with fp16 and quantized types
by Michael Tyler
· 12 months ago
8e2dede
Add Bias to MatMul Kernels and add support for use in Fully Connected Layer
by Mohammed Suhail Munshi
· 1 year ago
4c0a38a
Disable kernel size 3 in argminmax for axis 0
by Pablo Marquez Tello
· 12 months ago
c9eeee5
Fix nightly failures in MatMulLowpNativeKernel when using bounded activation functions
by Mohammed Suhail Munshi
· 12 months ago
9b392d7
Rewrote CLArgMinMax for axis 0
by Pablo Marquez Tello
· 1 year ago
4cf8067
Fix unused function warning
by Michael Tyler
· 12 months ago
8deee9b
Depthwise channel pre-multiplication
by Michael Tyler
· 12 months ago
19844f6
Add Kernel Writer driver code to dynamic fusion
by SiCong Li
· 1 year, 1 month ago
00474e9
Implement FP32/16 MatMul Lhs T Rhs T/NT kernel using MMUL extension
by Gunes Bayir
· 1 year ago
8c49f16
Add helpers to set CKW tensor components as OpenCL kernel arguments
by Jakub Sujak
· 1 year ago
a2bb80e
Use MatMul in fully connected layer with dynamic weights when supported
by Mohammed Suhail Munshi
· 1 year ago
c952596
Implement FP32/FP16 MatMul NT/T kernel using the MMUL extension
by Ramy Elgammal
· 1 year, 1 month ago
a2561f0
Fix doxygen warnings
by ramy.elgammal@arm.com
· 1 year ago
3f16c57
Enable vmfa in arm7va/aarch32 when present
by Pablo Marquez Tello
· 1 year ago
a8d8058
Implement FP32/FP16 MatMul NT/NT kernel using the MMUL extension
by SiCong Li
· 1 year, 1 month ago
94abde4
Add Fused Activation to OpenCL MatMul
by Mohammed Suhail Munshi
· 1 year, 1 month ago
043613f
Break up Utils.h a bit to reduce unused code being included everywhere
by Matthew Bentham
· 1 year, 1 month ago
f1aeab9
Break up arm_compute/core/Types.h a bit
by Matthew Bentham
· 1 year, 1 month ago
6c7cf31
Fix build error for armv7a
by Pablo Marquez Tello
· 1 year, 1 month ago