Gitiles
Code Review
Sign In
review.mlplatform.org
/
ml
/
ComputeLibrary
/
26764247073f959e3e56db2a14b7e9dd81bb1092
/
src
2676424
Fix segfault in DWC in WoA
by Pablo Marquez Tello
· 4 months ago
c1787f0
Fix OpenBSD® build failure caused by patch 11144
by Gunes Bayir
· 4 months ago
ef63739
Integrate new pretranspose_b_array with extra fused transpose of B
by Gunes Bayir
· 5 months ago
0a48c4c
Requantization cases for offset changes only
by Mohammed Suhail Munshi
· 5 months ago
7976f08
Fix compiler errors in cl-clang
by Pablo Marquez Tello
· 5 months ago
0c85334
Fix parallel depthwise perf regression from 2db938c
by Jonathan Deakin
· 5 months ago
0e73498
Add support for QSYMM8 in ClCastKernel
by Pablo Marquez Tello
· 5 months ago
0ee13af
Remove CKW prototype and Template Writer
by Gunes Bayir
· 5 months ago
a3e1b50
Fix the bug in GpuTanh operator in dynamic fusion
by Gunes Bayir
· 5 months ago
a5a81ae
Mark GpuSoftmax and GpuReshape as not supported
by Gunes Bayir
· 5 months ago
2db938c
Parallelize CPU depthwise over batch if only 1 row
by Jonathan Deakin
· 5 months ago
e695579
arm_gemm: SME: Remove artificial single-thread constraint on quantized int8 kernels.
by David Mansell
· 6 months ago
0c17c4b
Fix leftover cols in CpuGemmLowpMatrixBReductionKernel
by Jonathan Deakin
· 6 months ago
2b9fa59
Use the stable CKW API in the GPU dynamic fusion backend
by Gunes Bayir
· 5 months ago
fb92e22
arm_gemm: convolution: optimize convolver.hpp.
by David Mansell
· 7 months ago
bde6e78
Fix for Logically dead code detected in Coverity checks
by Anitha Raj
· 5 months ago
e8e016e
Fix for unchecked return value detected in Coverity checks.
by Anitha Raj
· 5 months ago
fdf56fb
Make GpuWorkloadContext own all tensor info objects
by Viet-Hoa Do
· 5 months ago
6829e02
Fix divide-by-zero compilation error
by Viet-Hoa Do
· 5 months ago
8896cf7
Fix minor issue, clean lut code
by Mohammed Suhail Munshi
· 5 months ago
27dee1e
Fix potential threading issue in LUTManager
by Mohammed Suhail Munshi
· 6 months ago
0eb9cfb
[ONCPUML-1387] Add ACL based reorder for f32 to bf16 data type conversion.
by Renato Arantes
· 7 months ago
5d7a93a
Fix compilation error on GCC 13.2
by Jakub Sujak
· 6 months ago
7467ba8
Use look up table for fp16 activation
by Mohammed Suhail Munshi
· 7 months ago
7fe7791
Prevent RELU from being processed thru LUT in INT8
by Sangwon Ha
· 6 months ago
c310c11
Fix nightly issue caused by gemm_reshaped_only_rhs_mmul kernel
by Gunes Bayir
· 6 months ago
85cafff
Add Mali™-G720 and Mali™-G620 as GpuTargets
by Gunes Bayir
· 6 months ago
306a8a9
Fix nightly bug caused by not validation 3d cases for input tensor
by Gunes Bayir
· 7 months ago
ec0a057
Revert "Fix nightly bug caused by wrong validation in Gemm mmul kernel"
by Gunes Bayir
· 7 months ago
feef9b9
Fix validation error in CL generate proposals kernel
by Gunes Bayir
· 7 months ago
270576a
Fix nightly bug caused by wrong validation in Gemm mmul kernel
by Gunes Bayir
· 7 months ago
b526431
Winograd changes to enable fp16 in armv8a multi_isa builds
by Pablo Marquez Tello
· 7 months ago
0660172
Fix validation error in graph_ssd_mobilenet
by Gunes Bayir
· 7 months ago
eb475ec
Fix unit tests failing in CL/UNIT/TensorAllocator
by Gunes Bayir
· 7 months ago
4737094
Optimize CPU depth-to-space
by Viet-Hoa Do
· 8 months ago
17e116e
Revert "thread_local _custom_scheduler"
by Pablo Marquez Tello
· 7 months ago
fadc9b1
Optimize CpuSoftmaxKernel for axis=0
by Gunes Bayir
· 8 months ago
9f7aca9
Changes to enable FP16 in armv8a multi_isa
by Pablo Marquez Tello
· 11 months ago
8d4cdd4
BatchNorm changes to enable fp16 in armv8a multi_isa builds
by Pablo Marquez Tello
· 7 months ago
568aab6
CpuMul changes to enable fp16 in armv8a multi_isa builds
by Pablo Marquez Tello
· 7 months ago
ded5b18
thread_local _custom_scheduler
by David Svantesson
· 11 months ago
ba93371
NormalizationLayer changes to enable fp16 in armv8a multi_isa builds
by Pablo Marquez Tello
· 8 months ago
d4650e9
Fix various coverity issues
by SiCong Li
· 8 months ago
ec2afd6
Fix device issue with CL softmax
by Viet-Hoa Do
· 8 months ago
c63f8b0
Update comments to suppress doxygen warnings.
by Anitha Raj
· 8 months ago
24c140f
Fix CpuGemmConv2d int8 segfault
by SiCong Li
· 8 months ago
92c3d71
Remove duplicate definitions of BF16 fixed format kernels.
by David Mansell
· 8 months ago
01b0f9b
Pooling changes to enable fp16 in armv8a multi_isa builds
by Pablo Marquez Tello
· 8 months ago
64f4a30
DepthwiseConvolution changes to enable fp16 in armv8a multi_isa builds
by Pablo Marquez Tello
· 8 months ago
c5ab4df
Optimize CpuGemmConv2d start-up time
by SiCong Li
· 9 months ago
4a9dbed
Update heuristic for MatMul Native U8
by Gian Marco Iodice
· 8 months ago
a7ddd60
Add support for Arm® Cortex®-A520 and Arm® Cortex®-R82
by Viet-Hoa Do
· 8 months ago
bcf9552
Fix compilation error with clang and multi-isa
by Viet-Hoa Do
· 8 months ago
704c22f
[GPU] Update Reverse layer to allow negative axis and reversed axis order
by Adnan AlSinan
· 8 months ago
fde45d8
Extend CKW MatMul with nt_t
by Adnan AlSinan
· 8 months ago
5ef0bdd
Fix SVE kernel using SVE2 instruction
by Viet-Hoa Do
· 8 months ago
29254ae
Optimize CL softmax
by Viet-Hoa Do
· 9 months ago
e5362e7
DirectConv and Im2Col changes to enable fp16 in armv8a multi_isa builds
by Pablo Marquez Tello
· 9 months ago
72b7471
Add check to disable dynamic bias with quantized datatypes in Conv2D layer
by Mohammed Suhail Munshi
· 9 months ago
074b985
FuseBatchNorm changes to enable fp16 in armv8a multi_isa builds
by Pablo Marquez Tello
· 9 months ago
0fa92b8
arm_gemm: Add SME2 FP16 GEMV using FP16->FP32 dot product.
by David Mansell
· 9 months ago
098efc4
Revert "arm_gemm: Add SME2 FP16 GEMV."
by David Mansell
· 9 months ago
c1204c7
Connect MatMul MMUL kernels to ClMatMul operator
by Gunes Bayir
· 9 months ago
d8a397e
Fix build error in CpuScale
by Pablo Marquez Tello
· 9 months ago
b5cb4d2
Scale changes to enable fp16 in armv8a multi_isa builds
by Pablo Marquez Tello
· 9 months ago
aeced74
arm_gemm: Add SME2 FP16 GEMV.
by David Mansell
· 9 months ago
95d477e
Remove padding from CL comparison operator
by Viet-Hoa Do
· 9 months ago
c210c85
Optimize CL reduction operation
by Viet-Hoa Do
· 9 months ago
fb9c25d
arm_gemm: fix 2D threading mode for SME2
by David Mansell
· 9 months ago
9aa153a
Fix build error
by Pablo Marquez Tello
· 9 months ago
dfd56a6
Fix NEReorderKernel validation
by David Svantesson
· 9 months ago
d9c1d44
Port MatMul to Dynamic Fusion + CKW boilerplate code
by Adnan AlSinan
· 9 months ago
0b72aa4
Optimize NEStackLayer
by Gunes Bayir
· 9 months ago
b6718c8
Fix compilation error caused by ambiguous std::abs call
by Gunes Bayir
· 9 months ago
6777359
CpuSubKernel changes to enable fp16 in armv8a multi_isa builds
by Pablo Marquez Tello
· 9 months ago
1434155
Change heuristics for FP16 Deconv
by Sangwon Ha
· 9 months ago
68b6dce
Pool2d changes to enable fp16 in armv8a multi_isa builds
by Pablo Marquez Tello
· 9 months ago
a23b468
Optimize CLTranspose operator
by Jakub Sujak
· 9 months ago
a04ae3e
Port DepthwiseConv2d operator to Ckw
by ramy.elgammal@arm.com
· 11 months ago
745153b
NEDeconvolutionLayer validation fix
by Pablo Marquez Tello
· 9 months ago
0a99c79
Fix nightly NEON Reverse reference failure
by Adnan AlSinan
· 9 months ago
c2a51bd
Optimize CL and Neon Winograd tests
by Gunes Bayir
· 9 months ago
a396da1
Implement Quantized Matmul T/T and T/Nt kernels using MMUL extension
by Gunes Bayir
· 9 months ago
6e56bf3
Revise clang-format configuration
by Jakub Sujak
· 10 months ago
2ad0a6b
Implement Quantized Matmul Nt/T kernel using MMUL extension
by Gunes Bayir
· 9 months ago
ef9da00
Reimplement erf function
by Viet-Hoa Do
· 9 months ago
afd38f0
Apply clang-format on repository
by Felix Thomasmathibalan
· 9 months ago
bdcb4c1
Implement tflite compliant reverse for CPU
by Adnan AlSinan
· 9 months ago
729099c
Enable job-chaining with incremental job_chaining_size.
by Anitha Raj
· 9 months ago
0392160
Re-arrange header inclusion order
by Felix Thomasmathibalan
· 9 months ago
6d87887
Select changes to enable fp16 in armv8a multi_isa builds
by Pablo Marquez Tello
· 9 months ago
6b6ba9e
Maxunpooling changes to enable fp16 in armv8a multi_isa builds
by Pablo Marquez Tello
· 9 months ago
8562a4e
Remove CommonGraphOptions from Utils target and warnings
by Paolo Tricerri
· 9 months ago
1f841a5
Optimize the main loop in mat_mul_native_quantized_mmul_nt_nt
by Gunes Bayir
· 9 months ago
e9fd8b4
L2Norm changes to enable fp16 in armv8a multi_isa builds
by Pablo Marquez Tello
· 9 months ago
f57d6ec
Gemm changes to enable fp16 in armv8a multi_isa builds
by Pablo Marquez Tello
· 9 months ago
e071b5e
Fix the validation issue in AddMulAdd fused kernel
by Gunes Bayir
· 9 months ago
500e10b
Add CL command buffer class
by Viet-Hoa Do
· 10 months ago
a116cd3
Implement Quantized MatMul kernel using MMUL extension
by Gunes Bayir
· 10 months ago
40a9d3e
Remove deprecated support for BF16 in CpuCast
by Adnan AlSinan
· 10 months ago
Next »