Gitiles
Code Review
Sign In
review.mlplatform.org
/
ml
/
ComputeLibrary
/
refs/heads/branches/arm_compute_24_04
/
src
« Previous
473b829
Adds Tests and reference implementation for scatter operator with 1D tensors.
by Mohammed Suhail Munshi
· 4 months ago
8609ca0
Add skeleton for CLScatter op, reference and tests
by Mohammed Suhail Munshi
· 5 months ago
36a75da
[ONCPUML-1451] Add matmul kernel to enable bf16 to bf16 operations via PyTorch® autocast() function
by Renato Arantes
· 6 months ago
d219115
Make Cpu/Gpu/Ref scalar/vectoral S32 division consistent
by Gunes Bayir
· 4 months ago
c00a82b
Fix overflow in NEMeanStdDevNormalizationKernel
by Pablo Marquez Tello
· 4 months ago
3e4b193
Fix quant. gemv kernel driver by adding set_quantized_bias()
by Gunes Bayir
· 4 months ago
5a67733
arm_gemm: Fix bias handling for sme2 FP16 GEMV.
by David Mansell
· 5 months ago
3ac0b87
Fix validation in pool2d assembly wrapper
by Pablo Marquez Tello
· 5 months ago
93e743f
Optimize CpuSoftmaxKernel for axis != 0 and neon kernels
by Omar Al Khatib
· 7 months ago
57a8852
Fix WoA nightly failure
by Pablo Marquez Tello
· 5 months ago
9167c9c
Prefer indirect Gemm vs. Direct convolution if supported
by Gunes Bayir
· 5 months ago
40af090
Disable FP16 on 32 bit
by Pablo Marquez Tello
· 5 months ago
bf05373
Fix performance regression in fixed-format kernels
by Gunes Bayir
· 5 months ago
6fe9eaf
Set Neon™ as present for WoA
by Pablo Marquez Tello
· 5 months ago
2676424
Fix segfault in DWC in WoA
by Pablo Marquez Tello
· 5 months ago
c1787f0
Fix OpenBSD® build failure caused by patch 11144
by Gunes Bayir
· 5 months ago
ef63739
Integrate new pretranspose_b_array with extra fused transpose of B
by Gunes Bayir
· 6 months ago
0a48c4c
Requantization cases for offset changes only
by Mohammed Suhail Munshi
· 6 months ago
7976f08
Fix compiler errors in cl-clang
by Pablo Marquez Tello
· 6 months ago
0c85334
Fix parallel depthwise perf regression from 2db938c
by Jonathan Deakin
· 6 months ago
0e73498
Add support for QSYMM8 in ClCastKernel
by Pablo Marquez Tello
· 6 months ago
0ee13af
Remove CKW prototype and Template Writer
by Gunes Bayir
· 6 months ago
a3e1b50
Fix the bug in GpuTanh operator in dynamic fusion
by Gunes Bayir
· 6 months ago
a5a81ae
Mark GpuSoftmax and GpuReshape as not supported
by Gunes Bayir
· 6 months ago
2db938c
Parallelize CPU depthwise over batch if only 1 row
by Jonathan Deakin
· 6 months ago
e695579
arm_gemm: SME: Remove artificial single-thread constraint on quantized int8 kernels.
by David Mansell
· 7 months ago
0c17c4b
Fix leftover cols in CpuGemmLowpMatrixBReductionKernel
by Jonathan Deakin
· 7 months ago
2b9fa59
Use the stable CKW API in the GPU dynamic fusion backend
by Gunes Bayir
· 6 months ago
fb92e22
arm_gemm: convolution: optimize convolver.hpp.
by David Mansell
· 8 months ago
bde6e78
Fix for Logically dead code detected in Coverity checks
by Anitha Raj
· 6 months ago
e8e016e
Fix for unchecked return value detected in Coverity checks.
by Anitha Raj
· 6 months ago
fdf56fb
Make GpuWorkloadContext own all tensor info objects
by Viet-Hoa Do
· 6 months ago
6829e02
Fix divide-by-zero compilation error
by Viet-Hoa Do
· 6 months ago
8896cf7
Fix minor issue, clean lut code
by Mohammed Suhail Munshi
· 6 months ago
27dee1e
Fix potential threading issue in LUTManager
by Mohammed Suhail Munshi
· 7 months ago
0eb9cfb
[ONCPUML-1387] Add ACL based reorder for f32 to bf16 data type conversion.
by Renato Arantes
· 8 months ago
5d7a93a
Fix compilation error on GCC 13.2
by Jakub Sujak
· 7 months ago
7467ba8
Use look up table for fp16 activation
by Mohammed Suhail Munshi
· 8 months ago
7fe7791
Prevent RELU from being processed thru LUT in INT8
by Sangwon Ha
· 7 months ago
c310c11
Fix nightly issue caused by gemm_reshaped_only_rhs_mmul kernel
by Gunes Bayir
· 7 months ago
85cafff
Add Mali™-G720 and Mali™-G620 as GpuTargets
by Gunes Bayir
· 7 months ago
306a8a9
Fix nightly bug caused by not validation 3d cases for input tensor
by Gunes Bayir
· 8 months ago
ec0a057
Revert "Fix nightly bug caused by wrong validation in Gemm mmul kernel"
by Gunes Bayir
· 8 months ago
feef9b9
Fix validation error in CL generate proposals kernel
by Gunes Bayir
· 8 months ago
270576a
Fix nightly bug caused by wrong validation in Gemm mmul kernel
by Gunes Bayir
· 8 months ago
b526431
Winograd changes to enable fp16 in armv8a multi_isa builds
by Pablo Marquez Tello
· 8 months ago
0660172
Fix validation error in graph_ssd_mobilenet
by Gunes Bayir
· 8 months ago
eb475ec
Fix unit tests failing in CL/UNIT/TensorAllocator
by Gunes Bayir
· 8 months ago
4737094
Optimize CPU depth-to-space
by Viet-Hoa Do
· 9 months ago
17e116e
Revert "thread_local _custom_scheduler"
by Pablo Marquez Tello
· 8 months ago
fadc9b1
Optimize CpuSoftmaxKernel for axis=0
by Gunes Bayir
· 9 months ago
9f7aca9
Changes to enable FP16 in armv8a multi_isa
by Pablo Marquez Tello
· 12 months ago
8d4cdd4
BatchNorm changes to enable fp16 in armv8a multi_isa builds
by Pablo Marquez Tello
· 8 months ago
568aab6
CpuMul changes to enable fp16 in armv8a multi_isa builds
by Pablo Marquez Tello
· 8 months ago
ded5b18
thread_local _custom_scheduler
by David Svantesson
· 12 months ago
ba93371
NormalizationLayer changes to enable fp16 in armv8a multi_isa builds
by Pablo Marquez Tello
· 9 months ago
d4650e9
Fix various coverity issues
by SiCong Li
· 9 months ago
ec2afd6
Fix device issue with CL softmax
by Viet-Hoa Do
· 9 months ago
c63f8b0
Update comments to suppress doxygen warnings.
by Anitha Raj
· 9 months ago
24c140f
Fix CpuGemmConv2d int8 segfault
by SiCong Li
· 9 months ago
92c3d71
Remove duplicate definitions of BF16 fixed format kernels.
by David Mansell
· 9 months ago
01b0f9b
Pooling changes to enable fp16 in armv8a multi_isa builds
by Pablo Marquez Tello
· 9 months ago
64f4a30
DepthwiseConvolution changes to enable fp16 in armv8a multi_isa builds
by Pablo Marquez Tello
· 9 months ago
c5ab4df
Optimize CpuGemmConv2d start-up time
by SiCong Li
· 10 months ago
4a9dbed
Update heuristic for MatMul Native U8
by Gian Marco Iodice
· 9 months ago
a7ddd60
Add support for Arm® Cortex®-A520 and Arm® Cortex®-R82
by Viet-Hoa Do
· 9 months ago
bcf9552
Fix compilation error with clang and multi-isa
by Viet-Hoa Do
· 9 months ago
704c22f
[GPU] Update Reverse layer to allow negative axis and reversed axis order
by Adnan AlSinan
· 9 months ago
fde45d8
Extend CKW MatMul with nt_t
by Adnan AlSinan
· 9 months ago
5ef0bdd
Fix SVE kernel using SVE2 instruction
by Viet-Hoa Do
· 9 months ago
29254ae
Optimize CL softmax
by Viet-Hoa Do
· 10 months ago
e5362e7
DirectConv and Im2Col changes to enable fp16 in armv8a multi_isa builds
by Pablo Marquez Tello
· 10 months ago
72b7471
Add check to disable dynamic bias with quantized datatypes in Conv2D layer
by Mohammed Suhail Munshi
· 10 months ago
074b985
FuseBatchNorm changes to enable fp16 in armv8a multi_isa builds
by Pablo Marquez Tello
· 10 months ago
0fa92b8
arm_gemm: Add SME2 FP16 GEMV using FP16->FP32 dot product.
by David Mansell
· 10 months ago
098efc4
Revert "arm_gemm: Add SME2 FP16 GEMV."
by David Mansell
· 10 months ago
c1204c7
Connect MatMul MMUL kernels to ClMatMul operator
by Gunes Bayir
· 10 months ago
d8a397e
Fix build error in CpuScale
by Pablo Marquez Tello
· 10 months ago
b5cb4d2
Scale changes to enable fp16 in armv8a multi_isa builds
by Pablo Marquez Tello
· 10 months ago
aeced74
arm_gemm: Add SME2 FP16 GEMV.
by David Mansell
· 10 months ago
95d477e
Remove padding from CL comparison operator
by Viet-Hoa Do
· 10 months ago
c210c85
Optimize CL reduction operation
by Viet-Hoa Do
· 10 months ago
fb9c25d
arm_gemm: fix 2D threading mode for SME2
by David Mansell
· 10 months ago
9aa153a
Fix build error
by Pablo Marquez Tello
· 10 months ago
dfd56a6
Fix NEReorderKernel validation
by David Svantesson
· 10 months ago
d9c1d44
Port MatMul to Dynamic Fusion + CKW boilerplate code
by Adnan AlSinan
· 10 months ago
0b72aa4
Optimize NEStackLayer
by Gunes Bayir
· 10 months ago
b6718c8
Fix compilation error caused by ambiguous std::abs call
by Gunes Bayir
· 10 months ago
6777359
CpuSubKernel changes to enable fp16 in armv8a multi_isa builds
by Pablo Marquez Tello
· 10 months ago
1434155
Change heuristics for FP16 Deconv
by Sangwon Ha
· 10 months ago
68b6dce
Pool2d changes to enable fp16 in armv8a multi_isa builds
by Pablo Marquez Tello
· 10 months ago
a23b468
Optimize CLTranspose operator
by Jakub Sujak
· 10 months ago
a04ae3e
Port DepthwiseConv2d operator to Ckw
by ramy.elgammal@arm.com
· 1 year ago
745153b
NEDeconvolutionLayer validation fix
by Pablo Marquez Tello
· 10 months ago
0a99c79
Fix nightly NEON Reverse reference failure
by Adnan AlSinan
· 10 months ago
c2a51bd
Optimize CL and Neon Winograd tests
by Gunes Bayir
· 10 months ago
a396da1
Implement Quantized Matmul T/T and T/Nt kernels using MMUL extension
by Gunes Bayir
· 10 months ago
6e56bf3
Revise clang-format configuration
by Jakub Sujak
· 11 months ago
2ad0a6b
Implement Quantized Matmul Nt/T kernel using MMUL extension
by Gunes Bayir
· 10 months ago
ef9da00
Reimplement erf function
by Viet-Hoa Do
· 10 months ago
Next »