review.mlplatform.org / ml / ComputeLibrary
4c3f716371da92977d4a998fe5c89dee04ddbced
4c3f716
Improve CPU extension detection on macos
by Viet-Hoa Do
· 7 weeks ago
05269f0
ScatterND fix for scalar cases
by Gunes Bayir
· 8 weeks ago
48f120c
Make quantization rounding consistent
by Jonathan Deakin
· 8 weeks ago
c1575b2
Add SME2 implementation of Softmax for QASYMM8 and QASYMM8_SIGNED.
by Omar Al Khatib
· 10 weeks ago
2fea135
Add batched indices support to Scatter GPU Implementation
by Mohammed Suhail Munshi
· 9 weeks ago
c22e126
arm_gemm: fix SVE check on fast mode kernels.
by David Mansell
· 8 weeks ago
0c5ba9e
Change reorder implementation to be vector length agnostic for OHWIo8 reorder
by Radu Salavat
· 3 months ago
5c76742
New SME2 heuristics.
by David Mansell
· 4 months ago
301e33f
Add fp16 and integer data type support for ScatterNd in Gpu
by Gunes Bayir
· 9 weeks ago
e5ef8c1
Disable SME2 Gemmlowp s8f32 kernel selection in case results needs to be accumulated
by Gunes Bayir
· 9 weeks ago
499b5bc
Disable SME2 Gemm kernel selection in case results needs to be accumulated
by Gunes Bayir
· 9 weeks ago
ada3200
Add update/index/output (m+1)/2d/(m+n) support for CLScatter
by Gunes Bayir
· 10 weeks ago
62d600f
Move s32 to f32 conversion in reference layers from quantization to dequantization
by Radu Salavat
· 3 months ago
2481e95
Add memory stress tests for per channel quantized convolution
by Gunes Bayir
· 10 weeks ago
0fa28be
Add padding to the shift and multipliers buffers
by Pablo Marquez Tello
· 10 weeks ago
575c5f1
Fix compiler error in the validation tests
by Pablo Marquez Tello
· 10 weeks ago
0e21236
Multi-Dimensional and Batched Scatter Reference and Dataset Implementation.
by Mohammed Suhail Munshi
· 3 months ago
7377107
Scatter GPU Kernel Implementation for 1D tensors.
by Mohammed Suhail Munshi
· 3 months ago
5057ce9
Update documentation for 24.04 release
by Michael Kozlov
· 2 months ago
83ca105
Fix v7 test failure when core matmul result is dequantized into fp32
by Gunes Bayir
· 3 months ago
6ac82a4
fix compilation errors on linux with gcc12
by Sunita Nadampalli
· 3 months ago
a668f9f
Add s8f32 kernels and dynamic QuantizationInfo
by Jonathan Deakin
· 5 months ago
34bdffb
Add guarding for accumulation validation test in aarch32
by Radu Salavat
· 3 months ago
64f2300
Runtime checks for bf16 fixed format tests
by David Svantesson-Yeung
· 3 months ago
cdce25b
Accumulation in Cpu Gemm kernels is not supported for quantized kernels in aarch32. This patch guards the relevant tests.
by Radu Salavat
· 3 months ago
cfca87b
Add SME2 implementation of softmax for FP16
by Gunes Bayir
· 3 months ago
f1f1f87
Add in place summation to CPU GEMM kernels
by Radu Salavat
· 4 months ago
1322065
Specify absolute tolerance
by Sangwon Ha
· 3 months ago
553e241
Fix compiler error
by Pablo Marquez Tello
· 3 months ago
1e91d71
Parallelise im2col along dimensions with higher number of iterations
by Milos Puzovic
· 3 months ago
77bbe2e
Add SME2 implementation of softmax for FP32
by Viet-Hoa Do
· 7 months ago
905786e
Added new NEON fixed format fast math mode hybrid kernel with maximum height of 6 for accumulation and updated heuristics
by Milos Puzovic
· 3 months ago
37d8445
Fix graph examples for WoA
by Pablo Marquez Tello
· 3 months ago
473b829
Adds Tests and reference implementation for scatter operator with 1D tensors.
by Mohammed Suhail Munshi
· 3 months ago
4908981
[ONCPUML-1451] Guard bf16 to bf16 tests with ARM_COMPUTE_ENABLE_FIXED_FORMAT_KERNELS
by Renato Arantes
· 3 months ago
7b3adf2
Fix for nightly build failures for android
by Mohammed Suhail Munshi
· 3 months ago
6a82787
Workaround to enable cross-compiling from macOS® to Android™
by Jakub Sujak
· 3 months ago
8609ca0
Add skeleton for CLScatter op, reference and tests
by Mohammed Suhail Munshi
· 4 months ago
36a75da
[ONCPUML-1451] Add matmul kernel to enable bf16 to bf16 operations via PyTorch® autocast() function
by Renato Arantes
· 5 months ago
d219115
Make Cpu/Gpu/Ref scalar/vectoral S32 division consistent
by Gunes Bayir
· 3 months ago
1618e95
Increase tolerance_num of Cpu RNNLayer tests
by Gunes Bayir
· 3 months ago
43ba0dd
Increase MatMul and DilatedConv test Q8 thresholds to 1
by Gunes Bayir
· 3 months ago
c00a82b
Fix overflow in NEMeanStdDevNormalizationKernel
by Pablo Marquez Tello
· 3 months ago
3e4b193
Fix quant. gemv kernel driver by adding set_quantized_bias()
by Gunes Bayir
· 4 months ago
5a67733
arm_gemm: Fix bias handling for sme2 FP16 GEMV.
by David Mansell
· 4 months ago
3ac0b87
Fix validation in pool2d assembly wrapper
by Pablo Marquez Tello
· 4 months ago
93e743f
Optimize CpuSoftmaxKernel for axis != 0 and neon kernels
by Omar Al Khatib
· 6 months ago
d0611c1
Update documentation for 24.02.1 release
by Felix Thomasmathibalan
· 4 months ago
57a8852
Fix WoA nightly failure
by Pablo Marquez Tello
· 4 months ago
9167c9c
Prefer indirect Gemm vs. Direct convolution if supported
by Gunes Bayir
· 4 months ago
e77736f
Set int8 test tolerance in FullyConnected to int8
by Gunes Bayir
· 4 months ago
40af090
Disable FP16 on 32 bit
by Pablo Marquez Tello
· 4 months ago
bf05373
Fix performance regression in fixed-format kernels
by Gunes Bayir
· 4 months ago
6fe9eaf
Set Neon™ as present for WoA
by Pablo Marquez Tello
· 4 months ago
2676424
Fix segfault in DWC in WoA
by Pablo Marquez Tello
· 4 months ago
c1787f0
Fix OpenBSD® build failure caused by patch 11144
by Gunes Bayir
· 4 months ago
ef63739
Integrate new pretranspose_b_array with extra fused transpose of B
by Gunes Bayir
· 5 months ago
0a48c4c
Requantization cases for offset changes only
by Mohammed Suhail Munshi
· 5 months ago
9469058
Fix linker errors in validation suite for WoA
by Pablo Marquez Tello
· 5 months ago
8528134
Fix validation suite on WoA
by Pablo Marquez Tello
· 5 months ago
e37a863
Fix escape character issues in format_code script
by Gunes Bayir
· 5 months ago
7976f08
Fix compiler errors in cl-clang
by Pablo Marquez Tello
· 5 months ago
0cba93f
[QTest] Use dynamic output quantization in Depthwise Conv tests
by Omar Al Khatib
· 5 months ago
8614077
Disable some DirectConv2d tests in Dynamic Fusion
by Gunes Bayir
· 5 months ago
d98e27e
Update documentation for 24.02 release
by Felix Thomasmathibalan
· 5 months ago
0c85334
Fix parallel depthwise perf regression from 2db938c
by Jonathan Deakin
· 5 months ago
0e73498
Add support for QSYMM8 in ClCastKernel
by Pablo Marquez Tello
· 5 months ago
0ee13af
Remove CKW prototype and Template Writer
by Gunes Bayir
· 5 months ago
a3e1b50
Fix the bug in GpuTanh operator in dynamic fusion
by Gunes Bayir
· 5 months ago
a5a81ae
Mark GpuSoftmax and GpuReshape as not supported
by Gunes Bayir
· 5 months ago
2db938c
Parallelize CPU depthwise over batch if only 1 row
by Jonathan Deakin
· 5 months ago
e695579
arm_gemm: SME: Remove artificial single-thread constraint on quantized int8 kernels.
by David Mansell
· 6 months ago
1561d40
Build CKW by default
by Gunes Bayir
· 5 months ago
9d8f4ed
Fix compilation issue in CKW due to unused variable
by Gunes Bayir
· 5 months ago
9e987b1
Fix path
by Jakub Sujak
· 5 months ago
8050d22
Disable FP16 tests compilation on Multi-Isa v8a
by Mohammed Suhail Munshi
· 5 months ago
0c17c4b
Fix leftover cols in CpuGemmLowpMatrixBReductionKernel
by Jonathan Deakin
· 6 months ago
9b72a6c
Add scripts to generate Doxygen documentation
by Jakub Sujak
· 7 months ago
2b9fa59
Use the stable CKW API in the GPU dynamic fusion backend
by Gunes Bayir
· 6 months ago
7ab7fca
Fix logic in SConscript
by Jakub Sujak
· 5 months ago
b5d6082
Add build options for Address and UndefinedBehavior sanitizers
by Jakub Sujak
· 5 months ago
ec89b91
Fix multi_isa build for arch=arm64-v8a
by Pablo Marquez Tello
· 5 months ago
fb92e22
arm_gemm: convolution: optimize convolver.hpp.
by David Mansell
· 7 months ago
2aec5f1
Fix tolerance issue in BF16 MatMul tests
by Gunes Bayir
· 5 months ago
277def4
Fix Debug mode in CMake
by Jonathan Deakin
· 6 months ago
bde6e78
Fix for Logically dead code detected in Coverity checks
by Anitha Raj
· 5 months ago
e8e016e
Fix for unchecked return value detected in Coverity checks.
by Anitha Raj
· 5 months ago
fdf56fb
Make GpuWorkloadContext own all tensor info objects
by Viet-Hoa Do
· 5 months ago
e812c0c
Don't build CKW as part of Android.bp
by Jakub Sujak
· 5 months ago
3a704ae
Update Documentation for 24.01 release
by Felix Thomasmathibalan
· 5 months ago
c7f550d
Improved documentation
by Pablo Marquez Tello
· 6 months ago
6829e02
Fix divide-by-zero compilation error
by Viet-Hoa Do
· 6 months ago
8896cf7
Fix minor issue, clean lut code
by Mohammed Suhail Munshi
· 6 months ago
27dee1e
Fix potential threading issue in LUTManager
by Mohammed Suhail Munshi
· 6 months ago
0eb9cfb
[ONCPUML-1387] Add ACL based reorder for f32 to bf16 data type conversion.
by Renato Arantes
· 7 months ago
c5df0c6
Fix test compilation error on GCC 13.2
by Jakub Sujak
· 6 months ago
5d7a93a
Fix compilation error on GCC 13.2
by Jakub Sujak
· 6 months ago
7467ba8
Use look up table for fp16 activation
by Mohammed Suhail Munshi
· 7 months ago
7fe7791
Prevent RELU from being processed thru LUT in INT8
by Sangwon Ha
· 6 months ago
11ab451
Implement dynamic quantization for GEMMLowp tests
by SiCong Li
· 8 months ago