Update release version and change log documentation

Resolves: COMPMID-5565

Change-Id: I9dca679f57f6c3cc9489669b80a5da2aba500d34
Signed-off-by: Jakub Sujak <jakub.sujak@arm.com>
Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/9122
Tested-by: Arm Jenkins <bsgcomp@arm.com>
Reviewed-by: SiCong Li <sicong.li@arm.com>
Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
Benchmark: Arm Jenkins <bsgcomp@arm.com>
diff --git a/docs/user_guide/release_version_and_change_log.dox b/docs/user_guide/release_version_and_change_log.dox
index d658b53..8bb2a3f 100644
--- a/docs/user_guide/release_version_and_change_log.dox
+++ b/docs/user_guide/release_version_and_change_log.dox
@@ -46,11 +46,8 @@
    - Add the following operators to the experimental dynamic fusion API:
      - GpuAdd, GpuCast, GpuClamp, GpuDepthwiseConv2d, GpuMul, GpuOutput, GpuPool2d, GpuReshape, GpuResize, GpuSoftmax, GpuSub.
    - Add SME/SME2 kernels for GeMM, Winograd convolution, Depthwise convolution and Pooling.
+   - Add new CPU operator AddMulAdd for float and quantized types.
    - Add new flag @ref ITensorInfo::lock_paddings() to tensors to prevent extending tensor paddings.
-   - Add new OpenCL kernel to compute indirect convolution:
-     - \link opencl::kernels::ClIndirectConv2dKernel ClIndirectConv2dKernel \endlink
-   - Add new OpenCL kernel to compute transposed convolution:
-     - \link opencl::kernels::ClTransposedConvolutionKernel ClTransposedConvolutionKernel \endlink
    - Add experimental support for CPU only Bazel and CMake builds.
  - Performance optimizations:
    - Optimize CPU base-e exponential functions for FP32.
@@ -58,6 +55,11 @@
    - Optimize CPU quantized Subtraction by reusing the quantized Addition kernel.
    - Optimize CPU ReduceMean by removing quantization steps and performing the operation in integer domain.
    - Optimize GPU Scale and Dynamic Fusion GpuResize by removing quantization steps and performing the operation in integer domain.
+   - Update the heuristic for CLDepthwiseConvolutionNative kernel.
+   - Add new optimized OpenCL kernel to compute indirect convolution:
+     - \link opencl::kernels::ClIndirectConv2dKernel ClIndirectConv2dKernel \endlink
+   - Add new optimized OpenCL kernel to compute transposed convolution:
+     - \link opencl::kernels::ClTransposedConvolutionKernel ClTransposedConvolutionKernel \endlink
  - Update recommended/minimum NDK version to r20b.
  - Various optimizations and bug fixes.