commit | e52a3000d2c13bc1b66ca66b3d12b6b836982394 | [log] [tgz] |
---|---|---|
author | Gian Marco Iodice <gianmarco.iodice@arm.com> | Wed Apr 11 15:59:10 2018 +0100 |
committer | Anthony Barbier <anthony.barbier@arm.com> | Fri Nov 02 16:49:37 2018 +0000 |
tree | 70e8ef5ba216762604f84228805aac9bd65747b6 | |
parent | dd03870b63784abe499761da2b26b209b33f2db2 [diff] |
COMPMID-1026 - Add support for 4x4 output tile in CLWinogradConvolutionLayer The performance achieved can be found at the following confluence page: https://confluence.arm.com/display/MLENG/GEMM-based+convolution+vs+Winograd-based+convolution+on+OpenCL Change-Id: I4b690cfdd4eb4ff0cd17b14fdd49ccaa1d1dc85c Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/127729 Tested-by: Jenkins <bsgcomp@arm.com> Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com>