/*
 * Copyright (c) 2017-2020 Arm Limited.
 *
 * SPDX-License-Identifier: MIT
 *
 * Permission is hereby granted, free of charge, to any person obtaining a copy
 * of this software and associated documentation files (the "Software"), to
 * deal in the Software without restriction, including without limitation the
 * rights to use, copy, modify, merge, publish, distribute, sublicense, and/or
 * sell copies of the Software, and to permit persons to whom the Software is
 * furnished to do so, subject to the following conditions:
 *
 * The above copyright notice and this permission notice shall be included in all
 * copies or substantial portions of the Software.
 *
 * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
 * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
 * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
 * AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
 * LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
 * OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
 * SOFTWARE.
 */
#ifndef ARM_COMPUTE_NEDEPTHWISECONVOLUTION_H
#define ARM_COMPUTE_NEDEPTHWISECONVOLUTION_H

#include "arm_compute/runtime/NEON/functions/NEActivationLayer.h"
#include "arm_compute/runtime/NEON/functions/NEPermute.h"
#include "arm_compute/runtime/NEON/functions/assembly/NEDepthwiseConvolutionAssemblyDispatch.h"
#include <memory>

namespace arm_compute
{
// Forward declarations
class ITensor;
class NEDepthwiseConvolutionLayerNativeKernel;

/** Function to execute a depthwise convolution.
 */
class NEDepthwiseConvolutionLayer : public IFunction
{
public:
    /** Default constructor */
    NEDepthwiseConvolutionLayer(std::shared_ptr<IMemoryManager> memory_manager = nullptr);
    /** Prevent instances of this class from being copied (As this class contains pointers) */
    NEDepthwiseConvolutionLayer(const NEDepthwiseConvolutionLayer &) = delete;
    /** Default move constructor */
    NEDepthwiseConvolutionLayer(NEDepthwiseConvolutionLayer &&) = default;
    /** Prevent instances of this class from being copied (As this class contains pointers) */
    NEDepthwiseConvolutionLayer &operator=(const NEDepthwiseConvolutionLayer &) = delete;
    /** Default move assignment operator */
    NEDepthwiseConvolutionLayer &operator=(NEDepthwiseConvolutionLayer &&) = default;
    /** Default destructor */
    ~NEDepthwiseConvolutionLayer();
    /** Initialize the function's source, destination, weights and convolution information.
     *
     * @param[in, out] input            Source tensor. Data type supported: QASYMM8/QASYMM8_SIGNED/F16/F32
     * @param[out]     output           Destination tensor. Data type supported: same as @p input.
     * @param[in]      weights          Weights tensor. These are 3D tensors with shape [kernel_x, kernel_y, IFM].
     *                                  Data type supported: Same as @p input or QASYMM8/QASYMM8_SIGNED/QSYMM8_PER_CHANNEL when @p input is QASYMM8/QASYMM8_SIGNED.
     * @param[in]      biases           Biases tensor. A 1D tensor with shape [IFM]. Must be nullptr if not needed.
     *                                  Data type supported: Same as @p input, S32 when input is QASYMM8/QASYMM8_SIGNED.
     * @param[in]      conv_info        Padding and stride information to use for the convolution.
     * @param[in]      depth_multiplier (Optional) Multiplier to apply to the input's depth in order to retrieve the output's depth. Defaults to 1.
     * @param[in]      act_info         (Optional) Activation layer information in case of a fused activation.
     * @param[in]      dilation         (Optional) Dilation, in elements, across x and y. Defaults to (1, 1).
     */
    void configure(ITensor *input, const ITensor *weights, const ITensor *biases, ITensor *output, const PadStrideInfo &conv_info,
                   unsigned int depth_multiplier = 1, const ActivationLayerInfo &act_info = ActivationLayerInfo(), const Size2D &dilation = Size2D(1U, 1U));
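    /* Usage sketch (illustrative only, not part of the documented API): the tensor
     * objects below are hypothetical and assumed to be already initialised and
     * allocated elsewhere.
     *
     *   NEDepthwiseConvolutionLayer dwc;
     *   dwc.configure(&src, &weights, &biases, &dst,
     *                 PadStrideInfo(1, 1, 1, 1), // stride 1 in x/y, padding 1 in x/y
     *                 1,                         // depth_multiplier
     *                 ActivationLayerInfo(ActivationLayerInfo::ActivationFunction::RELU));
     *   dwc.run(); // run() typically triggers prepare() internally on first execution
     */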

    /** Static function to check if given info will lead to a valid configuration of @ref NEDepthwiseConvolutionLayer
     *
     * @param[in] input            Source tensor. Data type supported: QASYMM8/QASYMM8_SIGNED/F16/F32
     * @param[in] output           Destination tensor. Data type supported: same as @p input.
     * @param[in] weights          Weights tensor. These are 3D tensors with shape [kernel_x, kernel_y, IFM].
     *                             Data type supported: Same as @p input or QASYMM8/QASYMM8_SIGNED/QSYMM8_PER_CHANNEL when @p input is QASYMM8/QASYMM8_SIGNED.
     * @param[in] biases           Biases tensor. A 1D tensor with shape [IFM]. Must be nullptr if not needed.
     *                             Data type supported: Same as @p input, S32 when input is QASYMM8/QASYMM8_SIGNED.
     * @param[in] conv_info        Padding and stride information to use for the convolution.
     * @param[in] depth_multiplier (Optional) Multiplier to apply to the input's depth in order to retrieve the output's depth. Defaults to 1.
     * @param[in] act_info         (Optional) Activation layer information in case of a fused activation.
     * @param[in] dilation         (Optional) Dilation, in elements, across x and y. Defaults to (1, 1).
     *
     * @return a status
     */
    static Status validate(const ITensorInfo *input, const ITensorInfo *weights, const ITensorInfo *biases, const ITensorInfo *output, const PadStrideInfo &conv_info,
                           unsigned int depth_multiplier = 1, const ActivationLayerInfo &act_info = ActivationLayerInfo(), const Size2D &dilation = Size2D(1U, 1U));
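    /* Validation sketch (illustrative only): validate() allows rejecting an
     * unsupported configuration before any tensor is allocated. The tensor
     * objects below are hypothetical.
     *
     *   const Status status = NEDepthwiseConvolutionLayer::validate(src.info(), weights.info(),
     *                                                               biases.info(), dst.info(),
     *                                                               PadStrideInfo(1, 1, 1, 1));
     *   if(!static_cast<bool>(status))
     *   {
     *       // Fall back to another function or report status.error_description()
     *   }
     */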

    // Inherited methods overridden:
    void run() override;
    void prepare() override;

private:
    /** Static function to choose the best depthwise convolution function for @ref NEDepthwiseConvolutionLayer
     *
     * @param[in] input            Source tensor info. Data type supported: QASYMM8/QASYMM8_SIGNED/F16/F32
     * @param[in] weights          Weights tensor info. These are 3D tensors with shape [kernel_x, kernel_y, IFM].
     *                             Data type supported: Same as @p input or QASYMM8/QASYMM8_SIGNED/QSYMM8_PER_CHANNEL when @p input is QASYMM8/QASYMM8_SIGNED.
     * @param[in] biases           Biases tensor info. A 1D tensor with shape [IFM]. Must be nullptr if not needed.
     *                             Data type supported: Same as @p input, S32 when input is QASYMM8/QASYMM8_SIGNED.
     * @param[in] output           Destination tensor. Data type supported: same as @p input.
     * @param[in] conv_info        Padding and stride information to use for the convolution.
     * @param[in] depth_multiplier (Optional) Multiplier to apply to the input's depth in order to retrieve the output's depth. Defaults to 1.
     * @param[in] act_info         (Optional) Activation layer information in case of a fused activation. Only RELU, BOUNDED_RELU and LU_BOUNDED_RELU for 3x3 quantized are supported.
     * @param[in] dilation         (Optional) Dilation, in elements, across x and y. Defaults to (1, 1).
     *
     * @return a Depthwise Convolution Function
     */
    static DepthwiseConvolutionFunction get_depthwiseconvolution_function(const ITensorInfo *input, const ITensorInfo *weights, const ITensorInfo *biases, const ITensorInfo *output,
                                                                          const PadStrideInfo &conv_info, unsigned int depth_multiplier = 1,
                                                                          ActivationLayerInfo act_info = ActivationLayerInfo(), const Size2D &dilation = Size2D(1U, 1U));
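    /* Hypothetical sketch of how the returned value is consumed, assuming the
     * OPTIMIZED/GENERIC values of DepthwiseConvolutionFunction; argument lists
     * are elided for brevity.
     *
     *   switch(get_depthwiseconvolution_function(input, weights, biases, output, conv_info))
     *   {
     *       case DepthwiseConvolutionFunction::OPTIMIZED:
     *           _func_optimized.configure(...); // assembly-backed path
     *           break;
     *       case DepthwiseConvolutionFunction::GENERIC:
     *           _func_generic.configure(...);   // native kernel path
     *           break;
     *   }
     */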

    /** Basic function to execute optimized depthwise convolution routines. This function calls the following NEON kernels:
     *
     * @note At the moment 3x3 and 5x5 convolutions with stride 1 or 2 are supported
     *
     * -# @ref NEFillBorderKernel (if pad_x or pad_y > 0) and no assembly kernel implementation is present
     * -# @ref NEDepthwiseConvolutionLayer3x3Kernel if 3x3 and no assembly kernel implementation is present
     * -# @ref NEDepthwiseConvolutionAssemblyDispatch if assembly kernel implementation is present
     * -# @ref NEDirectConvolutionLayerOutputStageKernel if re-quantization of output is required
     * -# @ref NEActivationLayer if fused activation is required
     *
     */
    class NEDepthwiseConvolutionLayerOptimizedInternal : public IFunction
    {
    public:
        /** Default constructor */
        NEDepthwiseConvolutionLayerOptimizedInternal(std::shared_ptr<IMemoryManager> memory_manager = nullptr);
        /** Prevent instances of this class from being copied (As this class contains pointers) */
        NEDepthwiseConvolutionLayerOptimizedInternal(const NEDepthwiseConvolutionLayerOptimizedInternal &) = delete;
        /** Default move constructor */
        NEDepthwiseConvolutionLayerOptimizedInternal(NEDepthwiseConvolutionLayerOptimizedInternal &&) = default;
        /** Prevent instances of this class from being copied (As this class contains pointers) */
        NEDepthwiseConvolutionLayerOptimizedInternal &operator=(const NEDepthwiseConvolutionLayerOptimizedInternal &) = delete;
        /** Default move assignment operator */
        NEDepthwiseConvolutionLayerOptimizedInternal &operator=(NEDepthwiseConvolutionLayerOptimizedInternal &&) = default;
        /** Default destructor */
        ~NEDepthwiseConvolutionLayerOptimizedInternal() = default;
        /** Initialize the function's source, destination, kernels and border_size.
         *
         * @param[in, out] input            Source tensor. Data type supported: QASYMM8/QASYMM8_SIGNED/F16/F32. (Written to only for border filling).
         * @param[in]      weights          Weights tensor. These are 3D tensors with shape [kernel_x, kernel_y, IFM]. Data type supported: Same as @p input.
         * @param[in]      biases           Biases tensor. A 1D tensor with shape [IFM]. Must be nullptr if not needed.
         *                                  Data type supported: Same as @p input, S32 when input is QASYMM8/QASYMM8_SIGNED.
         * @param[out]     output           Destination tensor. Data type supported: same as @p input.
         * @param[in]      conv_info        Padding and stride information to use for the convolution.
         * @param[in]      depth_multiplier (Optional) Multiplier to apply to the input's depth in order to retrieve the output's depth. Defaults to 1.
         * @param[in]      act_info         (Optional) Activation layer information in case of a fused activation.
         * @param[in]      dilation         (Optional) Dilation, in elements, across x and y. Defaults to (1, 1).
         */
        void configure(ITensor *input, const ITensor *weights, const ITensor *biases, ITensor *output, const PadStrideInfo &conv_info,
                       unsigned int depth_multiplier = 1, const ActivationLayerInfo &act_info = ActivationLayerInfo(), const Size2D &dilation = Size2D(1U, 1U));

        /** Static function to check if given info will lead to a valid configuration of @ref NEDepthwiseConvolutionLayerOptimizedInternal
         *
         * @param[in] input            Source tensor. Data type supported: QASYMM8/QASYMM8_SIGNED/F16/F32. (Written to only for border filling).
         * @param[in] weights          Weights tensor. These are 3D tensors with shape [kernel_x, kernel_y, IFM]. Data type supported: Same as @p input.
         * @param[in] biases           Biases tensor. A 1D tensor with shape [IFM]. Must be nullptr if not needed.
         *                             Data type supported: Same as @p input, S32 when input is QASYMM8/QASYMM8_SIGNED.
         * @param[in] output           Destination tensor. Data type supported: same as @p input.
         * @param[in] conv_info        Padding and stride information to use for the convolution.
         * @param[in] depth_multiplier (Optional) Multiplier to apply to the input's depth in order to retrieve the output's depth. Defaults to 1.
         * @param[in] act_info         (Optional) Activation layer information in case of a fused activation.
         * @param[in] dilation         (Optional) Dilation, in elements, across x and y. Defaults to (1, 1).
         *
         * @return a status
         */
        static Status validate(const ITensorInfo *input, const ITensorInfo *weights, const ITensorInfo *biases, const ITensorInfo *output, const PadStrideInfo &conv_info,
                               unsigned int depth_multiplier = 1, const ActivationLayerInfo &act_info = ActivationLayerInfo(), const Size2D &dilation = Size2D(1U, 1U));

        // Inherited methods overridden:
        void run() override;
        void prepare() override;

    private:
        MemoryGroup                            _memory_group;
        NEDepthwiseConvolutionAssemblyDispatch _dwc_optimized_func;
        NEPermute                              _permute_input;
        NEPermute                              _permute_weights;
        NEPermute                              _permute_output;
        NEActivationLayer                      _activationlayer_function;
        Tensor                                 _accumulator;
        Tensor                                 _permuted_input;
        Tensor                                 _permuted_weights;
        Tensor                                 _permuted_output;
        const ITensor                         *_original_weights;
        bool                                   _has_bias;
        bool                                   _is_quantized;
        bool                                   _is_nchw;
        bool                                   _permute;
        bool                                   _is_activationlayer_enabled;
        bool                                   _is_prepared;
    };

    /** Basic function to execute a generic depthwise convolution. This function calls the following NEON kernel:
     *
     * -# @ref NEDepthwiseConvolutionLayerNativeKernel
     *
     */
    class NEDepthwiseConvolutionLayerGeneric : public IFunction
    {
    public:
        /** Default constructor */
        NEDepthwiseConvolutionLayerGeneric();
        /** Prevent instances of this class from being copied (As this class contains pointers) */
        NEDepthwiseConvolutionLayerGeneric(const NEDepthwiseConvolutionLayerGeneric &) = delete;
        /** Default move constructor */
        NEDepthwiseConvolutionLayerGeneric(NEDepthwiseConvolutionLayerGeneric &&) = default;
        /** Prevent instances of this class from being copied (As this class contains pointers) */
        NEDepthwiseConvolutionLayerGeneric &operator=(const NEDepthwiseConvolutionLayerGeneric &) = delete;
        /** Default move assignment operator */
        NEDepthwiseConvolutionLayerGeneric &operator=(NEDepthwiseConvolutionLayerGeneric &&) = default;
        /** Default destructor */
        ~NEDepthwiseConvolutionLayerGeneric() = default;
        /** Initialize the function's source, destination, weights and convolution information.
         *
         * @param[in, out] input            Source tensor. Data type supported: QASYMM8/QASYMM8_SIGNED/F16/F32. (Written to only for border filling).
         * @param[out]     output           Destination tensor. Data type supported: same as @p input.
         * @param[in]      weights          Weights tensor. These are 3D tensors with shape [kernel_x, kernel_y, IFM].
         *                                  Data type supported: Same as @p input or QASYMM8/QASYMM8_SIGNED/QSYMM8_PER_CHANNEL when @p input is QASYMM8/QASYMM8_SIGNED.
         * @param[in]      biases           Biases tensor. A 1D tensor with shape [IFM]. Must be nullptr if not needed.
         *                                  Data type supported: Same as @p input, S32 when input is QASYMM8/QASYMM8_SIGNED.
         * @param[in]      conv_info        Padding and stride information to use for the convolution.
         * @param[in]      depth_multiplier (Optional) Multiplier to apply to the input's depth in order to retrieve the output's depth. Defaults to 1.
         * @param[in]      act_info         (Optional) Activation layer information in case of a fused activation.
         * @param[in]      dilation         (Optional) Dilation, in elements, across x and y. Defaults to (1, 1).
         */
        void configure(ITensor *input, const ITensor *weights, const ITensor *biases, ITensor *output, const PadStrideInfo &conv_info,
                       unsigned int depth_multiplier = 1, const ActivationLayerInfo &act_info = ActivationLayerInfo(), const Size2D &dilation = Size2D(1U, 1U));

        /** Static function to check if given info will lead to a valid configuration of @ref NEDepthwiseConvolutionLayerGeneric
         *
         * @param[in] input            Source tensor. Data type supported: QASYMM8/QASYMM8_SIGNED/F16/F32. (Written to only for border filling).
         * @param[in] output           Destination tensor. Data type supported: same as @p input.
         * @param[in] weights          Weights tensor. These are 3D tensors with shape [kernel_x, kernel_y, IFM].
         *                             Data type supported: Same as @p input or QASYMM8/QASYMM8_SIGNED/QSYMM8_PER_CHANNEL when @p input is QASYMM8/QASYMM8_SIGNED.
         * @param[in] biases           Biases tensor. A 1D tensor with shape [IFM]. Must be nullptr if not needed.
         *                             Data type supported: Same as @p input, S32 when input is QASYMM8/QASYMM8_SIGNED.
         * @param[in] conv_info        Padding and stride information to use for the convolution.
         * @param[in] depth_multiplier (Optional) Multiplier to apply to the input's depth in order to retrieve the output's depth. Defaults to 1.
         * @param[in] act_info         (Optional) Activation layer information in case of a fused activation.
         * @param[in] dilation         (Optional) Dilation, in elements, across x and y. Defaults to (1, 1).
         *
         * @return a status
         */
        static Status validate(const ITensorInfo *input, const ITensorInfo *weights, const ITensorInfo *biases, const ITensorInfo *output, const PadStrideInfo &conv_info,
                               unsigned int depth_multiplier = 1, const ActivationLayerInfo &act_info = ActivationLayerInfo(), const Size2D &dilation = Size2D(1U, 1U));

        // Inherited methods overridden:
        void run() override;
        void prepare() override;

    private:
        std::unique_ptr<NEDepthwiseConvolutionLayerNativeKernel> _depthwise_conv_kernel;
        NEPermute                                                 _permute_input;
        NEPermute                                                 _permute_weights;
        NEPermute                                                 _permute_output;
        NEActivationLayer                                         _activationlayer_function;
        Tensor                                                    _permuted_input;
        Tensor                                                    _permuted_weights;
        Tensor                                                    _permuted_output;
        bool                                                      _is_prepared;
        bool                                                      _is_nchw;
        bool                                                      _is_activationlayer_enabled;
        const ITensor                                            *_original_weights;
    };

    DepthwiseConvolutionFunction                 _depth_conv_func;
    NEDepthwiseConvolutionLayerOptimizedInternal _func_optimized;
    NEDepthwiseConvolutionLayerGeneric           _func_generic;
};
} // namespace arm_compute
#endif /* ARM_COMPUTE_NEDEPTHWISECONVOLUTION_H */