2024
DOI: 10.1109/jiot.2024.3395335
|View full text |Cite
|
Sign up to set email alerts
|

Optimizing Convolutions for Deep Learning Inference on ARM Cortex-M Processors

Antonio Maciá-Lillo,
Sergio Barrachina,
Germán Fabregat
et al.

Abstract: We perform a series of optimisations on the convolution operator within the ARM CMSIS-NN library to improve the performance of deep learning tasks on Arduino development boards equipped with ARM Cortex-M4 and M7 microcontrollers. To this end, we develop custom microkernels that efficiently handle the internal computations required by the convolution operator via the lowering approach and the direct method, and we design two techniques to avoid register spilling. We also take advantage of all the RAM on the Ard… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
references
References 24 publications
0
0
0
Order By: Relevance