Convolution Primitive

Convolution primitives are fundamental building blocks of convolutional neural networks (CNNs), and optimizing their performance is crucial for efficient deep learning. Current research focuses on developing and benchmarking alternative algorithms to the commonly used General Matrix Multiplication (GEMM) approach, including sliding window techniques and specialized kernels tailored to specific hardware architectures like ARM Cortex-M processors. These efforts aim to reduce computational cost and memory usage, enabling faster and more energy-efficient CNN inference, particularly on resource-constrained devices. The resulting improvements have significant implications for deploying AI in embedded systems and expanding the accessibility of deep learning applications.

Papers