Field Programmable Gate Array
Field-Programmable Gate Arrays (FPGAs) are reconfigurable hardware devices increasingly used to accelerate machine learning (ML) inference, particularly for resource-constrained applications like edge computing. Current research focuses on optimizing various ML model architectures, including transformers, convolutional neural networks (CNNs), and recurrent neural networks (RNNs), for efficient deployment on FPGAs, often employing techniques like quantization and model compression to reduce resource usage and latency. This work is significant because it enables the deployment of powerful ML models on low-power, embedded devices, impacting diverse fields from real-time image processing in robotics and scientific instrumentation to high-throughput data analysis in high-energy physics.
Papers
fpgaHART: A toolflow for throughput-oriented acceleration of 3D CNNs for HAR onto FPGAs
Petros Toupas, Christos-Savvas Bouganis, Dimitrios Tzovaras
APPRAISER: DNN Fault Resilience Analysis Employing Approximation Errors
Mahdi Taheri, Mohammad Hasan Ahmadilivani, Maksim Jenihhin, Masoud Daneshtalab, Jaan Raik
Efficient Implementation of a Multi-Layer Gradient-Free Online-Trainable Spiking Neural Network on FPGA
Ali Mehrabi, Yeshwanth Bethi, André van Schaik, Andrew Wabnitz, Saeed Afshar