Field Programmable Gate Array
Field-Programmable Gate Arrays (FPGAs) are reconfigurable hardware devices increasingly used to accelerate machine learning (ML) inference, particularly for resource-constrained applications like edge computing. Current research focuses on optimizing various ML model architectures, including transformers, convolutional neural networks (CNNs), and recurrent neural networks (RNNs), for efficient deployment on FPGAs, often employing techniques like quantization and model compression to reduce resource usage and latency. This work is significant because it enables the deployment of powerful ML models on low-power, embedded devices, impacting diverse fields from real-time image processing in robotics and scientific instrumentation to high-throughput data analysis in high-energy physics.
Papers
All-to-all reconfigurability with sparse and higher-order Ising machines
Srijan Nikhar, Sidharth Kannan, Navid Anjum Aadit, Shuvro Chowdhury, Kerem Y. Camsari
Harnessing FPGA Technology for Enhanced Biomedical Computation
Nisanur Alici, Kayode Inadagbo, Murat Isik
Shedding the Bits: Pushing the Boundaries of Quantization with Minifloats on FPGAs
Shivam Aggarwal, Hans Jakob Damsgaard, Alessandro Pappalardo, Giuseppe Franco, Thomas B. Preußer, Michaela Blott, Tulika Mitra