Extreme Edge
"Extreme edge" research focuses on deploying computationally intensive machine learning models and algorithms on resource-constrained edge devices, prioritizing efficiency and real-time performance without sacrificing accuracy. Current efforts concentrate on optimizing existing architectures like transformers and convolutional neural networks through techniques such as quantization, pruning, knowledge distillation, and novel attention mechanisms, alongside developing lightweight alternatives. This field is crucial for enabling AI applications in diverse areas like robotics, healthcare, and environmental monitoring, where immediate processing and limited power are critical constraints.
Papers
VARADE: a Variational-based AutoRegressive model for Anomaly Detection on the Edge
Alessio Mascolini, Sebastiano Gaiardelli, Francesco Ponzio, Nicola Dall'Ora, Enrico Macii, Sara Vinco, Santa Di Cataldo, Franco Fummi
EDGE-Rec: Efficient and Data-Guided Edge Diffusion For Recommender Systems Graphs
Utkarsh Priyam, Hemit Shah, Edoardo Botta
Privacy-Preserving Hierarchical Model-Distributed Inference
Fatemeh Jafarian Dehkordi, Yasaman Keshtkarjahromi, Hulya Seferoglu
AyE-Edge: Automated Deployment Space Search Empowering Accuracy yet Efficient Real-Time Object Detection on the Edge
Chao Wu, Yifan Gong, Liangkai Liu, Mengquan Li, Yushu Wu, Xuan Shen, Zhimin Li, Geng Yuan, Weisong Shi, Yanzhi Wang