Edge Device
Edge devices are resource-constrained computing units that perform computation close to data sources, reducing the latency, bandwidth usage, and privacy concerns associated with cloud computing. Current research focuses on optimizing deep learning models (e.g., CNNs, LLMs, GNNs) for edge deployment through techniques such as model compression (quantization, pruning, knowledge distillation), efficient parallel processing (pipeline parallelism, tensor parallelism), and federated learning; a minimal quantization sketch follows below. This work is significant because it enables sophisticated AI applications, such as autonomous driving and medical imaging analysis, to run on low-power devices, thereby expanding the accessibility and applicability of advanced technologies.
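To make the compression idea concrete, here is a minimal sketch of post-training dynamic quantization, one of the model-compression techniques mentioned above, assuming PyTorch is available; the small `nn.Sequential` model is a hypothetical stand-in for a network targeted at an edge device, not taken from any of the papers listed below.

```python
import torch
import torch.nn as nn

# Hypothetical small network standing in for a model to be deployed on an edge device.
model = nn.Sequential(
    nn.Linear(128, 64),
    nn.ReLU(),
    nn.Linear(64, 10),
)

# Post-training dynamic quantization: Linear-layer weights are stored as int8
# and dequantized on the fly at inference time, shrinking the model and
# typically speeding up CPU inference on resource-constrained hardware.
quantized_model = torch.quantization.quantize_dynamic(
    model, {nn.Linear}, dtype=torch.qint8
)

# Inference works the same way as with the original float32 model.
example_input = torch.randn(1, 128)
output = quantized_model(example_input)
print(output.shape)  # torch.Size([1, 10])
```

This sketch only illustrates one point in the design space: static quantization, pruning, or knowledge distillation trade accuracy, latency, and memory differently and are usually tuned per device and per task.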
Papers
UniForm: A Reuse Attention Mechanism Optimized for Efficient Vision Transformers on Edge Devices
Seul-Ki Yeom, Tae-Ho Kim
Underload: Defending against Latency Attacks for Object Detectors on Edge Devices
Tianyi Wang, Zichen Wang, Cong Wang, Yuanchao Shu, Ruilong Deng, Peng Cheng, Jiming Chen (Zhejiang University, Hangzhou, China)
ILASH: A Predictive Neural Architecture Search Framework for Multi-Task Applications
Md Hafizur Rahman, Md Mashfiq Rizvee, Sumaiya Shomaji, Prabuddha Chakraborty