Device Vision Task

Device vision tasks focus on performing computer vision operations directly on resource-constrained devices like smartphones and IoT sensors, prioritizing efficiency and low latency. Current research emphasizes developing lightweight neural network architectures, including optimized convolutional neural networks (CNNs) and vision transformers (ViTs), often incorporating techniques like binary quantization to minimize memory footprint and computational cost. These advancements enable powerful vision capabilities on mobile devices, impacting applications ranging from mobile photography enhancement to real-time object detection and federated learning scenarios.

Papers