Edge AI
Edge AI focuses on performing artificial intelligence computations directly on resource-constrained devices at the network's edge, minimizing latency and bandwidth needs while enhancing privacy. Current research emphasizes developing energy-efficient model architectures (like lightweight CNNs and Transformers), efficient model compression techniques (pruning, quantization), and hardware acceleration (using TPUs, NPUs, FPGAs, and specialized ASICs) to enable real-time inference on edge devices. This field is crucial for applications ranging from autonomous systems and industrial monitoring to healthcare and smart homes, driving advancements in both hardware and software for efficient and privacy-preserving AI deployment.
Papers
January 10, 2025
January 4, 2025
December 23, 2024
December 13, 2024
November 20, 2024
November 19, 2024
November 1, 2024
October 30, 2024
October 28, 2024
October 15, 2024
October 14, 2024
October 7, 2024
October 5, 2024
September 23, 2024
September 2, 2024
August 8, 2024
July 31, 2024
July 22, 2024
July 18, 2024
July 11, 2024