Edge Deployment
Edge deployment focuses on efficiently executing machine learning models, particularly deep learning models like transformers and graph neural networks, on resource-constrained devices near data sources to minimize latency and bandwidth usage. Current research emphasizes optimizing model architectures (e.g., binarization, quantization) and developing algorithms for efficient resource allocation, task offloading, and model protection against attacks. This field is crucial for advancing applications like autonomous driving, speech recognition, and personalized recommendations while addressing concerns about energy efficiency, privacy, and security in AI deployments.
Papers
October 29, 2024
October 16, 2024
September 9, 2024
July 16, 2024
May 20, 2024
May 2, 2024
April 17, 2024
March 14, 2024
March 5, 2024
December 7, 2023
November 30, 2022
November 21, 2022
October 31, 2022
October 10, 2022
April 9, 2022
March 21, 2022
February 26, 2022
December 4, 2021