Latency Optimization
Latency optimization focuses on minimizing the delay in various computational processes, particularly in resource-constrained environments like edge computing and robotics. Current research emphasizes efficient algorithms and architectures, including neuromorphic computing, optimized deep neural network serving strategies (like dynamic instance allocation), and tailored solutions for specific applications such as federated learning and keyword spotting. These advancements are crucial for improving the responsiveness and energy efficiency of numerous systems, ranging from real-time robotic control to large-scale machine learning deployments.
Papers
November 14, 2024
November 1, 2024
April 28, 2024
March 13, 2024
November 30, 2023
September 26, 2023
July 6, 2023
March 15, 2023
June 15, 2022
March 24, 2022
March 18, 2022