Latency Optimization

Latency optimization focuses on minimizing the delay in various computational processes, particularly in resource-constrained environments like edge computing and robotics. Current research emphasizes efficient algorithms and architectures, including neuromorphic computing, optimized deep neural network serving strategies (like dynamic instance allocation), and tailored solutions for specific applications such as federated learning and keyword spotting. These advancements are crucial for improving the responsiveness and energy efficiency of numerous systems, ranging from real-time robotic control to large-scale machine learning deployments.

Papers