Computational Efficiency
Computational efficiency in machine learning focuses on minimizing the computational resources (time and energy) required for training and deploying models while maintaining accuracy. Current research emphasizes developing novel algorithms and architectures, such as lightweight convolutional networks, efficient attention mechanisms (e.g., entropy-based clustering), and optimized numerical systems (e.g., redundant residue number systems), to reduce computational complexity. These advancements are crucial for deploying machine learning models on resource-constrained devices and for scaling up the training of increasingly complex models, impacting fields ranging from medical diagnosis to robotics.
Papers
Enhancing Computation Efficiency in Large Language Models through Weight and Activation Quantization
Jangwhan Lee, Minsoo Kim, Seungcheol Baek, Seok Joong Hwang, Wonyong Sung, Jungwook Choi
Improving Computational Efficiency for Powered Descent Guidance via Transformer-based Tight Constraint Prediction
Julia Briden, Trey Gurga, Breanna Johnson, Abhishek Cauligi, Richard Linares