Adaptive Mixture
Adaptive Mixture models, a class of machine learning architectures, aim to improve efficiency and performance by selectively activating specialized sub-models (experts) based on input characteristics. Current research focuses on developing adaptive gating mechanisms to dynamically select experts, often employing low-rank adaptation (LoRA) for efficient parameterization and leveraging pre-trained dense models to accelerate training. These techniques are being applied across diverse fields, including natural language processing, computer vision, and federated learning, demonstrating improvements in accuracy, resource efficiency, and fairness while addressing challenges like data heterogeneity and model drift.
Papers
November 3, 2024
June 28, 2024
June 7, 2024
May 1, 2024
April 27, 2024
January 4, 2024
November 9, 2023
October 10, 2022
July 20, 2022