Multi Task Model
Multi-task models aim to train a single model capable of performing multiple tasks simultaneously, improving efficiency and generalization compared to training separate models for each task. Current research focuses on developing effective architectures and algorithms, including transformer-based models, mixture-of-experts, and various model merging techniques like task arithmetic and weight averaging, to address challenges such as catastrophic forgetting and representation bias. This field is significant because it offers improved resource utilization and enhanced performance across diverse applications, ranging from medical image analysis and natural language processing to robotics and recommender systems.
Papers
January 17, 2024
January 12, 2024
December 11, 2023
November 30, 2023
November 26, 2023
November 16, 2023
November 2, 2023
October 23, 2023
October 7, 2023
October 3, 2023
August 23, 2023
August 22, 2023
August 11, 2023
July 29, 2023
July 18, 2023
June 29, 2023
May 25, 2023
May 20, 2023