Multi Task Model
Multi-task models aim to train a single model capable of performing multiple tasks simultaneously, improving efficiency and generalization compared to training separate models for each task. Current research focuses on developing effective architectures and algorithms, including transformer-based models, mixture-of-experts, and various model merging techniques like task arithmetic and weight averaging, to address challenges such as catastrophic forgetting and representation bias. This field is significant because it offers improved resource utilization and enhanced performance across diverse applications, ranging from medical image analysis and natural language processing to robotics and recommender systems.
Papers
April 28, 2023
April 17, 2023
March 14, 2023
February 21, 2023
February 17, 2023
November 29, 2022
November 7, 2022
October 20, 2022
October 13, 2022
June 17, 2022
May 25, 2022
May 6, 2022
April 27, 2022
April 17, 2022
April 16, 2022
April 9, 2022
April 3, 2022
March 28, 2022