Model Capability
Model capability research focuses on understanding and evaluating the strengths and limitations of machine learning models, particularly large language models (LLMs), across diverse tasks. Current efforts concentrate on developing robust evaluation frameworks, including standardized benchmarks and qualitative assessments, and exploring techniques to enhance model expressivity through architectural innovations like novel attention mechanisms. This research is crucial for building trustworthy and reliable AI systems, informing responsible development and deployment across various applications, from robotics to healthcare.
Papers
November 18, 2024
November 5, 2024
October 8, 2024
October 6, 2024
September 19, 2024
September 8, 2024
September 1, 2024
June 12, 2024
May 14, 2024
April 7, 2024
February 16, 2024
November 21, 2023
October 25, 2023
October 17, 2023
September 2, 2023
August 28, 2023
August 1, 2023
June 14, 2023
May 16, 2023