Emergent Ability
Emergent abilities in large language models (LLMs) refer to the sudden appearance of unexpected capabilities in larger models that are absent in smaller ones, defying simple extrapolations of performance. Current research focuses on understanding the underlying mechanisms driving this phenomenon, investigating factors like model size, training data, and pre-training loss, often using transformer-based architectures. This research is crucial for improving LLMs and for developing a deeper understanding of how complex capabilities arise in artificial systems, with implications for both AI safety and the development of more powerful and reliable AI tools for scientific research and other applications.
Papers
November 5, 2024
October 2, 2024
September 17, 2024
August 22, 2024
July 14, 2024
July 1, 2024
April 2, 2024
March 23, 2024
February 23, 2024
January 4, 2024
December 15, 2023
October 20, 2023
October 5, 2023
October 2, 2023
September 25, 2023
September 4, 2023
August 9, 2023
July 16, 2023
May 24, 2023
May 1, 2023