Emergent Ability
Emergent abilities in large language models (LLMs) refer to the sudden appearance of unexpected capabilities in larger models that are absent in smaller ones, defying simple extrapolations of performance. Current research focuses on understanding the underlying mechanisms driving this phenomenon, investigating factors like model size, training data, and pre-training loss, often using transformer-based architectures. This research is crucial for improving LLMs and for developing a deeper understanding of how complex capabilities arise in artificial systems, with implications for both AI safety and the development of more powerful and reliable AI tools for scientific research and other applications.
Papers
January 10, 2025
January 3, 2025
January 2, 2025
December 10, 2024
November 25, 2024
November 19, 2024
November 18, 2024
November 5, 2024
October 2, 2024
September 17, 2024
August 22, 2024
July 14, 2024
July 1, 2024
April 2, 2024
March 23, 2024
February 23, 2024
January 4, 2024
December 15, 2023
October 20, 2023