Open Model
Open models, encompassing open-source large language models (LLMs) and other generative AI, aim to foster transparency, reproducibility, and accessibility in artificial intelligence research and applications. Current research focuses on improving model performance across diverse tasks (including image generation, translation, and reasoning) through techniques like instruction tuning, model merging, and advanced prompt engineering, often employing architectures such as Mixture-of-Experts. The availability of open models facilitates broader participation in AI development, enabling independent verification, ethical scrutiny, and the creation of specialized models for various domains, particularly in resource-constrained settings.
Papers
September 30, 2024
August 7, 2024
August 4, 2024
July 31, 2024
May 27, 2024
May 20, 2024
May 8, 2024
May 3, 2024
April 18, 2024
March 20, 2024
March 13, 2024
March 3, 2024
February 26, 2024
January 29, 2024
January 13, 2024
December 30, 2023
October 30, 2023
October 16, 2023
June 6, 2023