Open Large Language Model
Open large language models (LLMs) are powerful AI systems trained on massive datasets to perform various natural language tasks, with a current research focus on improving their capabilities and accessibility. Active research areas include enhancing few-shot learning for specialized domains like drug discovery and translation, optimizing model architectures for efficiency and compression (e.g., through quantization), and developing methods to mitigate issues like semantic errors in generated text. The open-source nature of many LLMs fosters collaboration and accelerates progress, with significant implications for diverse fields ranging from healthcare (e.g., clinical note generation) to software engineering (e.g., code generation).
Papers
November 2, 2024
October 16, 2024
June 27, 2024
June 13, 2024
May 7, 2024
February 20, 2024
February 15, 2024
January 18, 2024
January 11, 2024
October 13, 2023
May 19, 2023