Decoder-Only LLM
Decoder-only large language models (LLMs) are a rapidly evolving area of research. These models generate text autoregressively with a single Transformer decoder stack, dispensing with the separate encoder used in encoder-decoder architectures. Current research focuses on extending context length through techniques such as parallel decoding and efficient memory management, on mitigating hallucinations, and on improving performance on tasks such as machine translation and question answering. These advances promise more efficient and effective LLMs across diverse applications, including speech processing, computer vision, and code generation, and they also inform work on LLM architecture and training methodology more broadly.