Causal Language
Causal language modeling focuses on predicting the next word in a sequence, forming the basis for many large language models (LLMs). Current research emphasizes improving efficiency and knowledge acquisition in these models, exploring techniques like retrieval-based methods, attention mechanism modifications (e.g., masked mixers), and data augmentation strategies to enhance performance and address limitations such as the "reversal curse" and order sensitivity. This field is significant because advancements in causal language modeling directly impact the capabilities of LLMs across diverse applications, from text generation and translation to question answering and specialized domain expertise.
Papers
May 15, 2024
April 2, 2024
March 30, 2024
March 21, 2024
March 1, 2024
February 23, 2024
February 22, 2024
February 19, 2024
January 22, 2024
December 23, 2023
December 18, 2023
November 13, 2023
October 9, 2023
September 16, 2023
September 12, 2023
August 16, 2023
August 14, 2023
May 3, 2023