Transformer Based Language Model
Transformer-based language models are deep learning architectures designed to process and generate human language, aiming to understand and replicate the nuances of natural language understanding and generation. Current research focuses on improving model interpretability, addressing contextualization errors, and exploring the internal mechanisms responsible for tasks like reasoning and factual recall, often using models like BERT and GPT variants. These advancements are significant for both the scientific community, furthering our understanding of neural networks and language processing, and for practical applications, enabling improvements in machine translation, question answering, and other NLP tasks.
Papers
February 26, 2024
February 23, 2024
February 22, 2024
February 21, 2024
February 20, 2024
February 3, 2024
January 30, 2024
January 25, 2024
January 22, 2024
January 16, 2024
January 15, 2024
January 8, 2024
December 16, 2023
December 5, 2023
November 22, 2023
November 1, 2023
October 25, 2023