Modern Language Model
Modern large language models (LLMs) are neural networks trained on massive text corpora to generate human-like text and perform a wide range of language tasks. Current research focuses on improving their efficiency (e.g., through MixAttention architectures), their reliability (e.g., via better hallucination detection and knowledge editing), and our understanding of how they learn (e.g., the role of in-context learning and the relationship between attention and Markov models). These advances matter because LLMs are transforming natural language processing, with applications ranging from search engines and chatbots to scientific research and clinical practice.
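The attention mechanism mentioned above is the core building block that work on efficiency and interpretability modifies or analyzes. As a point of reference, here is a minimal sketch of standard scaled dot-product attention in NumPy; it illustrates the baseline operation only, not the MixAttention variant or any specific paper's method, and the function name is illustrative.

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Standard attention: softmax(Q K^T / sqrt(d_k)) V."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                   # pairwise query-key similarities
    scores -= scores.max(axis=-1, keepdims=True)      # stabilize the softmax
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)    # row-wise softmax
    return weights @ V                                # weighted sum of value vectors

# Toy self-attention over 4 tokens with 8-dimensional embeddings.
rng = np.random.default_rng(0)
x = rng.normal(size=(4, 8))
out = scaled_dot_product_attention(x, x, x)
print(out.shape)  # (4, 8)
```

Each output row is a convex combination of the value vectors, with weights determined by query-key similarity; research on efficiency largely targets the cost of computing and caching these weights.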
Papers
Nineteen related papers on this topic, dated May 3, 2022 through August 25, 2023.