Nearest Neighbor Language Model

Nearest neighbor language models (commonly abbreviated kNN-LMs) augment a trained language model with a large external datastore: at each prediction step, the model retrieves stored contexts similar to the current one and interpolates the next-token distribution implied by their continuations with its own prediction, so that p(w | x) = λ · p_kNN(w | x) + (1 − λ) · p_LM(w | x). Current research focuses on understanding the limitations of these models, notably their weakness on reasoning tasks despite strong memorization, and on improving their adaptability to new domains and their stylistic control through techniques such as datastore augmentation and learned rescoring of retrieved neighbors. The approach is a promising route to better language model performance, especially in applications that require access to a vast knowledge base or fine-grained stylistic control, but further work is needed to realize its full potential.
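
To make the interpolation concrete, here is a minimal NumPy sketch of the standard kNN-LM prediction step. It assumes a datastore of (key, value) pairs where each key is a context vector and each value is the token id that followed that context; the function name `knn_lm_probs`, the brute-force distance search, and the hyperparameters `k`, `temperature`, and `lam` are illustrative choices, not any particular paper's implementation (a real system would replace the exhaustive search with an approximate nearest neighbor index such as FAISS).

```python
import numpy as np

def softmax(x):
    """Numerically stable softmax over the last axis."""
    z = x - x.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def knn_lm_probs(query, lm_logits, keys, values, vocab_size,
                 k=8, temperature=1.0, lam=0.25):
    """Interpolate a base LM's next-token distribution with a
    retrieval distribution built from the k nearest datastore entries.

    query:     (d,) hidden state for the current context
    lm_logits: (vocab_size,) base model's next-token logits
    keys:      (n, d) stored context representations
    values:    (n,) token id observed after each stored context
    """
    # Brute-force L2 distances to every stored key; illustrative only.
    dists = np.linalg.norm(keys - query, axis=1)
    nn = np.argsort(dists)[:k]

    # Turn negative distances into weights over the k neighbors.
    weights = softmax(-dists[nn] / temperature)

    # Aggregate neighbor weights by the tokens they point to.
    p_knn = np.zeros(vocab_size)
    np.add.at(p_knn, values[nn], weights)

    # kNN-LM interpolation: p = lam * p_knn + (1 - lam) * p_lm.
    p_lm = softmax(lm_logits)
    return lam * p_knn + (1.0 - lam) * p_lm

# Usage on synthetic data: shapes and sizes are arbitrary.
rng = np.random.default_rng(0)
d, n, vocab = 16, 1000, 50
keys = rng.normal(size=(n, d))
values = rng.integers(0, vocab, size=n)
probs = knn_lm_probs(rng.normal(size=d), rng.normal(size=vocab),
                     keys, values, vocab)
assert np.isclose(probs.sum(), 1.0)
```

The interpolation weight λ controls how much the model trusts retrieved memory over its parametric prediction; the learned-rescoring work mentioned above can be viewed as replacing the fixed distance-based `weights` with a trained scoring function.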

Papers