Retrieval Augmentation
Retrieval augmentation enhances large language models (LLMs) by incorporating external knowledge sources to improve accuracy, reduce hallucinations, and handle long contexts. Current research focuses on optimizing retrieval methods (e.g., k-NN search, dense retrieval), integrating retrieved information effectively into LLMs (e.g., through modality fusion), and developing frameworks for managing and utilizing this external knowledge (e.g., dynamic retrieval triggered by model confidence). The approach is proving valuable across diverse applications, including question answering, text summarization, code generation, and even medical diagnosis, by improving factual accuracy and offsetting the limits of models that would otherwise rely solely on parametric knowledge.
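The pipeline described above can be sketched in a few lines. This is a toy illustration, not any paper's method: the bag-of-words "embedding", the cosine-similarity retriever, and the `adaptive_answer` confidence gate (with its hypothetical `confidence` score and threshold) are all stand-ins for the dense encoders and calibrated confidence estimators real systems use.

```python
# Toy sketch of retrieval augmentation: retrieve top-k passages for a query
# and prepend them to the prompt, so the model can answer from external
# knowledge instead of parametric memory alone.
import math
import re
from collections import Counter

def embed(text):
    # Stand-in for a dense encoder: bag-of-words term counts.
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query, corpus, k=2):
    # k-NN-style retrieval: rank passages by similarity to the query.
    q = embed(query)
    ranked = sorted(corpus, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:k]

def augment_prompt(query, corpus, k=2):
    context = "\n".join(retrieve(query, corpus, k))
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"

def adaptive_answer(query, corpus, confidence, threshold=0.7):
    # Dynamic retrieval: only consult external knowledge when the model's
    # (hypothetical) confidence score falls below a threshold.
    if confidence >= threshold:
        return f"Question: {query}\nAnswer:"
    return augment_prompt(query, corpus)

corpus = [
    "The Eiffel Tower is in Paris and was completed in 1889.",
    "Photosynthesis converts light energy into chemical energy.",
    "Paris is the capital of France.",
]
prompt = adaptive_answer("When was the Eiffel Tower completed?", corpus,
                         confidence=0.3)
```

With low confidence the gate triggers retrieval, so the prompt carries the passage containing "1889"; with `confidence=0.9` it would skip retrieval entirely.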
Papers
Persona-DB: Efficient Large Language Model Personalization for Response Prediction with Collaborative Data Refinement
Chenkai Sun, Ke Yang, Revanth Gangi Reddy, Yi R. Fung, Hou Pong Chan, ChengXiang Zhai, Heng Ji
Retrieve Only When It Needs: Adaptive Retrieval Augmentation for Hallucination Mitigation in Large Language Models
Hanxing Ding, Liang Pang, Zihao Wei, Huawei Shen, Xueqi Cheng