Retrieval Augmented Generation
Retrieval-Augmented Generation (RAG) enhances large language models (LLMs) by incorporating external knowledge sources to improve accuracy and address limitations like hallucinations. Current research focuses on optimizing retrieval strategies (e.g., using hierarchical graphs, attention mechanisms, or determinantal point processes for diverse and relevant information selection), improving the integration of retrieved information with LLM generation (e.g., through various prompting techniques and model adaptation methods), and mitigating biases and ensuring fairness in RAG systems. The impact of RAG is significant, improving performance on various tasks like question answering and enabling more reliable and contextually aware applications across diverse domains, including healthcare and scientific research.
Papers
Graph-Based Retriever Captures the Long Tail of Biomedical Knowledge
Julien Delile, Srayanta Mukherjee, Anton Van Pamel, Leonid Zhukov
Mafin: Enhancing Black-Box Embeddings with Model Augmented Fine-Tuning
Mingtian Zhang, Shawn Lan, Peter Hayes, David Barber
FeB4RAG: Evaluating Federated Search in the Context of Retrieval Augmented Generation
Shuai Wang, Ekaterina Khramtsova, Shengyao Zhuang, Guido Zuccon
Unveiling the Magic: Investigating Attention Distillation in Retrieval-augmented Generation
Zizhong Li, Haopeng Zhang, Jiawei Zhang
PoisonedRAG: Knowledge Poisoning Attacks to Retrieval-Augmented Generation of Large Language Models
Wei Zou, Runpeng Geng, Binghui Wang, Jinyuan Jia
CyberMetric: A Benchmark Dataset based on Retrieval-Augmented Generation for Evaluating LLMs in Cybersecurity Knowledge
Norbert Tihanyi, Mohamed Amine Ferrag, Ridhi Jain, Tamas Bisztray, Merouane Debbah
G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering
Xiaoxin He, Yijun Tian, Yifei Sun, Nitesh V. Chawla, Thomas Laurent, Yann LeCun, Xavier Bresson, Bryan Hooi
T-RAG: Lessons from the LLM Trenches
Masoomali Fatehkia, Ji Kim Lucas, Sanjay Chawla
DFA-RAG: Conversational Semantic Router for Large Language Model with Definite Finite Automaton
Yiyou Sun, Junjie Hu, Wei Cheng, Haifeng Chen
Enhancing Retrieval Processes for Language Generation with Augmented Queries
Julien Pierre Edmond Ghali, Kosuke Shima, Koichi Moriyama, Atsuko Mutoh, Nobuhiro Inuzuka
Financial Report Chunking for Effective Retrieval Augmented Generation
Antonio Jimeno Yepes, Yao You, Jan Milczek, Sebastian Laverde, Renyu Li
Multi-Lingual Malaysian Embedding: Leveraging Large Language Models for Semantic Representations
Husein Zolkepli, Aisyah Razak, Kamarul Adha, Ariff Nazhan
Enhancing Textbook Question Answering Task with Large Language Models and Retrieval Augmented Generation
Hessa Abdulrahman Alawwad, Areej Alhothali, Usman Naseem, Ali Alkhathlan, Amani Jamal
List-aware Reranking-Truncation Joint Model for Search and Retrieval-augmented Generation
Shicheng Xu, Liang Pang, Jun Xu, Huawei Shen, Xueqi Cheng