Document Relevance
Document relevance research focuses on efficiently identifying and utilizing information within large document collections to answer queries accurately. Current efforts concentrate on improving retrieval methods, particularly through large language models (LLMs) and graph-based approaches, and enhancing the effectiveness of retrieval-augmented generation (RAG) systems by addressing challenges like context compression and handling unanswerable questions. These advancements are crucial for improving information access in various applications, including search engines, eDiscovery, and clinical trial document generation, ultimately impacting efficiency and accuracy in knowledge-intensive tasks.
Papers
DocEDA: Automated Extraction and Design of Analog Circuits from Documents with Large Language Model
Hong Cai Chen, Longchang Wu, Ming Gao, Lingrui Shen, Jiarui Zhong, Yipin Xu
LLM Augmentations to support Analytical Reasoning over Multiple Documents
Raquib Bin Yousuf, Nicholas Defelice, Mandar Sharma, Shengzhe Xu, Naren Ramakrishnan
Drowning in Documents: Consequences of Scaling Reranker Inference
Mathew Jacob, Erik Lindgren, Matei Zaharia, Michael Carbin, Omar Khattab, Andrew Drozdov
A Pre-Trained Graph-Based Model for Adaptive Sequencing of Educational Documents
Jean Vassoyan (CB), Anan Schütt (UNIA), Jill-Jênn Vie (SODA), Arun-Balajiee Lekshmi-Narayanan (PITT), Elisabeth André (UNIA), Nicolas Vayatis (CB)
JudgeRank: Leveraging Large Language Models for Reasoning-Intensive Reranking
Tong Niu, Shafiq Joty, Ye Liu, Caiming Xiong, Yingbo Zhou, Semih Yavuz
Responsible Retrieval Augmented Generation for Climate Decision Making from Documents
Matyas Juhasz, Kalyan Dutia, Henry Franks, Conor Delahunty, Patrick Fawbert Mills, Harrison Pim