Document Relevance
Document relevance research focuses on efficiently identifying and utilizing information within large document collections to answer queries accurately. Current efforts concentrate on improving retrieval methods, particularly through large language models (LLMs) and graph-based approaches, and enhancing the effectiveness of retrieval-augmented generation (RAG) systems by addressing challenges like context compression and handling unanswerable questions. These advancements are crucial for improving information access in various applications, including search engines, eDiscovery, and clinical trial document generation, ultimately impacting efficiency and accuracy in knowledge-intensive tasks.
Papers
It's All Relative: Interpretable Models for Scoring Bias in Documents
Aswin Suresh, Chi-Hsuan Wu, Matthias Grossglauser
DocTr: Document Transformer for Structured Information Extraction in Documents
Haofu Liao, Aruni RoyChowdhury, Weijian Li, Ankan Bansal, Yuting Zhang, Zhuowen Tu, Ravi Kumar Satzoda, R. Manmatha, Vijay Mahadevan