Dense Retrieval
Dense retrieval aims to efficiently find relevant information within massive datasets by representing both queries and documents as dense vectors, enabling faster and more accurate similarity searches compared to traditional methods. Current research focuses on improving model architectures like bi-encoders and late-interaction models, exploring techniques such as knowledge distillation, multimodal retrieval (incorporating speech), and efficient training strategies to handle large-scale data and stale embeddings. These advancements are significantly impacting information retrieval applications, particularly in open-domain question answering, conversational search, and enterprise search, by enhancing both speed and accuracy.
Papers
Precise Zero-Shot Dense Retrieval without Relevance Labels
Luyu Gao, Xueguang Ma, Jimmy Lin, Jamie Callan
Fine-Grained Distillation for Long Document Retrieval
Yucheng Zhou, Tao Shen, Xiubo Geng, Chongyang Tao, Guodong Long, Can Xu, Daxin Jiang
What Are You Token About? Dense Retrieval as Distributions Over the Vocabulary
Ori Ram, Liat Bezalel, Adi Zicher, Yonatan Belinkov, Jonathan Berant, Amir Globerson
Unified Generative & Dense Retrieval for Query Rewriting in Sponsored Search
Akash Kumar Mohankumar, Bhargav Dodla, Gururaj K, Amit Singh
Non-Parametric Temporal Adaptation for Social Media Topic Classification
Fatemehsadat Mireshghallah, Nikolai Vogler, Junxian He, Omar Florez, Ahmed El-Kishky, Taylor Berg-Kirkpatrick