ColBERT XM
ColBERT XM is a modular, multi-vector retrieval model designed to improve multilingual information retrieval, particularly for low-resource languages. Research focuses on enhancing its efficiency and effectiveness through techniques like improved training pipelines, aggressive compression methods, and token pruning to reduce storage needs while maintaining accuracy. This work addresses the limitations of existing models that struggle with multilingual data and resource constraints, ultimately aiming to make information retrieval more accessible and sustainable across diverse languages.
Papers
August 29, 2024
August 20, 2024
April 11, 2024
February 23, 2024
March 24, 2022
December 13, 2021