Noisy Correspondence

Noisy correspondence, the presence of mismatched data pairs in cross-modal datasets, hinders accurate cross-modal retrieval and matching. Current research focuses on developing robust methods to identify and mitigate these mismatches, employing techniques like geometrical structure consistency learning, dual attention mechanisms, and memory banks to improve the reliability of learned correspondences. These advancements are crucial for improving the performance of various applications, including image-text retrieval, point cloud matching, and person re-identification, where noisy data is prevalent and impacts the accuracy of learned models. The ultimate goal is to enable reliable cross-modal learning even with imperfect or automatically collected data.

Papers