Text Based Person Retrieval
Text-based person retrieval (TPR) focuses on identifying individuals in images based solely on textual descriptions, bridging the gap between visual and linguistic information. Current research emphasizes improving model architectures by incorporating bidirectional embeddings, leveraging large language models for data augmentation, and refining alignment techniques to capture both positive and negative attributes within descriptions, often using contrastive learning methods. These advancements aim to enhance retrieval accuracy and robustness, particularly addressing challenges like limited training data and the inherent differences between visual and textual representations. Improved TPR has significant implications for various applications, including security, law enforcement, and multimedia search.
Papers
Look Before You Leap: Improving Text-based Person Retrieval by Learning A Consistent Cross-modal Common Manifold
Zijie Wang, Aichun Zhu, Jingyi Xue, Xili Wan, Chao Liu, Tian Wang, Yifeng Li
CAIBC: Capturing All-round Information Beyond Color for Text-based Person Retrieval
Zijie Wang, Aichun Zhu, Jingyi Xue, Xili Wan, Chao Liu, Tian Wang, Yifeng Li