Text Embeddings
Text embeddings are numerical representations of text that capture semantic meaning, enabling computers to understand and process language. Current research focuses on improving the quality and controllability of these embeddings, particularly through techniques like contrastive learning, fine-tuning large language models (LLMs), and developing novel architectures to better handle complex prompts and disentangle attributes within embeddings. These advancements are crucial for various applications, including image generation, information retrieval, and sentiment analysis, improving the performance and efficiency of numerous natural language processing tasks.
Papers
DECOR:Decomposition and Projection of Text Embeddings for Text-to-Image Customization
Geonhui Jang, Jin-Hwa Kim, Yong-Hyun Park, Junho Kim, Gayoung Lee, Yonghyun Jeong
When Text Embedding Meets Large Language Model: A Comprehensive Survey
Zhijie Nie, Zhangchi Feng, Mingxin Li, Cunwang Zhang, Yanzhao Zhang, Dingkun Long, Richong Zhang
Towards Real-Time Open-Vocabulary Video Instance Segmentation
Bin Yan, Martin Sundermeyer, David Joseph Tan, Huchuan Lu, Federico Tombari
Pre-train, Align, and Disentangle: Empowering Sequential Recommendation with Large Language Models
Yuhao Wang, Junwei Pan, Xiangyu Zhao, Pengyue Jia, Wanyu Wang, Yuan Wang, Yue Liu, Dapeng Liu, Jie Jiang
CLIP meets DINO for Tuning Zero-Shot Classifier using Unlabeled Image Collections
Mohamed Fazli Imam, Rufael Fedaku Marew, Jameel Hassan, Mustansar Fiaz, Alham Fikri Aji, Hisham Cholakkal
Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation
Luca Barsellotti, Lorenzo Bianchi, Nicola Messina, Fabio Carrara, Marcella Cornia, Lorenzo Baraldi, Fabrizio Falchi, Rita Cucchiara