Zero Shot
Zero-shot learning aims to enable models to perform tasks on unseen data without any task-specific training, leveraging pre-trained knowledge to generalize to new situations. Current research focuses on improving zero-shot capabilities across diverse modalities (vision, language, audio) using large language models (LLMs), vision-language models (VLMs), and diffusion models, often incorporating techniques like chain-of-thought prompting, knowledge retrieval, and prompt engineering to enhance performance and interpretability. This field is significant because it promises more efficient and adaptable AI systems, impacting various applications from image editing and medical diagnosis to robotics and natural language processing.
Papers
Unlikelihood Tuning on Negative Samples Amazingly Improves Zero-Shot Translation
Changtong Zan, Liang Ding, Li Shen, Yibin Lei, Yibing Zhan, Weifeng Liu, Dacheng Tao
AutoCLIP: Auto-tuning Zero-Shot Classifiers for Vision-Language Models
Jan Hendrik Metzen, Piyapat Saranrittichai, Chaithanya Kumar Mummadi
D$^3$Fields: Dynamic 3D Descriptor Fields for Zero-Shot Generalizable Rearrangement
Yixuan Wang, Mingtong Zhang, Zhuoran Li, Tarik Kelestemur, Katherine Driggs-Campbell, Jiajun Wu, Li Fei-Fei, Yunzhu Li
Zero-Shot Reinforcement Learning from Low Quality Data
Scott Jeen, Tom Bewley, Jonathan M. Cullen
RankVicuna: Zero-Shot Listwise Document Reranking with Open-Source Large Language Models
Ronak Pradeep, Sahel Sharifymoghaddam, Jimmy Lin
Noise-Tolerant Unsupervised Adapter for Vision-Language Models
Eman Ali, Dayan Guan, Shijian Lu, Abdulmotaleb Elsaddik
Harnessing the Zero-Shot Power of Instruction-Tuned Large Language Model in End-to-End Speech Recognition
Yosuke Higuchi, Tetsuji Ogawa, Tetsunori Kobayashi
Augmenting Tactile Simulators with Real-like and Zero-Shot Capabilities
Osher Azulay, Alon Mizrahi, Nimrod Curtis, Avishai Sintov
Using fine-tuning and min lookahead beam search to improve Whisper
Andrea Do, Oscar Brown, Zhengjie Wang, Nikhil Mathew, Zixin Liu, Jawwad Ahmed, Cheng Yu
Corpus Synthesis for Zero-shot ASR domain Adaptation using Large Language Models
Hsuan Su, Ting-Yao Hu, Hema Swetha Koppula, Raviteja Vemulapalli, Jen-Hao Rick Chang, Karren Yang, Gautam Varma Mantena, Oncel Tuzel
Contrastive Learning for Enhancing Robust Scene Transfer in Vision-based Agile Flight
Jiaxu Xing, Leonard Bauersfeld, Yunlong Song, Chunwei Xing, Davide Scaramuzza
Selecting which Dense Retriever to use for Zero-Shot Search
Ekaterina Khramtsova, Shengyao Zhuang, Mahsa Baktashmotlagh, Xi Wang, Guido Zuccon