Universal Visual Perception

Universal visual perception aims to create single, adaptable computer vision systems capable of performing diverse tasks, such as object detection, segmentation, and pose estimation, across various domains. Current research focuses on developing unified model architectures, often based on transformers, that can process visual data and associated textual prompts to achieve this versatility through techniques like few-shot learning and point-based representations. This pursuit promises to significantly streamline the development of computer vision applications and improve their generalizability, impacting fields ranging from automated animal monitoring to broader image understanding tasks.

Papers

December 4, 2023

Aligning and Prompting Everything All at Once for Universal Visual Perception
Yunhang Shen, Chaoyou Fu, Peixian Chen, Mengdan Zhang, Ke Li, Xing Sun, Yunsheng Wu, Shaohui Lin, Rongrong Ji
Visual Grounding Cross Modality Fusion Universal Visual Perception

August 19, 2023

UniAP: Towards Universal Animal Perception in Vision via Few-shot Learning
Meiqi Sun, Zhonghan Zhao, Wenhao Chai, Hanjun Luo, Shidong Cao, Yanting Zhang, Jenq-Neng Hwang, Gaoang Wang
Shot Learning Vision Paper Visual Perception Deep Learning Based Perception Universal Visual Perception

August 18, 2022

Unifying Visual Perception by Dispersible Points Learning
Jianming Liang, Guanglu Song, Biao Leng, Yu Liu
Visual Task Learning Based Point Universal Visual Perception

Universal Visual Perception

Papers

Aligning and Prompting Everything All at Once for Universal Visual Perception

UniAP: Towards Universal Animal Perception in Vision via Few-shot Learning

Unifying Visual Perception by Dispersible Points Learning