Pixel Level
Pixel-level analysis focuses on understanding and manipulating individual pixels within images and videos, aiming for precise object segmentation and detailed scene understanding. Current research emphasizes developing efficient and robust methods for pixel-level tasks, particularly in weakly or unsupervised settings, often leveraging transformer architectures, diffusion models, and foundation models like SAM and CLIP to reduce reliance on extensive manual annotation. This work has significant implications for diverse applications, including remote sensing, medical image analysis, and computer vision generally, by enabling more accurate and automated image interpretation and manipulation.
Papers
BoxMask: Revisiting Bounding Box Supervision for Video Object Detection
Khurram Azeem Hashmi, Alain Pagani, Didier Stricker, Muhammamd Zeshan Afzal
ACSeg: Adaptive Conceptualization for Unsupervised Semantic Segmentation
Kehan Li, Zhennan Wang, Zesen Cheng, Runyi Yu, Yian Zhao, Guoli Song, Chang Liu, Li Yuan, Jie Chen