Semantic Segmentation
Semantic segmentation, the task of assigning a semantic label to each pixel in an image, aims to achieve precise pixel-level scene understanding. Current research emphasizes improving accuracy and efficiency across diverse data modalities (RGB, depth, lidar, hyperspectral, and time series) and challenging conditions (low light, adverse weather, imbalanced datasets), often employing advanced architectures like transformers and diffusion models alongside innovative loss functions and training strategies. This field is crucial for numerous applications, including autonomous driving, medical image analysis, remote sensing, and robotics, driving advancements in both model robustness and interpretability.
Papers
Deformable Mamba for Wide Field of View Segmentation
Jie Hu, Junwei Zheng, Jiale Wei, Jiaming Zhang, Rainer Stiefelhagen
A Study on Unsupervised Domain Adaptation for Semantic Segmentation in the Era of Vision-Language Models
Manuel Schwonberg, Claus Werner, Hanno Gottschalk, Carsten Meyer
A Performance Increment Strategy for Semantic Segmentation of Low-Resolution Images from Damaged Roads
Rafael S. Toledo, Cristiano S. Oliveira, Vitor H. T. Oliveira, Eric A. Antonelo, Aldo von Wangenheim
Learn from Foundation Model: Fruit Detection Model without Manual Annotation
Yanan Wang, Zhenghao Fei, Ruichen Li, Yibin Ying
SynDiff-AD: Improving Semantic Segmentation and End-to-End Autonomous Driving with Synthetic Data from Latent Diffusion Models
Harsh Goel, Sai Shankar Narasimhan, Oguzhan Akcin, Sandeep Chinchali
A Multimodal Approach Combining Structural and Cross-domain Textual Guidance for Weakly Supervised OCT Segmentation
Jiaqi Yang, Nitish Mehta, Xiaoling Hu, Chao Chen, Chia-Ling Tsai
SAM Carries the Burden: A Semi-Supervised Approach Refining Pseudo Labels for Medical Segmentation
Ron Keuth, Lasse Hansen, Maren Balks, Ronja Jäger, Anne-Nele Schröder, Ludger Tüshaus, Mattias Heinrich
ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements
M. Arda Aydın, Efe Mert Çırpar, Elvin Abdinli, Gozde Unal, Yusuf H. Sahin
Calibrated and Efficient Sampling-Free Confidence Estimation for LiDAR Scene Semantic Segmentation
Hanieh Shojaei Miandashti, Qianqian Zou, Claus Brenner
ULTra: Unveiling Latent Token Interpretability in Transformer Based Understanding
Hesam Hosseini, Ghazal Hosseini Mighan, Amirabbas Afzali, Sajjad Amini, Amir Houmansadr
Y-MAP-Net: Real-time depth, normals, segmentation, multi-label captioning and 2D human pose in RGB images
Ammar Qammaz, Nikolaos Vasilikopoulos, Iason Oikonomidis, Antonis A. Argyros
CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation
Dengke Zhang, Fagui Liu, Quan Tang