Semantic Segmentation
Semantic segmentation, the task of assigning a semantic label to each pixel in an image, aims to achieve precise pixel-level scene understanding. Current research emphasizes improving accuracy and efficiency across diverse data modalities (RGB, depth, lidar, hyperspectral, and time series) and challenging conditions (low light, adverse weather, imbalanced datasets), often employing advanced architectures like transformers and diffusion models alongside innovative loss functions and training strategies. This field is crucial for numerous applications, including autonomous driving, medical image analysis, remote sensing, and robotics, driving advancements in both model robustness and interpretability.
Papers
AI-SAM: Automatic and Interactive Segment Anything Model
Yimu Pan, Sitao Zhang, Alison D. Gernand, Jeffery A. Goldstein, James Z. Wang
DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control
Yuru Jia, Lukas Hoyer, Shengyu Huang, Tianfu Wang, Luc Van Gool, Konrad Schindler, Anton Obukhov
SAM-Assisted Remote Sensing Imagery Semantic Segmentation with Object and Boundary Constraints
Xianping Ma, Qianqian Wu, Xingyu Zhao, Xiaokang Zhang, Man-On Pun, Bo Huang
Towards Granularity-adjusted Pixel-level Semantic Annotation
Rohit Kundu, Sudipta Paul, Rohit Lal, Amit K. Roy-Chowdhury
Class-Discriminative Attention Maps for Vision Transformers
Lennart Brocki, Jakub Binda, Neo Christopher Chung
Strong but simple: A Baseline for Domain Generalized Dense Perception by CLIP-based Transfer Learning
Christoph Hümmer, Manuel Schwonberg, Liangwei Zhou, Hu Cao, Alois Knoll, Hanno Gottschalk
Few Clicks Suffice: Active Test-Time Adaptation for Semantic Segmentation
Longhui Yuan, Shuang Li, Zhuo He, Binhui Xie
Contrastive Learning-Based Spectral Knowledge Distillation for Multi-Modality and Missing Modality Scenarios in Semantic Segmentation
Aniruddh Sikdar, Jayant Teotia, Suresh Sundaram
SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference
Feng Wang, Jieru Mei, Alan Yuille
Virtual Category Learning: A Semi-Supervised Learning Method for Dense Prediction with Extremely Limited Labels
Changrui Chen, Jungong Han, Kurt Debattista
Semantic segmentation of SEM images of lower bainitic and tempered martensitic steels
Xiaohan Bie, Manoj Arthanari, Evelin Barbosa de Melo, Juancheng Li, Stephen Yue, Salim Brahimi, Jun Song
Sequential Modeling Enables Scalable Learning for Large Vision Models
Yutong Bai, Xinyang Geng, Karttikeya Mangalam, Amir Bar, Alan Yuille, Trevor Darrell, Jitendra Malik, Alexei A Efros
Efficient Multimodal Semantic Segmentation via Dual-Prompt Learning
Shaohua Dong, Yunhe Feng, Qing Yang, Yan Huang, Dongfang Liu, Heng Fan
A Lightweight Clustering Framework for Unsupervised Semantic Segmentation
Yau Shing Jonathan Cheung, Xi Chen, Lihe Yang, Hengshuang Zhao
MRFP: Learning Generalizable Semantic Segmentation from Sim-2-Real with Multi-Resolution Feature Perturbation
Sumanth Udupa, Prajwal Gurunath, Aniruddh Sikdar, Suresh Sundaram
ALSTER: A Local Spatio-Temporal Expert for Online 3D Semantic Reconstruction
Silvan Weder, Francis Engelmann, Johannes L. Schönberger, Akihito Seki, Marc Pollefeys, Martin R. Oswald
Continual Learning for Image Segmentation with Dynamic Query
Weijia Wu, Yuzhong Zhao, Zhuang Li, Lianlei Shan, Hong Zhou, Mike Zheng Shou